Single model workers configs should mean a less aggressive memory cleanup scheme #140

tazlin · 2024-02-25T20:06:34Z

The primary intent behind leaving a certain amount of system RAM free is to provide a cushion for other, potentially very large models (such as SDXL models) to load. However, when the worker is configured to run only a single model, memory conditions become much more predictable, and the worker will fail anyway if an OOM occurs.

If the worker has only one model:
- If the model has only a single model file
  - Keep the model entirely in VRAM 100% of the time
- If the model is made up of multiple models (as is the case with Stable Cascade)
  - Avoid offloading to disk if possible, swapping the models only between RAM and VRAM.
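
The decision described above could be sketched roughly as follows. This is only an illustration of the proposal, not the worker's actual code: `CleanupPolicy`, `choose_cleanup_policy`, and both of its parameters are hypothetical names invented for this example.

```python
from enum import Enum


class CleanupPolicy(Enum):
    """Hypothetical memory policies; these names are not from the worker."""

    DEFAULT = "default"              # keep the usual free-RAM cushion
    PIN_IN_VRAM = "pin_in_vram"      # never unload the model from VRAM
    SWAP_RAM_VRAM = "swap_ram_vram"  # move parts between RAM and VRAM, avoid disk


def choose_cleanup_policy(
    models_to_load: list[str],
    files_per_model: dict[str, int],
) -> CleanupPolicy:
    """Pick a policy from the configured models (assumed inputs).

    models_to_load: model names the worker is configured to serve.
    files_per_model: assumed mapping of model name to its number of model files.
    """
    # Several configured models: memory use is unpredictable, keep the cushion.
    if len(models_to_load) != 1:
        return CleanupPolicy.DEFAULT

    only_model = models_to_load[0]
    # Unknown models are treated as single-file (an assumption of this sketch).
    if files_per_model.get(only_model, 1) <= 1:
        return CleanupPolicy.PIN_IN_VRAM

    # Multi-part model (for example Stable Cascade): swap between RAM and VRAM.
    return CleanupPolicy.SWAP_RAM_VRAM
```

With a single multi-part model configured, this sketch selects the RAM/VRAM swapping policy, and any configuration with more than one model falls back to the existing behaviour.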

If failures occur in this situation, it's likely the model overhead would only be encouraging the worker to run in very poor memory conditions (as it would constantly be loading from disk for little to no reason).

tazlin added the enhancement New feature or request label Feb 25, 2024

db0 transferred this issue from Haidra-Org/horde-worker-reGen Feb 28, 2024

db0 transferred this issue from Haidra-Org/AI-Horde-image-model-reference Feb 28, 2024
