ComfyUI/comfy
rattus 513b0c46fb
Add RAM Pressure cache mode (#10454)
* execution: Roll the UI cache into the outputs

Currently the UI cache sits parallel to the output cache, with the
expectation that it is a content superset of the output cache. At the
same time, the UI and output caches are maintained completely
separately, making it awkward to free output cache content without
changing the behaviour of the UI cache.

There are two actual users (getters) of the UI cache. The first is
the case of a direct content hit on the output cache when executing a
node. This case is handled very naturally by merging the UI and output
caches.

The second case is the history JSON generation at the end of the prompt.
This currently works by asking the cache for all_node_ids and then
pulling the cache contents for those nodes; all_node_ids is the set of
nodes in the dynamic prompt.

So fold the UI cache into the output cache. The current UI cache setter
now writes to a prompt-scoped dict. When the output cache is set, the
value is fetched from this dict and tupled up with the outputs.

When generating the history, simply iterate the prompt-scoped dict.

This prepares support for more complex caching strategies (like RAM
pressure caching) where less than one full workflow will be cached and
it will be desirable to keep the UI cache and output cache in sync.
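The fold described above can be sketched roughly as follows. Class and
method names here are hypothetical stand-ins, not the actual ComfyUI
execution code:

```python
class OutputCache:
    """Sketch: the UI cache folded into the output cache.

    All names are illustrative; the real execution.py differs.
    """
    def __init__(self):
        self._store = {}
        # Prompt-scoped dict the old UI-cache setter now writes to.
        self.ui_outputs = {}

    def set_ui(self, node_id, ui_data):
        # UI setter: record UI data per node for the current prompt.
        self.ui_outputs[node_id] = ui_data

    def set(self, node_id, outputs):
        # Output-cache setter: pick up any UI data recorded for this
        # node and tuple it up with the outputs as a single entry.
        self._store[node_id] = (outputs, self.ui_outputs.get(node_id))

    def get(self, node_id):
        # A direct content hit returns outputs and UI data together,
        # covering the first user of the old UI cache.
        return self._store.get(node_id)


def build_history(cache):
    # The second user, history generation, just iterates the
    # prompt-scoped dict instead of asking the cache for all_node_ids.
    return {nid: ui for nid, ui in cache.ui_outputs.items() if ui is not None}
```

Freeing an output-cache entry now frees its UI data with it, which is
the property the later RAM-pressure cache relies on.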

* sd: Implement RAM getter for VAE

* model_patcher: Implement RAM getter for ModelPatcher

* sd: Implement RAM getter for CLIP
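A RAM getter for an object like a VAE or CLIP amounts to summing the
bytes of the tensors it holds. The sketch below is hypothetical (the
real getters live in sd.py and model_patcher.py and walk actual torch
tensors); it only shows the shape of the computation:

```python
from functools import reduce
import operator

# Bytes per element for a few common dtypes (illustrative subset).
DTYPE_SIZES = {"float32": 4, "float16": 2, "bfloat16": 2, "float8": 1}


def tensor_ram_bytes(shape, dtype):
    # Element count times element size approximates the host RAM
    # backing one weight tensor.
    numel = reduce(operator.mul, shape, 1)
    return numel * DTYPE_SIZES[dtype]


class FakeVAE:
    """Hypothetical stand-in for a VAE holding weight tensors."""
    def __init__(self, tensors):
        self.tensors = tensors  # list of (shape, dtype) pairs

    def memory_used(self):
        # The RAM getter: total bytes across all weight tensors.
        return sum(tensor_ram_bytes(s, d) for s, d in self.tensors)
```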

* Implement RAM Pressure cache

Implement a cache sensitive to RAM pressure. When RAM headroom drops
below a certain threshold, evict RAM-expensive nodes from the
cache.

Models and tensors are measured directly for RAM usage. An OOM score
is then computed based on the RAM usage of the node.

Note that due to indirection through shared objects (like a model
patcher), multiple nodes can account the same RAM as their individual
usage. The intent is that this will free chains of nodes, particularly
model loaders and associated loras, as they all score similarly and
sort close to each other.

The cache is biased towards unloading model nodes mid-flow while being
able to keep results like text encodings and VAE outputs.
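The eviction pass can be sketched as below. This is an assumed shape,
not the actual implementation: real headroom would come from the OS,
the OOM score may weight more than raw bytes, and shared objects mean
the bytes "freed" here are nominal until the last reference drops:

```python
def evict_for_ram(cache_ram_usage, headroom_bytes, min_headroom_bytes):
    """Hypothetical RAM-pressure eviction sketch.

    cache_ram_usage: dict of node_id -> estimated RAM bytes (the OOM
    score) for that node's cached outputs.
    Returns the node ids to evict, most expensive first.
    """
    evicted = []
    if headroom_bytes >= min_headroom_bytes:
        return evicted  # no pressure, nothing to do
    # Sort by score descending: nodes sharing a model (loader + loras)
    # score alike, sort next to each other, and get freed as a chain.
    ranked = sorted(cache_ram_usage.items(), key=lambda kv: kv[1], reverse=True)
    for node_id, score in ranked:
        if headroom_bytes >= min_headroom_bytes:
            break
        evicted.append(node_id)
        headroom_bytes += score  # nominal: shared RAM may be overcounted
    return evicted
```

Because model loaders dominate the scores, this naturally unloads model
nodes first while cheap results like text encodings survive.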

* execution: Convert the cache entry to NamedTuple

As commented in review.

Convert this to a NamedTuple and abstract the tuple type away from
graph.py completely.
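A minimal sketch of that conversion, with illustrative field names (the
actual fields in execution.py may differ):

```python
from typing import Any, NamedTuple


class CacheEntry(NamedTuple):
    # Field names are illustrative; the point is that callers use
    # attribute access, so graph.py never sees the tuple layout.
    outputs: Any
    ui: Any


entry = CacheEntry(outputs=([1.0],), ui={"images": []})
# Attribute access replaces positional indexing like entry[0]/entry[1],
# while the entry still behaves as a plain tuple where needed.
```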
2025-10-30 17:39:02 -04:00
audio_encoders Support the HuMo model. (#9903) 2025-09-17 00:12:48 -04:00
cldm Replace print with logging (#6138) 2024-12-20 16:24:55 -05:00
comfy_types LoRA Trainer: LoRA training node in weight adapter scheme (#8446) 2025-06-13 19:25:59 -04:00
extra_samplers Uni pc sampler now works with audio and video models. 2025-01-18 05:27:58 -05:00
image_encoders Add Hunyuan 3D 2.1 Support (#8714) 2025-09-04 20:36:20 -04:00
k_diffusion Fix depending on asserts to raise an exception in BatchedBrownianTree and Flash attn module (#9884) 2025-09-15 20:05:03 -04:00
ldm Fix batch size above 1 giving bad output in chroma radiance. (#10394) 2025-10-18 23:15:34 -04:00
sd1_tokenizer Silence clip tokenizer warning. (#8934) 2025-07-16 14:42:07 -04:00
t2i_adapter Controlnet refactor. 2024-06-27 18:43:11 -04:00
taesd Improvements to the TAESD3 implementation. 2024-06-16 02:04:24 -04:00
text_encoders Implement gemma 3 as a text encoder. (#10241) 2025-10-06 22:08:08 -04:00
weight_adapter Fix LoRA Trainer bugs with FP8 models. (#9854) 2025-09-20 21:24:48 -04:00
checkpoint_pickle.py Remove pytorch_lightning dependency. 2023-06-13 10:11:33 -04:00
cli_args.py Add RAM Pressure cache mode (#10454) 2025-10-30 17:39:02 -04:00
clip_config_bigg.json Fix potential issue with non clip text embeddings. 2024-07-30 14:41:13 -04:00
clip_model.py USO style reference. (#9677) 2025-09-02 15:36:22 -04:00
clip_vision_config_g.json Add support for clip g vision model to CLIPVisionLoader. 2023-08-18 11:13:29 -04:00
clip_vision_config_h.json Add support for unCLIP SD2.x models. 2023-04-01 23:19:15 -04:00
clip_vision_config_vitl_336_llava.json Support llava clip vision model. 2025-03-06 00:24:43 -05:00
clip_vision_config_vitl_336.json support clip-vit-large-patch14-336 (#4042) 2024-07-17 13:12:50 -04:00
clip_vision_config_vitl.json Add support for unCLIP SD2.x models. 2023-04-01 23:19:15 -04:00
clip_vision_siglip_384.json Support new flux model variants. 2024-11-21 08:38:23 -05:00
clip_vision_siglip_512.json Support 512 siglip model. 2025-04-05 07:01:01 -04:00
clip_vision.py Some changes to the previous hunyuan PR. (#9725) 2025-09-04 20:39:02 -04:00
conds.py Add some warnings and prevent crash when cond devices don't match. (#9169) 2025-08-04 04:20:12 -04:00
context_windows.py Make step index detection much more robust (#9392) 2025-08-17 18:54:07 -04:00
controlnet.py Fix Race condition in --async-offload that can cause corruption (#10501) 2025-10-29 17:17:46 -04:00
diffusers_convert.py Remove useless code. 2025-01-24 06:15:54 -05:00
diffusers_load.py load_unet -> load_diffusion_model with a model_options argument. 2024-08-12 23:20:57 -04:00
float.py Clamp output when rounding weight to prevent Nan. 2024-10-19 19:07:10 -04:00
gligen.py Remove some useless code. (#8812) 2025-07-06 07:07:39 -04:00
hooks.py Hooks Part 2 - TransformerOptionsHook and AdditionalModelsHook (#6377) 2025-01-11 12:20:23 -05:00
latent_formats.py Add support for Chroma Radiance (#9682) 2025-09-13 17:58:43 -04:00
lora_convert.py Implement the USO subject identity lora. (#9674) 2025-09-01 18:54:02 -04:00
lora.py Support the omnigen2 umo lora. (#9886) 2025-09-15 18:10:55 -04:00
model_base.py Mixed Precision Quantization System (#10498) 2025-10-28 16:20:53 -04:00
model_detection.py Mixed Precision Quantization System (#10498) 2025-10-28 16:20:53 -04:00
model_management.py Fix Race condition in --async-offload that can cause corruption (#10501) 2025-10-29 17:17:46 -04:00
model_patcher.py Add RAM Pressure cache mode (#10454) 2025-10-30 17:39:02 -04:00
model_sampling.py Refactor model sampling sigmas code. (#10250) 2025-10-08 17:49:02 -04:00
nested_tensor.py WIP way to support multi multi dimensional latents. (#10456) 2025-10-23 21:21:14 -04:00
ops.py Fix small performance regression with fp8 fast and scaled fp8. (#10537) 2025-10-29 19:29:01 -04:00
options.py Only parse command line args when main.py is called. 2023-09-13 11:38:20 -04:00
patcher_extension.py Fix order of inputs nested merge_nested_dicts (#10362) 2025-10-15 16:47:26 -07:00
pixel_space_convert.py Changes to the previous radiance commit. (#9851) 2025-09-13 18:03:34 -04:00
quant_ops.py Fix small performance regression with fp8 fast and scaled fp8. (#10537) 2025-10-29 19:29:01 -04:00
rmsnorm.py Add warning when using old pytorch. (#9347) 2025-08-15 00:22:26 -04:00
sample.py Fix mistake. (#10484) 2025-10-25 23:07:29 -04:00
sampler_helpers.py Added context window support to core sampling code (#9238) 2025-08-13 21:33:05 -04:00
samplers.py WIP way to support multi multi dimensional latents. (#10456) 2025-10-23 21:21:14 -04:00
sd1_clip_config.json Fix potential issue with non clip text embeddings. 2024-07-30 14:41:13 -04:00
sd1_clip.py Disable prompt weights for qwen. (#9438) 2025-08-20 01:08:11 -04:00
sd.py Add RAM Pressure cache mode (#10454) 2025-10-30 17:39:02 -04:00
sdxl_clip.py Add a T5TokenizerOptions node to set options for the T5 tokenizer. (#7803) 2025-04-25 19:36:00 -04:00
supported_models_base.py Mixed Precision Quantization System (#10498) 2025-10-28 16:20:53 -04:00
supported_models.py Lower wan memory estimation value a bit. (#9964) 2025-09-20 22:09:35 -04:00
utils.py WIP way to support multi multi dimensional latents. (#10456) 2025-10-23 21:21:14 -04:00