EasyAI Code Hosting Platform

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-06-17 13:30:16 +08:00

Jedrzej Kosinski	bece6b2aec	2026-05-23 19:11:48 -07:00

multigpu: refactor deepclone_multigpu + register cached_patcher_init for CLIP/VAE; Select*Device retargets via deepclone

- ModelPatcher.deepclone_multigpu: remove copy.deepcopy fallback. Require cached_patcher_init (raise a descriptive RuntimeError if missing) and always go through clone(model_override=...) with empty backup containers so the per-device clone owns a pristine, unpatched module instead of a deepcopy of an already-loaded/already-patched one. Also call register_load_device on the new patcher so ModelPatcherDynamic per-device bookkeeping (e.g. dynamic_pins) is populated for the requested load device.
- comfy/sd.py: register cached_patcher_init on the CLIP and VAE patchers returned by load_checkpoint_guess_config, and on the patcher returned by load_diffusion_model's companion paths. Add load_checkpoint_clip_patcher, load_checkpoint_vae_patcher, and load_vae_patcher reload helpers so the same loader context can be reused to produce per-device clones.
- nodes.py: VAELoader registers cached_patcher_init on the produced VAE's patcher when there is a single backing file (skip for pixel_space and composite image-TAESDs which aren't addressable by a single path).
- comfy_extras/nodes_multigpu.py: SelectModelDevice / SelectCLIPDevice / SelectVAEDevice now retarget via deepclone_multigpu when the requested device differs from the current load_device, so the consumed model is not just relabeled but actually rehomed onto the chosen device.

Verified on runner-2 (2x RTX 4090, comfy-aimdo 0.4.4):
- 10/10 focused unit tests (deepclone behavior, missing-factory error path, SelectDevice behavior).
- Device-switch-after-consumption end-to-end (SD1.5) produces bit-identical PNGs on cuda:0 and cuda:1.
- Z Image multigpu CFG split: ~1.90x speedup (10.5s vs 19.9s steady).
- Qwen Image multigpu CFG split (real text negative, cfg=4): ~1.69x speedup (32.5s vs 54.8s steady) -- matches pre-refactor numbers.
- Baseline (patch stashed) and patched produce identical timings on both models, so the refactor is performance-neutral.

Amp-Thread-ID: https://ampcode.com/threads/T-019e5783-b810-74b1-8ca9-09d675de1479
Co-authored-by: Amp <amp@ampcode.com>
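
The pattern the commit message describes (rebuild a pristine model from a registered factory instead of deep-copying a patched one, fail loudly when no factory is registered, and only retarget when the requested device differs) can be illustrated with a minimal, self-contained sketch. This is not ComfyUI's code: ToyPatcher, cached_init, deepclone_to, and select_device are hypothetical names chosen for illustration, and devices are plain strings rather than torch devices.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Optional


@dataclass
class ToyPatcher:
    """Hypothetical stand-in for a model patcher (not ComfyUI's ModelPatcher)."""
    model: Any
    load_device: str
    # Assumed analogue of cached_patcher_init: a factory that rebuilds
    # a fresh, unpatched model from the original loader context.
    cached_init: Optional[Callable[[], Any]] = None
    # Backups of original weights taken when patches are applied.
    backup: dict = field(default_factory=dict)

    def deepclone_to(self, device: str) -> "ToyPatcher":
        """Return a clone for `device` that owns a freshly built model."""
        if self.cached_init is None:
            # No deepcopy fallback: refuse rather than copy a patched model.
            raise RuntimeError(
                "cannot clone for another device: no reload factory registered"
            )
        fresh_model = self.cached_init()
        # The clone starts with empty backups because nothing is patched yet.
        return ToyPatcher(
            model=fresh_model,
            load_device=device,
            cached_init=self.cached_init,
        )


def select_device(patcher: ToyPatcher, device: str) -> ToyPatcher:
    """Retarget only when the requested device differs from the current one."""
    if patcher.load_device == device:
        return patcher
    return patcher.deepclone_to(device)


if __name__ == "__main__":
    source = ToyPatcher({"w": 1}, "cuda:0", cached_init=lambda: {"w": 1})
    source.backup["w"] = 0  # pretend a patch was applied to the source
    clone = select_device(source, "cuda:1")
    print(clone.load_device, clone.backup, clone.model is source.model)
```

In this sketch the clone never shares state with the already-patched source, which is the property the commit message attributes to going through a factory rather than copy.deepcopy.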
..
audio_encoders	Fix fp16 audio encoder models (#12811)	2026-03-06 18:20:07 -05:00
background_removal	Add support for BiRefNet background remove model (CORE-46) (#12747)	2026-05-08 17:59:24 +08:00
cldm
comfy_types	fix: use frontend-compatible format for Float gradient_stops (#12789)	2026-03-12 10:14:28 -07:00
extra_samplers
image_encoders	feat: Support MoGe (CORE-168) (#13878)	2026-05-15 10:34:56 +08:00
k_diffusion	feat: Support HiDream-O1-Image (CORE-187) (#13817)	2026-05-11 20:35:53 -07:00
ldm	Merge remote-tracking branch 'origin/master' into worksplit-multigpu	2026-05-21 12:17:59 -07:00
sd1_tokenizer
t2i_adapter
taesd	Add high quality preview support for Flux2 latents (#13496)	2026-04-29 19:37:30 -04:00
text_encoders	Support Stable Audio 3 model. (#14010)	2026-05-20 11:34:22 -04:00
weight_adapter	MPDynamic: force load flux img_in weight (Fixes flux1 canny+depth lora crash) (#12446)	2026-02-15 20:30:09 -05:00
bg_removal_model.py	Fix BiRefNet issue (#13966)	2026-05-19 05:03:22 +08:00
cli_args.py	Merge remote-tracking branch 'origin/master' into worksplit-multigpu	2026-05-21 12:17:59 -07:00
clip_config_bigg.json
clip_model.py	Support the siglip 2 naflex model as a clip vision model. (#11831)	2026-01-12 17:05:54 -05:00
clip_vision_config_g.json
clip_vision_config_h.json
clip_vision_config_vitl_336_llava.json
clip_vision_config_vitl_336.json
clip_vision_config_vitl.json
clip_vision_siglip2_base_naflex.json	Support the siglip 2 naflex model as a clip vision model. (#11831)	2026-01-12 17:05:54 -05:00
clip_vision_siglip_384.json
clip_vision_siglip_512.json
clip_vision.py	Reduce RAM usage, fix VRAM OOMs, and fix Windows shared memory spilling with adaptive model loading (#11845)	2026-02-01 01:01:11 -05:00
conds.py	Cleanups to the last PR. (#12646)	2026-02-26 01:30:31 -05:00
context_windows.py	feat: Context windows - add causal_window_fix to improve blending of context windows (CORE-100) (#13563)	2026-05-05 16:40:53 -07:00
controlnet.py	Free QwenFunControlNet base_model reference in cleanup	2026-05-21 11:35:54 -07:00
deploy_environment.py	Add deploy environment header (Comfy-Env) to partner node API calls (#13425)	2026-05-04 20:17:56 -07:00
diffusers_convert.py
diffusers_load.py
float.py	feat: Support mxfp8 (#12907)	2026-03-14 18:36:29 -04:00
gligen.py
hooks.py	Fix typos (#10986)	2026-05-08 17:14:45 +08:00
latent_formats.py	Support Stable Audio 3 model. (#14010)	2026-05-20 11:34:22 -04:00
lora_convert.py	Use torch RMSNorm for flux models and refactor hunyuan video code. (#12432)	2026-02-13 15:35:13 -05:00
lora.py	Multi-threaded load of models from disk (big load time speedups & Offload to disk) (CORE-43,CORE-152,CORE-164,CORE-165,CORE-117) (#13802)	2026-05-20 17:03:58 -07:00
memory_management.py	memory_management: replace thread refusal with mutex	2026-05-23 01:00:30 +10:00
model_base.py	Support Stable Audio 3 model. (#14010)	2026-05-20 11:34:22 -04:00
model_detection.py	Support Stable Audio 3 model. (#14010)	2026-05-20 11:34:22 -04:00
model_management.py	SelectXDevice: address code-review follow-ups	2026-05-22 22:29:45 -07:00
model_patcher.py	multigpu: refactor deepclone_multigpu + register cached_patcher_init for CLIP/VAE; Select*Device retargets via deepclone	2026-05-23 19:11:48 -07:00
model_prefetch.py	prefetch: guard against no offload (#13703)	2026-05-04 12:56:05 -07:00
model_sampling.py	feat: Support HiDream-O1-Image (CORE-187) (#13817)	2026-05-11 20:35:53 -07:00
multigpu.py	Defer @pollockjj's tiled-VAE and UPSCALE_MODEL MultiGPU lanes (#14066)	2026-05-22 16:44:29 -07:00
nested_tensor.py	WIP way to support multi multi dimensional latents. (#10456)	2025-10-23 21:21:14 -04:00
ops.py	Multi-threaded load of models from disk (big load time speedups & Offload to disk) (CORE-43,CORE-152,CORE-164,CORE-165,CORE-117) (#13802)	2026-05-20 17:03:58 -07:00
options.py
patcher_extension.py
pinned_memory.py	Multi-threaded load of models from disk (big load time speedups & Offload to disk) (CORE-43,CORE-152,CORE-164,CORE-165,CORE-117) (#13802)	2026-05-20 17:03:58 -07:00
pixel_space_convert.py
quant_ops.py	Enable triton comfy kitchen via cli-arg (#12730)	2026-05-03 14:07:21 -04:00
rmsnorm.py	feat: Gemma4 text generation support (CORE-30) (#13376)	2026-05-02 22:46:15 -04:00
sample.py	Initial work to make downscale_ratio_temporal work. (#13972)	2026-05-18 23:01:43 -04:00
sampler_helpers.py	Merge remote-tracking branch 'origin/master' into merge-master-into-worksplit-multigpu	2026-05-19 21:43:51 -07:00
samplers.py	Merge branch 'master' into worksplit-multigpu	2026-05-22 23:05:58 -07:00
sd1_clip_config.json
sd1_clip.py	feat: Support Qwen3.5 text generation models (#12771)	2026-03-25 22:48:28 -04:00
sd.py	multigpu: refactor deepclone_multigpu + register cached_patcher_init for CLIP/VAE; Select*Device retargets via deepclone	2026-05-23 19:11:48 -07:00
sdxl_clip.py
supported_models_base.py	Fix some custom nodes. (#11134)	2025-12-05 18:25:31 -05:00
supported_models.py	Support Stable Audio 3 model. (#14010)	2026-05-20 11:34:22 -04:00
utils.py	Defer @pollockjj's tiled-VAE and UPSCALE_MODEL MultiGPU lanes (#14066)	2026-05-22 16:44:29 -07:00