EasyAI代码托管平台

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-03-20 00:24:59 +08:00

Author	SHA1	Message	Date
ReinerBforartists	385221234c	Fixes mutable default arguments in the wan vae. In Python, mutable default arguments are evaluated once at function definition time and shared across all subsequent calls. This is a well-known Python pitfall: ```python # BAD: this list is shared across ALL calls to forward() def forward(self, x, feat_cache=None, feat_idx=[0]): feat_idx[0] += 1 # modifies the shared default list! ``` In `comfy/ldm/wan/vae.py` and `comfy/ldm/wan/vae2_2.py`, the `forward` methods of `Resample`, `ResidualBlock`, `Down_ResidualBlock`, `Up_ResidualBlock`, `Encoder3d` and `Decoder3d` all use `feat_idx=[0]` as a default argument. Since `feat_idx[0]` is incremented inside these methods, the default value accumulates between inference runs. On the second run, `feat_idx[0]` no longer starts at `0` but at whatever value it reached at the end of the first run, causing incorrect cache indexing throughout the entire encoder and decoder. Fix: ```python # GOOD: a new list is created for every call that doesn't pass feat_idx def forward(self, x, feat_cache=None, feat_idx=None): # Fix: mutable default argument feat_idx=[0] would persist between calls if feat_idx is None: feat_idx = [0] ``` Observed impact: On AMD/ROCm hardware this bug caused 4-5x slower inference on all runs after the first with WAN VAE. After this fix, only Run 2 remains slightly slower (due to a separate MIOpen kernel cache issue), while Run 3 and beyond are now as fast as Run 1. The bug likely affects all hardware to some degree as incorrect cache indexing causes unnecessary recomputation. Related issues exists in the ROCm tracker and in the ComfyUI tracker. https://github.com/ROCm/ROCm/issues/6008 https://github.com/Comfy-Org/ComfyUI/issues/12672#issuecomment-4059981039	2026-03-14 09:57:05 +01:00
rattus128	95ca2e56c8	WAN2.2: Fix cache VRAM leak on error (#10308 ) Same change pattern as `7e8dd275c2` applied to WAN2.2 If this suffers an exception (such as a VRAM oom) it will leave the encode() and decode() methods which skips the cleanup of the WAN feature cache. The comfy node cache then ultimately keeps a reference this object which is in turn reffing large tensors from the failed execution. The feature cache is currently setup at a class variable on the encoder/decoder however, the encode and decode functions always clear it on both entry and exit of normal execution. Its likely the design intent is this is usable as a streaming encoder where the input comes in batches, however the functions as they are today don't support that. So simplify by bringing the cache back to local variable, so that if it does VRAM OOM the cache itself is properly garbage when the encode()/decode() functions dissappear from the stack.	2025-10-13 15:23:11 -04:00
comfyanonymous	1e638a140b	Tiny wan vae optimizations. (#9136 )	2025-08-01 05:25:38 -04:00
comfyanonymous	c60dc4177c	Remove unecessary clones in the wan2.2 VAE. (#9083 )	2025-07-28 14:48:19 -04:00
comfyanonymous	a88788dce6	Wan 2.2 support. (#9080 )	2025-07-28 08:00:23 -04:00

5 Commits