ComfyUI/comfy at abfea891efb1617099f52d7c13487aa46e541cde - ComfyUI - EasyAI Code Hosting Platform

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-07-21 23:41:28 +08:00

RyanOnTheInside abfea891ef Fix conditioning mask normalization for arbitrary spatial dimensions.

May also resolve #9784 — the mask normalization fixes a class of dimensionality mismatches that can cause the `y, x = torch.where(mask)` crash in `get_mask_aabb`, though the root cause in that report is unconfirmed.

## Summary

`resolve_areas_and_cond_masks_multidim` assumes 2D spatial masks. This breaks for 1D audio models (StableAudio1, ACEAudio15) because upstream code (`ConditioningSetMask`, `set_mask_for_conditioning`) unconditionally unsqueezes masks with `ndim < 3`, corrupting valid `[B, L]` masks into `[1, B, L]` before they reach the sampler.

This PR:
- Normalizes masks to `[batch, *spatial_dims]` using `dims` as the source of truth
- Adds a 1D resize path via `F.interpolate(mode='linear')`
- Guards `set_area_to_bounds` with `len(dims) == 2` to prevent crashes on non-2D masks (the existing `get_mask_aabb` and `H, W, Y, X` unpacking are 2D-only)

The root cause is the hardcoded `if len(mask.shape) < 3` in `nodes.py:242` and `hooks.py:725`. Fixing it there would require threading latent dimensionality into the conditioning nodes — a much larger change. Normalizing in `resolve_areas_and_cond_masks_multidim` where `dims` is already available is the minimal fix.
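The three changes listed above can be illustrated with a minimal sketch. This is not the code from this commit: the helper names `normalize_mask`, `resize_mask_1d`, and `mask_bounds_2d` are hypothetical, and the rule of dropping leading singleton axes is an assumption about how a corrupted `[1, B, L]` mask might be recovered when `dims` holds the latent's spatial sizes.

```python
import torch
import torch.nn.functional as F


def normalize_mask(mask: torch.Tensor, dims: tuple) -> torch.Tensor:
    """Hypothetical helper: return mask as [batch, *spatial] with len(dims) spatial axes."""
    target_ndim = len(dims) + 1
    # Drop leading singleton axes left by an unconditional unsqueeze,
    # e.g. [1, B, L] -> [B, L] when the latent is 1D.
    while mask.ndim > target_ndim and mask.shape[0] == 1:
        mask = mask.squeeze(0)
    # Add a batch axis when it is missing, e.g. [L] -> [1, L].
    while mask.ndim < target_ndim:
        mask = mask.unsqueeze(0)
    if mask.ndim != target_ndim:
        raise ValueError(f"cannot normalize mask of shape {tuple(mask.shape)} for dims {dims}")
    return mask


def resize_mask_1d(mask: torch.Tensor, length: int) -> torch.Tensor:
    """Hypothetical helper: resize a [B, L] mask to [B, length] by linear interpolation."""
    if mask.shape[-1] == length:
        return mask
    # F.interpolate with mode="linear" expects a 3D [N, C, L] input.
    resized = F.interpolate(mask.unsqueeze(1).float(), size=length, mode="linear")
    return resized.squeeze(1)


def mask_bounds_2d(mask: torch.Tensor, dims: tuple):
    """Hypothetical guard: compute a (y0, x0, y1, x1) box only for 2D latents."""
    if len(dims) != 2:
        return None  # bounding-box logic is 2D-only, so skip it otherwise
    ys, xs = torch.where(mask[0] > 0)
    if ys.numel() == 0:
        return None
    return (int(ys.min()), int(xs.min()), int(ys.max()) + 1, int(xs.max()) + 1)


# A valid [B, L] audio mask that upstream turned into [1, B, L].
corrupted = torch.ones(1, 2, 8)
audio_mask = resize_mask_1d(normalize_mask(corrupted, (16,)), 16)
print(audio_mask.shape)
```

Keying the target rank on `len(dims)` rather than on a fixed `ndim < 3` check is what lets the same path serve 1D audio, 2D image, and 3D video latents in this sketch.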

Fully backwards compatible for existing 2D image and 3D video workflows.

## Test plan

- [x] 26 unit tests covering 1D/2D/3D mask normalization, resize, and `set_area_to_bounds` guard (`tests-unit/comfy_test/samplers_test.py`)
- [x] 2D image regression with hook masking: [lorahookmasking.json](https://github.com/Kosinkadink/ComfyUI/blob/workflows/lorahookmasking.json)
- [x] 2D image with `set_area_to_bounds` ("mask bounds" mode) — no crash, correct area computation
- [x] 1D audio with conditioning mask: [acestep-1.5-prompt-lora-blending.json](https://github.com/ryanontheinside/ComfyUI_RyanOnTheInside/blob/main/examples/ace1.5/acestep-1.5-prompt-lora-blending.json) (requires custom nodes that patch this function pending upstream)

2026-02-15 09:45:14 -05:00

Reduce RAM usage, fix VRAM OOMs, and fix Windows shared memory spilling with adaptive model loading (#11845)

2026-02-01 01:01:11 -05:00

Add better error message for common error. (#10846)

2025-11-23 04:55:22 -05:00

Add support for dev-only nodes. (#12106)

2026-01-27 13:03:29 -08:00

Uni pc sampler now works with audio and video models.

2025-01-18 05:27:58 -05:00

Add Hunyuan 3D 2.1 Support (#8714)

2025-09-04 20:36:20 -04:00

ace15: Use dynamic_vram friendly trange (#12409)

2026-02-11 14:53:42 -05:00

Add working Qwen 2512 ControlNet (Fun ControlNet) support (#12359)

2026-02-13 22:23:52 -05:00

Silence clip tokenizer warning. (#8934)

2025-07-16 14:42:07 -04:00

Controlnet refactor.

2024-06-27 18:43:11 -04:00

Support LTX2 tiny vae (taeltx_2) (#11929)

2026-01-21 23:03:51 -05:00

Add left padding to LTXAV text encoder. (#12456)

2026-02-13 21:56:54 -05:00

[Trainer] training with proper offloading (#12189)

2026-02-10 21:45:19 -05:00

cli_args.py

Reduce RAM usage, fix VRAM OOMs, and fix Windows shared memory spilling with adaptive model loading (#11845)

2026-02-01 01:01:11 -05:00

clip_config_bigg.json

Fix potential issue with non clip text embeddings.

2024-07-30 14:41:13 -04:00

clip_model.py

Support the siglip 2 naflex model as a clip vision model. (#11831)

2026-01-12 17:05:54 -05:00

clip_vision_config_g.json

Add support for clip g vision model to CLIPVisionLoader.

2023-08-18 11:13:29 -04:00

clip_vision_config_h.json

Add support for unCLIP SD2.x models.

2023-04-01 23:19:15 -04:00

clip_vision_config_vitl_336_llava.json

Support llava clip vision model.

2025-03-06 00:24:43 -05:00

clip_vision_config_vitl_336.json

support clip-vit-large-patch14-336 (#4042)

2024-07-17 13:12:50 -04:00

clip_vision_config_vitl.json

Add support for unCLIP SD2.x models.

2023-04-01 23:19:15 -04:00

clip_vision_siglip2_base_naflex.json

Support the siglip 2 naflex model as a clip vision model. (#11831)

2026-01-12 17:05:54 -05:00

clip_vision_siglip_384.json

Support new flux model variants.

2024-11-21 08:38:23 -05:00

clip_vision_siglip_512.json

Support 512 siglip model.

2025-04-05 07:01:01 -04:00

clip_vision.py

Reduce RAM usage, fix VRAM OOMs, and fix Windows shared memory spilling with adaptive model loading (#11845)

2026-02-01 01:01:11 -05:00

conds.py

Add some warnings and prevent crash when cond devices don't match. (#9169)

2025-08-04 04:20:12 -04:00

context_windows.py

Add handling for vace_context in context windows (#11386)

2025-12-30 14:40:42 -08:00

controlnet.py

Add working Qwen 2512 ControlNet (Fun ControlNet) support (#12359)

2026-02-13 22:23:52 -05:00

diffusers_convert.py

Remove useless code.

2025-01-24 06:15:54 -05:00

diffusers_load.py

load_unet -> load_diffusion_model with a model_options argument.

2024-08-12 23:20:57 -04:00

float.py

Optimize nvfp4 lora applying. (#11866)

2026-01-14 00:49:38 -05:00

gligen.py

Remove some useless code. (#8812)

2025-07-06 07:07:39 -04:00

hooks.py

New Year ruff cleanup. (#11595)

2026-01-01 22:06:14 -05:00

latent_formats.py

Basic support for the ace step 1.5 model. (#12237)

2026-02-03 00:06:18 -05:00

lora_convert.py

Use torch RMSNorm for flux models and refactor hunyuan video code. (#12432)

2026-02-13 15:35:13 -05:00

lora.py

Support ace step 1.5 base model loras. (#12252)

2026-02-03 13:54:23 -05:00

memory_management.py

Reduce RAM usage, fix VRAM OOMs, and fix Windows shared memory spilling with adaptive model loading (#11845)

2026-02-01 01:01:11 -05:00

model_base.py

Make built in lora training work on anima. (#12402)

2026-02-10 22:04:32 -05:00

model_detection.py

Use torch RMSNorm for flux models and refactor hunyuan video code. (#12432)

2026-02-13 15:35:13 -05:00

model_management.py

dynamic_vram: Fix windows Aimdo crash + Fix LLM performance (#12408)

2026-02-11 14:50:16 -05:00

model_patcher.py

dynamic_vram: Training fixes (#12442)

2026-02-13 15:29:37 -05:00

model_sampling.py

Refactor model sampling sigmas code. (#10250)

2025-10-08 17:49:02 -04:00

nested_tensor.py

WIP way to support multi multi dimensional latents. (#10456)

2025-10-23 21:21:14 -04:00

ops.py

dynamic_vram: Fix windows Aimdo crash + Fix LLM performance (#12408)

2026-02-11 14:50:16 -05:00

options.py

Only parse command line args when main.py is called.

2023-09-13 11:38:20 -04:00

patcher_extension.py

Fix order of inputs nested merge_nested_dicts (#10362)

2025-10-15 16:47:26 -07:00

pinned_memory.py

fix pinning with model defined dtype (#12208)

2026-02-01 08:42:32 -08:00

pixel_space_convert.py

Changes to the previous radiance commit. (#9851)

2025-09-13 18:03:34 -04:00

quant_ops.py

Optimize nvfp4 lora applying. (#11866)

2026-01-14 00:49:38 -05:00

rmsnorm.py

Add warning when using old pytorch. (#9347)

2025-08-15 00:22:26 -04:00

sample.py

Make regular empty latent node work properly on flux 2 variants. (#12050)

2026-01-23 19:50:48 -05:00

sampler_helpers.py

[Trainer] training with proper offloading (#12189)

2026-02-10 21:45:19 -05:00

samplers.py

Fix conditioning mask normalization for arbitrary spatial dimensions.

2026-02-15 09:45:14 -05:00

sd1_clip_config.json

Fix potential issue with non clip text embeddings.

2024-07-30 14:41:13 -04:00

sd1_clip.py

Support generating attention masks for left padded text encoders. (#12454)

2026-02-13 20:15:23 -05:00

sd.py

sd: delay VAE dtype archive until after override (#12388)

2026-02-10 13:37:46 -05:00

sdxl_clip.py

Add a T5TokenizerOptions node to set options for the T5 tokenizer. (#7803)

2025-04-25 19:36:00 -04:00

supported_models_base.py

Fix some custom nodes. (#11134)

2025-12-05 18:25:31 -05:00

supported_models.py

Use torch RMSNorm for flux models and refactor hunyuan video code. (#12432)

2026-02-13 15:35:13 -05:00

utils.py

Remove unsafe pickle loading code that was used on pytorch older than 2.4 (#12473)

2026-02-14 22:53:52 -05:00

windows.py

Reduce RAM usage, fix VRAM OOMs, and fix Windows shared memory spilling with adaptive model loading (#11845)

2026-02-01 01:01:11 -05:00