ComfyUI/comfy/ldm
Jukka Seppänen fd109325db
Kandinsky5 model support (#10988)
* Add Kandinsky5 model support

lite and pro T2V tested to work

* Update kandinsky5.py

* Fix fp8

* Fix fp8_scaled text encoder

* Add transformer_options for attention

* Code cleanup, optimizations, use fp32 for all layers originally at fp32

* ImageToVideo -node

* Fix I2V, add necessary latent post process nodes

* Support text to image model

* Support block replace patches (SLG mostly)

* Support official LoRAs

* Don't scale RoPE for lite model as that just doesn't work...

* Update supported_models.py

* Rever RoPE scaling to simpler one

* Fix typo

* Handle latent dim difference for image model in the VAE instead

* Add node to use different prompts for clip_l and qwen25_7b

* Reduce peak VRAM usage a bit

* Further reduce peak VRAM consumption by chunking ffn

* Update chunking

* Update memory_usage_factor

* Code cleanup, don't force the fp32 layers as it has minimal effect

* Allow for stronger changes with first frames normalization

Default values are too weak for any meaningful changes, these should probably be exposed as advanced node options when that's available.

* Add image model's own chat template, remove unused image2video template

* Remove hard error in ReplaceVideoLatentFrames -node

* Update kandinsky5.py

* Update supported_models.py

* Fix typos in prompt template

They were now fixed in the original repository as well

* Update ReplaceVideoLatentFrames

Add tooltips
Make source optional
Better handle negative index

* Rename NormalizeVideoLatentFrames -node

For bit better clarity what it does

* Fix NormalizeVideoLatentStart node out on non-op
2025-12-05 22:20:22 -05:00
..
ace
audio
aura
cascade
chroma Implement the Ovis image model. (#11030) 2025-12-01 20:56:17 -05:00
chroma_radiance
cosmos
flux Implement the Ovis image model. (#11030) 2025-12-01 20:56:17 -05:00
genmo
hidream
hunyuan3d
hunyuan3dv2_1
hunyuan_video Implement temporal rolling VAE (Major VRAM reductions in Hunyuan and Kandinsky) (#10995) 2025-12-02 22:49:29 -05:00
hydit
kandinsky5 Kandinsky5 model support (#10988) 2025-12-05 22:20:22 -05:00
lightricks
lumina Remove line made unnecessary (and wrong) after transformer_options was added to NextDiT's _forward definition (#11118) 2025-12-05 14:05:38 -05:00
mmaudio/vae
models Flux 2 (#10879) 2025-11-25 10:50:19 -05:00
modules Implement temporal rolling VAE (Major VRAM reductions in Hunyuan and Kandinsky) (#10995) 2025-12-02 22:49:29 -05:00
omnigen
pixart
qwen_image
wan
common_dit.py
util.py