EasyAI代码托管平台

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-07-15 19:09:09 +08:00

Author	SHA1	Message	Date
doctorpangloss	e7d0cc457d	fix tests, replace broken llava and fix transformers videos issue	2025-12-11 14:23:05 -08:00
doctorpangloss	a00c902067	Merge branch 'master' of github.com:comfyanonymous/ComfyUI into merge/0.3.76-snapshot	2025-12-09 08:58:52 -08:00
comfyanonymous	878db3a727	Implement the Ovis image model. (#11030 )	2025-12-01 20:56:17 -05:00
Haoming	c38e7d6599	block info (#10841 )	2025-11-26 20:28:44 -08:00
comfyanonymous	6b573ae0cb	Flux 2 (#10879 )	2025-11-25 10:50:19 -05:00
comfyanonymous	d526974576	Fix hunyuan 3d 2.0 (#10792 )	2025-11-18 16:46:19 -05:00
comfyanonymous	f60923590c	Use same code for chroma and flux blocks so that optimizations are shared. (#10746 )	2025-11-14 01:28:05 -05:00
rattus	94c298f962	flux: reduce VRAM usage (#10737 ) Cleanup a bunch of stack tensors on Flux. This take me from B=19 to B=22 for 1600x1600 on RTX5090.	2025-11-13 16:02:03 -08:00
comfyanonymous	2abd2b5c20	Make ScaleROPE node work on Flux. (#10686 )	2025-11-08 15:52:02 -05:00
contentis	4cd881866b	Use single apply_rope function across models (#10547 )	2025-11-04 20:10:11 -05:00
doctorpangloss	be56a14e65	Merge commit 'a4787ac83bf6c83eeb459ed80fc9b36f63d2a3a7' of github.com:comfyanonymous/ComfyUI into fix-merge	2025-10-21 10:53:43 -07:00
rattus128	653ceab414	Reduce Peak WAN inference VRAM usage - part II (#10062 ) * flux: math: Use _addcmul to avoid expensive VRAM intermediate The rope process can be the VRAM peak and this intermediate for the addition result before releasing the original can OOM. addcmul_ it. * wan: Delete the self attention before cross attention This saves VRAM when the cross attention and FFN are in play as the VRAM peak.	2025-09-27 18:14:16 -04:00
doctorpangloss	a9a0f96408	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-09-22 14:29:50 -07:00
rattus128	e42682b24e	Reduce Peak WAN inference VRAM usage (#9898 ) * flux: Do the xq and xk ropes one at a time This was doing independendent interleaved tensor math on the q and k tensors, leading to the holding of more than the minimum intermediates in VRAM. On a bad day, it would VRAM OOM on xk intermediates. Do everything q and then everything k, so torch can garbage collect all of qs intermediates before k allocates its intermediates. This reduces peak VRAM usage for some WAN2.2 inferences (at least). * wan: Optimize qkv intermediates on attention As commented. The former logic computed independent pieces of QKV in parallel which help more inference intermediates in VRAM spiking VRAM usage. Fully roping Q and garbage collecting the intermediates before touching K reduces the peak inference VRAM usage.	2025-09-16 19:21:14 -04:00
Jedrzej Kosinski	d7f40442f9	Enable Runtime Selection of Attention Functions (#9639 ) * Looking into a @wrap_attn decorator to look for 'optimized_attention_override' entry in transformer_options * Created logging code for this branch so that it can be used to track down all the code paths where transformer_options would need to be added * Fix memory usage issue with inspect * Made WAN attention receive transformer_options, test node added to wan to test out attention override later * Added *kwargs to all attention functions so transformer_options could potentially be passed through Make sure wrap_attn doesn't make itself recurse infinitely, attempt to load SageAttention and FlashAttention if not enabled so that they can be marked as available or not, create registry for available attention * Turn off attention logging for now, make AttentionOverrideTestNode have a dropdown with available attention (this is a test node only) * Make flux work with optimized_attention_override * Add logs to verify optimized_attention_override is passed all the way into attention function * Make Qwen work with optimized_attention_override * Made hidream work with optimized_attention_override * Made wan patches_replace work with optimized_attention_override * Made SD3 work with optimized_attention_override * Made HunyuanVideo work with optimized_attention_override * Made Mochi work with optimized_attention_override * Made LTX work with optimized_attention_override * Made StableAudio work with optimized_attention_override * Made optimized_attention_override work with ACE Step * Made Hunyuan3D work with optimized_attention_override * Make CosmosPredict2 work with optimized_attention_override * Made CosmosVideo work with optimized_attention_override * Made Omnigen 2 work with optimized_attention_override * Made StableCascade work with optimized_attention_override * Made AuraFlow work with optimized_attention_override * Made Lumina work with optimized_attention_override * Made Chroma work with optimized_attention_override * Made SVD work with optimized_attention_override * Fix WanI2VCrossAttention so that it expects to receive transformer_options * Fixed Wan2.1 Fun Camera transformer_options passthrough * Fixed WAN 2.1 VACE transformer_options passthrough * Add optimized to get_attention_function * Disable attention logs for now * Remove attention logging code * Remove _register_core_attention_functions, as we wouldn't want someone to call that, just in case * Satisfy ruff * Remove AttentionOverrideTest node, that's something to cook up for later	2025-09-12 18:07:38 -04:00
doctorpangloss	179c2d35c8	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-09-03 12:04:32 -07:00
comfyanonymous	e3018c2a5a	uso -> uxo/uno as requested. (#9688 )	2025-09-02 16:12:07 -04:00
comfyanonymous	3412d53b1d	USO style reference. (#9677 ) Load the projector.safetensors file with the ModelPatchLoader node and use the siglip_vision_patch14_384.safetensors "clip vision" model and the USOStyleReferenceNode.	2025-09-02 15:36:22 -04:00
comfyanonymous	27e067ce50	Implement the USO subject identity lora. (#9674 ) Use the lora with FluxContextMultiReferenceLatentMethod node set to "uso" and a ReferenceLatent node with the reference image.	2025-09-01 18:54:02 -04:00
comfyanonymous	b5ac6ed7ce	Fixes to make controlnet type models work on qwen edit and kontext. (#9581 )	2025-08-27 15:26:28 -04:00
doctorpangloss	443bb45eaf	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-08-26 13:59:45 -07:00
Jedrzej Kosinski	fc247150fe	Implement EasyCache and Invent LazyCache (#9496 ) * Attempting a universal implementation of EasyCache, starting with flux as test; I screwed up the math a bit, but when I set it just right it works. * Fixed math to make threshold work as expected, refactored code to use EasyCacheHolder instead of a dict wrapped by object * Use sigmas from transformer_options instead of timesteps to be compatible with a greater amount of models, make end_percent work * Make log statement when not skipping useful, preparing for per-cond caching * Added DIFFUSION_MODEL wrapper around forward function for wan model * Add subsampling for heuristic inputs * Add subsampling to output_prev (output_prev_subsampled now) * Properly consider conds in EasyCache logic * Created SuperEasyCache to test what happens if caching and reuse is moved outside the scope of conds, added PREDICT_NOISE wrapper to facilitate this test * Change max reuse_threshold to 3.0 * Mark EasyCache/SuperEasyCache as experimental (beta) * Make Lumina2 compatible with EasyCache * Add EasyCache support for Qwen Image * Fix missing comma, curse you Cursor * Add EasyCache support to AceStep * Add EasyCache support to Chroma * Added EasyCache support to Cosmos Predict t2i * Make EasyCache not crash with Cosmos Predict ImagToVideo latents, but does not work well at all * Add EasyCache support to hidream * Added EasyCache support to hunyuan video * Added EasyCache support to hunyuan3d * Added EasyCache support to LTXV (not very good, but does not crash) * Implemented EasyCache for aura_flow * Renamed SuperEasyCache to LazyCache, hardcoded subsample_factor to 8 on nodes * Eatra logging when verbose is true for EasyCache	2025-08-22 22:41:08 -04:00
doctorpangloss	dfc47e0611	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-08-22 13:24:52 -07:00
comfyanonymous	c308a8840a	Add FluxKontextMultiReferenceLatentMethod node. (#9356 ) This node is only useful if someone trains the kontext model to properly use multiple reference images via the index method. The default is the offset method which feeds the multiple images like if they were stitched together as one. This method works with the current flux kontext model.	2025-08-15 15:50:39 -04:00
doctorpangloss	a7aff3565b	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-06-26 16:57:25 -07:00
comfyanonymous	ef5266b1c1	Support Flux Kontext Dev model. (#8679 )	2025-06-26 11:28:41 -04:00
doctorpangloss	f6339e8115	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-06-24 12:52:54 -07:00
comfyanonymous	f7fb193712	Small flux optimization. (#8611 )	2025-06-20 05:37:32 -04:00
comfyanonymous	7e9267fa77	Make flux controlnet work with sd3 text enc. (#8599 )	2025-06-19 18:50:05 -04:00
doctorpangloss	1d901e32eb	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-06-17 13:20:31 -07:00
doctorpangloss	82388d51a2	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-06-17 10:35:10 -07:00
comfyanonymous	8a4ff747bd	Fix mistake in last commit. (#8496 ) * Move to right place.	2025-06-11 15:13:29 -04:00
comfyanonymous	af1eb58be8	Fix black images on some flux models in fp16. (#8495 )	2025-06-11 15:09:11 -04:00
comfyanonymous	4248b1618f	Let chroma TE work on regular flux. (#8429 )	2025-06-05 10:07:17 -04:00
doctorpangloss	040a324346	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-03-29 15:57:24 -07:00
comfyanonymous	3b19fc76e3	Allow disabling pe in flux code for some other models.	2025-03-18 05:09:25 -04:00
comfyanonymous	e8e990d6b8	Cleanup code.	2025-03-16 06:29:12 -04:00
comfyanonymous	9aac21f894	Fix issues with new hunyuan img2vid model and bumb version to v0.3.26	2025-03-09 05:07:22 -04:00
comfyanonymous	7395b0c0d1	Support new hunyuan video i2v model. Use the new "v2 (replace)" guidance type in HunyuanImageToVideo and set image_interleave to 4 on the "Text Encode Hunyuan Video" node.	2025-03-08 20:34:47 -05:00
doctorpangloss	693038738a	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-02-24 09:39:26 -08:00
HishamC	b124256817	Fix for running via DirectML (#6542 ) * Fix for running via DirectML Fix DirectML empty image generation issue with Flux1. add CPU fallback for unsupported path. Verified the model works on AMD GPUs * fix formating * update casual mask calculation	2025-02-11 17:11:32 -05:00
doctorpangloss	a3452f6e6a	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-01-28 13:45:51 -08:00
comfyanonymous	fb2ad645a3	Add FluxDisableGuidance node to disable using the guidance embed.	2025-01-20 14:50:24 -05:00
doctorpangloss	631d9e44c6	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2025-01-16 09:58:02 -08:00
comfyanonymous	31831e6ef1	Code refactor.	2025-01-16 07:23:54 -05:00
comfyanonymous	6320d05696	Slightly lower hunyuan video memory usage.	2025-01-16 00:23:01 -05:00
comfyanonymous	b7572b2f87	Fix and enforce no trailing whitespace.	2024-12-31 03:16:37 -05:00
doctorpangloss	7655be873c	Updates to support Hunyuan Video	2024-12-25 22:39:12 -08:00
doctorpangloss	0fd407ae87	Merge branch 'master' of github.com:comfyanonymous/ComfyUI	2024-12-24 16:48:03 -08:00
comfyanonymous	bda1482a27	Basic Hunyuan Video model support.	2024-12-16 19:35:40 -05:00

1 2 3

104 Commits