mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-06-24 16:59:29 +08:00

Author	SHA1	Message	Date
Jukka Seppänen	e2a800e7ef	Fix for HunyuanVideo1.5 meanflow distil (#11212 )	2025-12-09 16:59:16 -05:00
Lodestone	b9fb542703	add chroma-radiance-x0 mode (#11197 )	2025-12-08 23:33:29 -05:00
comfyanonymous	56fa7dbe38	Properly load the newbie diffusion model. (#11172 ) There is still one of the text encoders missing and I didn't actually test it.	2025-12-07 07:44:55 -05:00
Jukka Seppänen	fd109325db	Kandinsky5 model support (#10988 ) * Add Kandinsky5 model support lite and pro T2V tested to work * Update kandinsky5.py * Fix fp8 * Fix fp8_scaled text encoder * Add transformer_options for attention * Code cleanup, optimizations, use fp32 for all layers originally at fp32 * ImageToVideo -node * Fix I2V, add necessary latent post process nodes * Support text to image model * Support block replace patches (SLG mostly) * Support official LoRAs * Don't scale RoPE for lite model as that just doesn't work... * Update supported_models.py * Rever RoPE scaling to simpler one * Fix typo * Handle latent dim difference for image model in the VAE instead * Add node to use different prompts for clip_l and qwen25_7b * Reduce peak VRAM usage a bit * Further reduce peak VRAM consumption by chunking ffn * Update chunking * Update memory_usage_factor * Code cleanup, don't force the fp32 layers as it has minimal effect * Allow for stronger changes with first frames normalization Default values are too weak for any meaningful changes, these should probably be exposed as advanced node options when that's available. * Add image model's own chat template, remove unused image2video template * Remove hard error in ReplaceVideoLatentFrames -node * Update kandinsky5.py * Update supported_models.py * Fix typos in prompt template They were now fixed in the original repository as well * Update ReplaceVideoLatentFrames Add tooltips Make source optional Better handle negative index * Rename NormalizeVideoLatentFrames -node For bit better clarity what it does * Fix NormalizeVideoLatentStart node out on non-op	2025-12-05 22:20:22 -05:00
comfyanonymous	43071e3de3	Make old scaled fp8 format use the new mixed quant ops system. (#11000 )	2025-12-05 14:35:42 -05:00
comfyanonymous	878db3a727	Implement the Ovis image model. (#11030 )	2025-12-01 20:56:17 -05:00
comfyanonymous	e9aae31fa2	Z Image model. (#10892 )	2025-11-25 18:41:45 -05:00
comfyanonymous	6b573ae0cb	Flux 2 (#10879 )	2025-11-25 10:50:19 -05:00
comfyanonymous	943b3b615d	HunyuanVideo 1.5 (#10819 ) * init * update * Update model.py * Update model.py * remove print * Fix text encoding * Prevent empty negative prompt Really doesn't work otherwise * fp16 works * I2V * Update model_base.py * Update nodes_hunyuan.py * Better latent rgb factors * Use the correct sigclip output... * Support HunyuanVideo1.5 SR model * whitespaces... * Proper latent channel count * SR model fixes This also still needs timesteps scheduling based on the noise scale, can be used with two samplers too already * vae_refiner: roll the convolution through temporal Work in progress. Roll the convolution through time using 2-latent-frame chunks and a FIFO queue for the convolution seams. * Support HunyuanVideo15 latent resampler * fix * Some cleanup Co-Authored-By: comfyanonymous <121283862+comfyanonymous@users.noreply.github.com> * Proper hyvid15 I2V channels Co-Authored-By: comfyanonymous <121283862+comfyanonymous@users.noreply.github.com> * Fix TokenRefiner for fp16 Otherwise x.sum has infs, just in case only casting if input is fp16, I don't know if necessary. * Bugfix for the HunyuanVideo15 SR model * vae_refiner: roll the convolution through temporal II Roll the convolution through time using 2-latent-frame chunks and a FIFO queue for the convolution seams. Added support for encoder, lowered to 1 latent frame to save more VRAM, made work for Hunyuan Image 3.0 (as code shared). Fixed names, cleaned up code. * Allow any number of input frames in VAE. * Better VAE encode mem estimation. * Lowvram fix. * Fix hunyuan image 2.1 refiner. * Fix mistake. * Name changes. * Rename. * Whitespace. * Fix. * Fix. --------- Co-authored-by: kijai <40791699+kijai@users.noreply.github.com> Co-authored-by: Rattus <rattus128@gmail.com>	2025-11-20 22:44:43 -05:00
contentis	8817f8fc14	Mixed Precision Quantization System (#10498 ) * Implement mixed precision operations with a registry design and metadate for quant spec in checkpoint. * Updated design using Tensor Subclasses * Fix FP8 MM * An actually functional POC * Remove CK reference and ensure correct compute dtype * Update unit tests * ruff lint * Implement mixed precision operations with a registry design and metadate for quant spec in checkpoint. * Updated design using Tensor Subclasses * Fix FP8 MM * An actually functional POC * Remove CK reference and ensure correct compute dtype * Update unit tests * ruff lint * Fix missing keys * Rename quant dtype parameter * Rename quant dtype parameter * Fix unittests for CPU build	2025-10-28 16:20:53 -04:00
comfyanonymous	dad076aee6	Speed up chroma radiance. (#10395 )	2025-10-18 23:19:52 -04:00
comfyanonymous	8aea746212	Implement gemma 3 as a text encoder. (#10241 ) Not useful yet.	2025-10-06 22:08:08 -04:00
comfyanonymous	dc95b6acc0	Basic WIP support for the wan animate model. (#9939 )	2025-09-19 03:07:17 -04:00
comfyanonymous	9288c78fc5	Support the HuMo model. (#9903 )	2025-09-17 00:12:48 -04:00
blepping	c1297f4eb3	Add support for Chroma Radiance (#9682 ) * Initial Chroma Radiance support * Minor Chroma Radiance cleanups * Update Radiance nodes to ensure latents/images are on the intermediate device * Fix Chroma Radiance memory estimation. * Increase Chroma Radiance memory usage factor * Increase Chroma Radiance memory usage factor once again * Ensure images are multiples of 16 for Chroma Radiance Add batch dimension and fix channels when necessary in ChromaRadianceImageToLatent node * Tile Chroma Radiance NeRF to reduce memory consumption, update memory usage factor * Update Radiance to support conv nerf final head type. * Allow setting NeRF embedder dtype for Radiance Bump Radiance nerf tile size to 32 Support EasyCache/LazyCache on Radiance (maybe) * Add ChromaRadianceStubVAE node * Crop Radiance image inputs to multiples of 16 instead of erroring to be in line with existing VAE behavior * Convert Chroma Radiance nodes to V3 schema. * Add ChromaRadianceOptions node and backend support. Cleanups/refactoring to reduce code duplication with Chroma. * Fix overriding the NeRF embedder dtype for Chroma Radiance * Minor Chroma Radiance cleanups * Move Chroma Radiance to its own directory in ldm Minor code cleanups and tooltip improvements * Fix Chroma Radiance embedder dtype overriding * Remove Radiance dynamic nerf_embedder dtype override feature * Unbork Radiance NeRF embedder init * Remove Chroma Radiance image conversion and stub VAE nodes Add a chroma_radiance option to the VAELoader builtin node which uses comfy.sd.PixelspaceConversionVAE Add a PixelspaceConversionVAE to comfy.sd for converting BHWC 0..1 <-> BCHW -1..1	2025-09-13 17:58:43 -04:00
comfyanonymous	e01e99d075	Support hunyuan image distilled model. (#9807 )	2025-09-10 23:17:34 -04:00
comfyanonymous	85e34643f8	Support hunyuan image 2.1 regular model. (#9792 )	2025-09-10 02:05:07 -04:00
Yousef R. Gamaleldin	261421e218	Add Hunyuan 3D 2.1 Support (#8714 )	2025-09-04 20:36:20 -04:00
comfyanonymous	88aee596a3	WIP Wan 2.2 S2V model. (#9568 )	2025-08-27 01:10:34 -04:00
comfyanonymous	f7bd5e58dd	Make it easier to implement future qwen controlnets. (#9485 )	2025-08-21 23:18:04 -04:00
comfyanonymous	1702e6df16	Implement wan2.2 camera model. (#9357 ) Use the old WanCameraImageToVideo node.	2025-08-15 17:29:58 -04:00
comfyanonymous	560d38f34c	Wan2.2 fun control support. (#9292 )	2025-08-12 23:26:33 -04:00
comfyanonymous	c012400240	Initial support for qwen image model. (#9179 )	2025-08-04 22:53:25 -04:00
comfyanonymous	a88788dce6	Wan 2.2 support. (#9080 )	2025-07-28 08:00:23 -04:00
comfyanonymous	ec70ed6aea	Omnigen2 model implementation. (#8669 )	2025-06-25 19:35:57 -04:00
comfyanonymous	d6a2137fc3	Support Cosmos predict2 image to video models. (#8535 ) Use the CosmosPredict2ImageToVideoLatent node.	2025-06-14 21:37:07 -04:00
comfyanonymous	251f54a2ad	Basic initial support for cosmos predict2 text to image 2B and 14B models. (#8517 )	2025-06-13 07:05:23 -04:00
comfyanonymous	a0651359d7	Return proper error if diffusion model not detected properly. (#8272 )	2025-05-25 05:28:11 -04:00
comfyanonymous	1c2d45d2b5	Fix typo in last PR. (#8144 ) More robust model detection for future proofing.	2025-05-15 19:02:19 -04:00
comfyanonymous	56b6ee6754	Detection code to make ltxv models without config work. (#7986 )	2025-05-07 21:28:24 -04:00
comfyanonymous	16417b40d9	Initial ACE-Step model implementation. (#7972 )	2025-05-07 08:33:34 -04:00
comfyanonymous	08ff5fa08a	Cleanup chroma PR.	2025-04-30 20:57:30 -04:00
Silver	4ca3d84277	Support for Chroma - Flux1 Schnell distilled with CFG (#7355 ) * Upload files for Chroma Implementation * Remove trailing whitespace * trim more trailing whitespace..oops * remove unused imports * Add supported_inference_dtypes * Set min_length to 0 and remove attention_mask=True * Set min_length to 1 * get_mdulations added from blepping and minor changes * Add lora conversion if statement in lora.py * Update supported_models.py * update model_base.py * add uptream commits * set modelType.FLOW, will cause beta scheduler to work properly * Adjust memory usage factor and remove unnecessary code * fix mistake * reduce code duplication * remove unused imports * refactor for upstream sync * sync chroma-support with upstream via syncbranch patch * Update sd.py * Add Chroma as option for the OptimalStepsScheduler node	2025-04-30 20:57:00 -04:00
comfyanonymous	ce22f687cc	Support for WAN VACE preview model. (#7711 ) * Support for WAN VACE preview model. * Remove print.	2025-04-21 14:40:29 -04:00
comfyanonymous	c14429940f	Support loading WAN FLF model.	2025-04-17 12:04:48 -04:00
comfyanonymous	9ad792f927	Basic support for hidream i1 model.	2025-04-15 17:35:05 -04:00
thot experiment	83e839a89b	Native LotusD Implementation (#7125 ) * draft pass at a native comfy implementation of Lotus-D depth and normal est * fix model_sampling kludges * fix ruff --------- Co-authored-by: comfyanonymous <121283862+comfyanonymous@users.noreply.github.com>	2025-03-21 14:04:15 -04:00
comfyanonymous	11f1b41bab	Initial Hunyuan3Dv2 implementation. Supports the multiview, mini, turbo models and VAEs.	2025-03-19 16:52:58 -04:00
comfyanonymous	e1474150de	Support fp8_scaled diffusion models that don't use fp8 matrix mult.	2025-03-07 04:39:21 -05:00
comfyanonymous	93fedd92fe	Support LTXV 0.9.5. Credits: Lightricks team.	2025-03-05 00:13:49 -05:00
comfyanonymous	4ced06b879	WIP support for Wan I2V model.	2025-02-26 01:49:43 -05:00
comfyanonymous	63023011b9	WIP support for Wan t2v model.	2025-02-25 17:20:35 -05:00
maedtb	5715be2ca9	Fix Hunyuan unet config detection for some models. (#6877 ) The change to support 32 channel hunyuan models is missing the `key_prefix` on the key. This addresses a complain in the comments of `acc152b674`.	2025-02-19 07:14:45 -05:00
Jukka Seppänen	acc152b674	Support loading and using SkyReels-V1-Hunyuan-I2V (#6862 ) * Support SkyReels-V1-Hunyuan-I2V * VAE scaling * Fix T2V oops * Proper latent scaling	2025-02-18 17:06:54 -05:00
comfyanonymous	e5ea112a90	Support Lumina 2 model.	2025-02-04 04:16:30 -05:00
comfyanonymous	3aaabb12d4	Implement Cosmos Image/Video to World (Video) diffusion models. Use CosmosImageToVideoLatent to set the input image/video.	2025-01-14 05:14:10 -05:00
comfyanonymous	2ff3104f70	WIP support for Nvidia Cosmos 7B and 14B text to world (video) models.	2025-01-10 09:14:16 -05:00
comfyanonymous	b7572b2f87	Fix and enforce no trailing whitespace.	2024-12-31 03:16:37 -05:00
comfyanonymous	d170292594	Remove some trailing white space.	2024-12-27 18:02:30 -05:00
City	bddb02660c	Add PixArt model support (#6055 ) * PixArt initial version * PixArt Diffusers convert logic * pos_emb and interpolation logic * Reduce duplicate code * Formatting * Use optimized attention * Edit empty token logic * Basic PixArt LoRA support * Fix aspect ratio logic * PixArtAlpha text encode with conds * Use same detection key logic for PixArt diffusers	2024-12-20 15:25:00 -05:00
comfyanonymous	bda1482a27	Basic Hunyuan Video model support.	2024-12-16 19:35:40 -05:00
Chenlei Hu	d9d7f3c619	Lint all unused variables (#5989 ) * Enable F841 * Autofix * Remove all unused variable assignment	2024-12-12 17:59:16 -05:00
comfyanonymous	5e16f1d24b	Support Lightricks LTX-Video model.	2024-11-22 08:46:39 -05:00
comfyanonymous	8f0009aad0	Support new flux model variants.	2024-11-21 08:38:23 -05:00
comfyanonymous	5e29e7a488	Remove scaled_fp8 key after reading it to silence warning.	2024-11-06 04:56:42 -05:00
comfyanonymous	daa1565b93	Fix diffusers flux controlnet regression.	2024-10-30 13:11:34 -04:00
comfyanonymous	09fdb2b269	Support SD3.5 medium diffusers format weights and loras.	2024-10-30 04:24:00 -04:00
comfyanonymous	13b0ff8a6f	Update SD3 code.	2024-10-28 21:58:52 -04:00
comfyanonymous	5cbb01bc2f	Basic Genmo Mochi video model support. To use: "Load CLIP" node with t5xxl + type mochi "Load Diffusion Model" node with the mochi dit file. "Load VAE" with the mochi vae file. EmptyMochiLatentVideo node for the latent. euler + linear_quadratic in the KSampler node.	2024-10-26 06:54:00 -04:00
comfyanonymous	0075c6d096	Mixed precision diffusion models with scaled fp8. This change allows supports for diffusion models where all the linears are scaled fp8 while the other weights are the original precision.	2024-10-21 18:12:51 -04:00
comfyanonymous	a68bbafddb	Support diffusion models with scaled fp8 weights.	2024-10-19 23:47:42 -04:00
Scorpinaus	9465b23432	Added SD15_Inpaint_Diffusers model support for unet_config_from_diffusers_unet function (#4565 )	2024-08-23 03:57:08 -04:00
comfyanonymous	75b9b55b22	Fix issues with #4302 and support loading diffusers format flux.	2024-08-10 21:28:24 -04:00
comfyanonymous	c19dcd362f	Controlnet code refactor.	2024-08-07 12:59:28 -04:00
comfyanonymous	3b71f84b50	ONNX tracing fixes.	2024-08-04 15:45:43 -04:00
comfyanonymous	1589b58d3e	Basic Flux Schnell and Flux Dev model implementation.	2024-08-01 09:49:29 -04:00
comfyanonymous	a5f4292f9f	Basic hunyuan dit implementation. (#4102 ) * Let tokenizers return weights to be stored in the saved checkpoint. * Basic hunyuan dit implementation. * Fix some resolutions not working. * Support hydit checkpoint save. * Init with right dtype. * Switch to optimized attention in pooler. * Fix black images on hunyuan dit.	2024-07-25 18:21:08 -04:00
comfyanonymous	334ba48cea	More generic unet prefix detection code.	2024-07-23 14:13:32 -04:00
comfyanonymous	a3dffc447a	Support AuraFlow Lora and loading model weights in diffusers format. You can load model weights in diffusers format using the UNETLoader node.	2024-07-13 13:51:40 -04:00
comfyanonymous	9f291d75b3	AuraFlow model implementation.	2024-07-11 16:52:26 -04:00
comfyanonymous	5e1fced639	Cleaner support for loading different diffusion model types.	2024-07-11 11:37:31 -04:00
comfyanonymous	f8f7568d03	Basic SD3 controlnet implementation. Still missing the node to properly use it.	2024-06-27 18:43:11 -04:00
comfyanonymous	0d6a57938e	Support loading diffusers SD3 model format with UNETLoader node.	2024-06-19 22:21:18 -04:00
comfyanonymous	bb1969cab7	Initial support for the stable audio open model.	2024-06-15 12:14:56 -04:00
comfyanonymous	8c4a9befa7	SD3 Support.	2024-06-10 14:06:23 -04:00
comfyanonymous	58812ab8ca	Support SDXS 512 model.	2024-04-12 22:12:35 -04:00
comfyanonymous	575acb69e4	IP2P model loading support. This is the code to load the model and inference it with only a text prompt. This commit does not contain the nodes to properly use it with an image input. This supports both the original SD1 instructpix2pix model and the diffusers SDXL one.	2024-03-31 03:10:28 -04:00
comfyanonymous	327ca1313d	Support SDXS 0.9	2024-03-27 23:58:58 -04:00
comfyanonymous	65397ce601	Replace prints with logging and add --verbose argument.	2024-03-10 12:14:23 -04:00
comfyanonymous	b3e97fc714	Koala 700M and 1B support. Use the UNET Loader node to load the unet file to use them.	2024-02-28 12:10:11 -05:00
comfyanonymous	f2d1d16f4f	Support Stable Cascade Stage B lite.	2024-02-16 23:41:23 -05:00
comfyanonymous	f83109f09b	Stable Cascade Stage C.	2024-02-16 10:55:08 -05:00
comfyanonymous	2c4e92a98b	Fix regression.	2024-01-02 14:41:33 -05:00
comfyanonymous	a47f609f90	Auto detect out_channels from model.	2024-01-02 01:50:57 -05:00
comfyanonymous	b454a67bb9	Support segmind vega model.	2023-12-12 19:09:53 -05:00
comfyanonymous	5d6dfce548	Fix importing diffusers unets.	2023-11-24 20:35:29 -05:00
comfyanonymous	871cc20e13	Support SVD img2vid model.	2023-11-23 19:41:33 -05:00
comfyanonymous	107e78b1cb	Add support for loading SSD1B diffusers unet version. Improve diffusers model detection.	2023-11-16 23:12:55 -05:00
comfyanonymous	6ec3f12c6e	Support SSD1B model and make it easier to support asymmetric unets.	2023-10-27 14:45:15 -04:00
comfyanonymous	9a55dadb4c	Refactor code so model can be a dtype other than fp32 or fp16.	2023-10-13 14:41:17 -04:00
comfyanonymous	76cdc809bf	Support more controlnet models.	2023-09-23 18:47:46 -04:00
comfyanonymous	7931ff0fd9	Support SDXL inpaint models.	2023-09-01 15:22:52 -04:00
comfyanonymous	2c97c30256	Support small diffusers controlnet so both types are now supported.	2023-08-16 12:45:56 -04:00
comfyanonymous	53f326a3d8	Support diffusers mini controlnets.	2023-08-16 12:28:01 -04:00
comfyanonymous	585a062910	Print unet config when model isn't detected.	2023-08-13 01:39:48 -04:00
comfyanonymous	78e7958d17	Support controlnet in diffusers format.	2023-07-21 22:58:16 -04:00
comfyanonymous	af7a49916b	Support loading unet files in diffusers format.	2023-07-05 17:38:59 -04:00
comfyanonymous	4376b125eb	Remove useless code.	2023-06-29 00:26:33 -04:00
comfyanonymous	f87ec10a97	Support base SDXL and SDXL refiner models. Large refactor of the model detection and loading code.	2023-06-22 13:03:50 -04:00

149 Commits