EasyAI代码托管平台

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-07-17 11:58:21 +08:00

History

Jedrzej Kosinski 94bcb5701e Some checks failed Python Linting / Run Ruff (push) Has been cancelled Details Python Linting / Run Pylint (push) Has been cancelled Details Cube3D: reuse shared Flux RoPE (comfy-kitchen optimized kernel) Replace cube's bespoke complex-number RoPE (torch.polar / view_as_complex) with ComfyUI's shared Flux rotary embedding (comfy.ldm.flux.math): * precompute_freqs_cis now returns Flux's real rotation freqs via rope(). * apply_rotary_emb applies them via apply_rope1, which at inference dispatches to comfy-kitchen's optimized apply_rope kernel (comfy.quant_ops.ck). q and k are still rotated separately to preserve the decode-time position asymmetry. The pairing convention (adjacent dims) and rotation math are identical, so token outputs are unchanged. The only numerical difference is that rope() computes the rotation angles in fp64 before casting to fp32 (cube's original used fp32), so output now matches upstream to fp32 rounding (~1e-6 on rotated q/k in a standalone check) rather than bit-for-bit. Greedy argmax token selection is unaffected. Deviation note: this is a deliberate, documented divergence from a strict upstream port, taken to gain the shared optimized kernel. Needs GPU parity re-validation on the 2x4090 box (kosin-X570-AORUS-ULTRA) before merge. Co-authored-by: Amp <amp@ampcode.com> Amp-Thread-ID: https://ampcode.com/threads/T-019f013b-5892-71b9-af6b-c2ef28c67d2b		2026-06-25 18:15:15 -07:00
..
audio_encoders
background_removal	Some cast/dtype fixes for the birefnet and dino3 models. (#14217 )	2026-06-01 14:35:26 -07:00
cldm
comfy_types	Remove useless annotations imports. (#14105 )	2026-05-25 19:23:29 -07:00
extra_samplers
image_encoders	Depth anything 3 (Core-135) (#13853 )	2026-06-10 09:28:24 +08:00
k_diffusion	Cube3D: use channels-first 1D latent (B,1,L) like Hunyuan3Dv2	2026-06-14 23:14:17 -07:00
ldm	Cube3D: reuse shared Flux RoPE (comfy-kitchen optimized kernel)	2026-06-25 18:15:15 -07:00
sd1_tokenizer
t2i_adapter
taesd	Add high quality preview support for Flux2 latents (#13496 )	2026-04-29 19:37:30 -04:00
text_encoders	Allow custom templates with Ideogram4 TE (#14374 )	2026-06-09 21:11:05 +08:00
weight_adapter
bg_removal_model.py	Fix background removal mask output shape (#14171 )	2026-05-29 09:14:32 -07:00
cli_args.py	add --high-ram option (#14437 )	2026-06-12 07:53:33 -07:00
clip_config_bigg.json
clip_model.py
clip_vision_config_g.json
clip_vision_config_h.json
clip_vision_config_vitl_336_llava.json
clip_vision_config_vitl_336.json
clip_vision_config_vitl.json
clip_vision_siglip2_base_naflex.json
clip_vision_siglip_384.json
clip_vision_siglip_512.json
clip_vision.py	Some cast/dtype fixes for the birefnet and dino3 models. (#14217 )	2026-06-01 14:35:26 -07:00
conds.py
context_windows.py	feat: Context windows - add causal_window_fix to improve blending of context windows (CORE-100) (#13563 )	2026-05-05 16:40:53 -07:00
controlnet.py	MultiGPU Work Units For Accelerated Sampling (CORE-184) (#7063 )	2026-05-25 18:26:40 -07:00
deploy_environment.py	Add deploy environment header (Comfy-Env) to partner node API calls (#13425 )	2026-05-04 20:17:56 -07:00
diffusers_convert.py
diffusers_load.py
float.py	float: use CK stochastic rounding cuda kernel (#13971 )	2026-05-28 19:23:42 -07:00
gligen.py
hooks.py	Fix typos (#10986 )	2026-05-08 17:14:45 +08:00
latent_formats.py	Cube3D: use channels-first 1D latent (B,1,L) like Hunyuan3Dv2	2026-06-14 23:14:17 -07:00
lora_convert.py
lora.py	Add LoRA key mapping for LTXV/LTXAV models (#14349 )	2026-06-09 09:57:58 -04:00
memory_management.py	Threaded Loader performance fixes / improvements (+ Aimdo 0.4.6) (#14116 )	2026-05-30 15:20:04 -04:00
model_base.py	Add native Roblox Cube3D text-to-3D support	2026-06-14 20:21:37 -07:00
model_detection.py	Cube3D: document convention deviations + drop unused VAE flag (review aid)	2026-06-14 23:58:14 -07:00
model_management.py	add --high-ram option (#14437 )	2026-06-12 07:53:33 -07:00
model_patcher.py	[Trainer/bug] Ensure model is not inference mode (CORE-72) (#13400 )	2026-06-09 23:07:47 -04:00
model_prefetch.py	Threaded Loader performance fixes / improvements (+ Aimdo 0.4.6) (#14116 )	2026-05-30 15:20:04 -04:00
model_sampling.py	feat: Support HiDream-O1-Image (CORE-187) (#13817 )	2026-05-11 20:35:53 -07:00
multigpu.py	fix (MultiGPU): prevent freeze on manual abort when using MultiGPU CFG Split (#14235 )	2026-06-02 10:05:24 -07:00
nested_tensor.py
ops.py	add --high-ram option (#14437 )	2026-06-12 07:53:33 -07:00
options.py
patcher_extension.py	Remove useless annotations imports. (#14105 )	2026-05-25 19:23:29 -07:00
pinned_memory.py	Fix interoperation with external source of pinned memory pressure (#14252 )	2026-06-05 08:39:35 -07:00
pixel_space_convert.py
quant_ops.py	Enable triton comfy kitchen via cli-arg (#12730 )	2026-05-03 14:07:21 -04:00
rmsnorm.py	feat: Gemma4 text generation support (CORE-30) (#13376 )	2026-05-02 22:46:15 -04:00
sample.py	Revert "Add SeedVR2 support (CORE-6) (#14110 )" (#14359 )	2026-06-08 18:00:20 -04:00
sampler_helpers.py	MultiGPU Work Units For Accelerated Sampling (CORE-184) (#7063 )	2026-05-25 18:26:40 -07:00
samplers.py	fix(multigpu): replace hardcoded torch.cuda.set_device with device-agnostic set_torch_device (#14191 )	2026-05-30 21:18:42 -04:00
sd1_clip_config.json
sd1_clip.py	feat: Support Qwen3.5 text generation models (#12771 )	2026-03-25 22:48:28 -04:00
sd.py	Cube3D: document convention deviations + drop unused VAE flag (review aid)	2026-06-14 23:58:14 -07:00
sdxl_clip.py
supported_models_base.py	Revert "Add SeedVR2 support (CORE-6) (#14110 )" (#14359 )	2026-06-08 18:00:20 -04:00
supported_models.py	Cube3D: document convention deviations + drop unused VAE flag (review aid)	2026-06-14 23:58:14 -07:00
utils.py	Threaded Loader performance fixes / improvements (+ Aimdo 0.4.6) (#14116 )	2026-05-30 15:20:04 -04:00