rattus
68ef4423a3
Merge 09a98a9539 into 6b61918a16
2026-05-18 23:54:15 -07:00
comfyanonymous
a4382e056e
Use temporal downscale to make empty audio latent nodes more reusable. ( #13975 )
2026-05-19 00:14:30 -04:00
comfyanonymous
990a7ae7f2
Initial work to make downscale_ratio_temporal work. ( #13972 )
2026-05-18 23:01:43 -04:00
Yousef R. Gamaleldin
187e5237e1
Fix BiRefNet issue ( #13966 )
2026-05-19 05:03:22 +08:00
rattus
16f862f02a
implement dynamic clip saving ( #13959 )
...
Fix clip saving by doing the same patching process and diffusion
models.
2026-05-18 11:46:40 -07:00
Rattus
09a98a9539
specify hostbuf max allocation size
...
There a signs of virtual memory exhaustion on some linux systems when
throwing 128GB for every little piece. Pass the actual to save aimdo
from over-estimates
2026-05-18 22:35:41 +10:00
Jukka Seppänen
971c9e3518
HiDream-O1: support area conditioning ( #13944 )
2026-05-18 01:17:05 -04:00
Jukka Seppänen
b39af210d0
Fix Qwen3.5 text generation with multiple input images ( #13943 )
2026-05-18 01:16:42 -04:00
comfyanonymous
f48d2a017e
Log which quant ops are enabled/emulated. ( #13946 )
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
2026-05-17 16:30:54 -04:00
Rattus
c451854f84
pinning updates
2026-05-18 02:30:03 +10:00
drozbay
d3607a8e6d
feat: Add downscaled IC-LoRA support to LTXVAddGuide (CORE-102) ( #13896 )
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
2026-05-16 15:02:57 +08:00
comfyanonymous
5d5a4554e1
Remove useless option and clarify what lowvram does. ( #13922 )
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
2026-05-15 17:59:02 -07:00
Jukka Seppänen
33ce449c8b
Reduce LTX2.3 peak VRAM when guide_mask is in use (CORE-166) ( #13735 )
...
- Reduce peak VRAM by handling self_attn_mask more efficiently
- Fallback to SDPA when self_attention_mask is used
2026-05-16 00:02:27 +03:00
Rattus
43865639a7
mm: delegate pin-on-the-way to aimdo
...
Aimdo is able to chunk and slice this on the way for better CPU->GPU
overlap. The main advantage is the ability to shorten the bus contention
window between previous weight transfer and the next weights vbar
fault.
2026-05-16 03:19:51 +10:00
Rattus
ed15d62a6c
ops: refactor in prep for chunking
2026-05-16 03:19:51 +10:00
Rattus
0242954aaa
model-patcher: force-cast tiny weights
...
Flux 2 gets crazy stalls due to a mix of tiny and giant weights
creating lopsided steam buffer rotations which creates stalls.
2026-05-16 03:19:51 +10:00
Jukka Seppänen
77e2ed5e01
feat: Support MoGe (CORE-168) ( #13878 )
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
2026-05-15 10:34:56 +08:00
Rattus
d8b442709a
make default
2026-05-15 00:37:06 +10:00
Rattus
18a74cb96a
cli_args/execution: Implement lower background cache-ram threshold
...
Limit the amount of RAM background intermediates can use, so that
switching workflows doesn't degrade performance too much.
2026-05-15 00:36:35 +10:00
Rattus
31150538b0
implement pin registration swaps
...
Uncap the windows pins from 50% by extending the pool and have a pressure
mechanism to move the pin reservations om demand.
This unfortunately implies a GPU sync to do the freeing so significant
hysterisis needs to be added to consolidate these pressure events.
2026-05-15 00:35:23 +10:00
Talmaj
74c17a25e5
Fix void failing with RuntimeError: start (0) + length (464) exceeds dimension size (461). ( #13873 )
2026-05-13 12:37:30 -07:00
Rattus
3f717816e1
execution: implement pin eviction on RAM presure
...
Add back proper pin freeing on RAM pressure
2026-05-13 22:23:54 +10:00
Rattus
d61026d020
pins: implement freeing intermediate for pinned memory
...
Pinning is more important than inactive intermediates and the stream
pin buffer is more important than even active intermediates.
2026-05-13 22:23:54 +10:00
Rattus
ee927aafa8
ops: sync the CPU with only the offload stream activity
...
This was syncing with the offload stream which itself is synced with the
compute stream, so this was syncing CPU with compute transitively. Define
the event to sync it more gently.
2026-05-13 22:23:54 +10:00
comfyanonymous
2bd65f2091
Better Hidream O1 mem usage factor for non dynamic vram. ( #13864 )
2026-05-12 20:55:38 -07:00
comfyanonymous
0155ddcbe3
Fix dtype issue with hidream o1 ( #13849 )
2026-05-11 20:53:13 -07:00
Jukka Seppänen
8e53f001a4
feat: Support HiDream-O1-Image (CORE-187) ( #13817 )
...
* Initial HiDream01-image support
* Cleanup nodes
* Cleaner handling of empty placeholder models
* Remove snap_to_predefined, prefer tooltip for the trained resolutions
* Add model and block wrappers
* Fix shift tooltip
* Add node to work around the patch tile issue
Experimental, runs multiple passes with the patch grid offset and blends with various different methods.
* Qwen35 vision rotary_pos_emb cast fix
* Fix embedding layout type
* Some small optimizations
* Cleanup, don't need this fallback
* Prefix KV cache, cleanup
Bit of speed, reduce redundant code
* Get rid of redundant custom sampler, refactor noise scaling
Our existing lcm sampler is mathematically same, just added the missing options to it instead and a node to control them. Refactored the noise scaling and fix it for the stochastic samplers, add a generic node to control the initial noise scale.
* Update nodes_hidream_o1.py
* Fix some cache validation cases
* Keep existing sampling params
* Remove redundant video vision path
* Replace some numpy ops with torch
* Fx RoPE index for batch size > 1
* Prefer torch preprocessing
* Rename block_type to be compatible with existing patch nodes
* Fixes and tweaks
2026-05-11 20:35:53 -07:00
comfyanonymous
0a7d2ffd68
Support anima TE lora kohya format. ( #13847 )
2026-05-11 20:01:52 -07:00
Rattus
44c0a0602b
ops: remove unused arg
...
This was defeatured in aimdo iteration
2026-05-12 11:04:59 +10:00
Rattus
3a3b75a7e3
implement pinned loras
2026-05-12 11:04:59 +10:00
Rattus
e48dace145
prepare for multiple pin sets
2026-05-12 11:04:59 +10:00
Rattus
8e473d756f
lora: re-implement as inplace swiss-army-knife operation
2026-05-12 11:04:59 +10:00
Rattus
2b927e1783
LowVRAMPatch: change to two-phase visit
2026-05-12 11:04:59 +10:00
Rattus
8187cd783e
Implement JIT pinned memory pressure
...
Replace the predictive pin pressure mechanism with JIT PIN memory
pressure.
2026-05-12 11:04:59 +10:00
Rattus
17955235b2
remove old pin path
2026-05-12 11:04:59 +10:00
Rattus
8070cb7780
Add stream host pin buffer for AIMDO casts
...
Introduce per-offload-stream HostBuffer reuse for pinned staging,
include it in cast buffer reset synchronization.
Defer actual casts that go via this pin path to a separate pass
such that the buffer can be allocated monolithically (to avoid
cudaHostRegister thrash).
2026-05-12 11:04:59 +10:00
Rattus
b66b642068
mm: use aimdo to do transfer from disk to pin
...
Aimdo implements a faster threaded loader.
2026-05-12 11:04:59 +10:00
Rattus
157965a1c9
pinned_memory: implement with aimdo growable buffer
...
Use a single growable buffer so we can do threaded pre-warming on
pinned memory.
2026-05-12 11:04:59 +10:00
Rattus
1fe3a13f84
model_management: disable non-dynamic smart memory
...
Disable smart memory outright for non dynamic models.
This is a minor step towards deprecation of --disable-dynamic-vram
and the legacy ModelPatcher.
This is needed for estimate-free model development, where new models
can opt-out of supplying a memory estimate and not have to worry
about hard VRAM allocations due to legacy non-dynamic model patchers
This is also a general stability increase for a lot of stray use cases
where estimates may still be off and going forward we are not going
to accurately maintain such estimates.
2026-05-12 11:04:59 +10:00
rattus
20e439419c
model_patcher: Fix safetensors saving of fp8 ( #13835 )
...
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
This was missing proper weight scale casting in the saving path.
2026-05-11 12:48:10 -07:00
box4wangjing
f505cb4070
chore: remove extra word in comment ( #13826 )
2026-05-11 11:05:09 +08:00
Jukka Seppänen
3200f28e3a
Support Wan-Dancer ( #13813 )
...
* initial WanDancer support
* nodes_wandancer: Add list form of chunker.
Create an alternate list form of the node so the chunk gens can be
trivially looped by the comfy executor.
* Closer match to original soxr resampling
* Remove librosa node
* Cleanup
---------
Co-authored-by: Rattus <rattus128@gmail.com>
2026-05-09 14:02:56 -07:00
comfyanonymous
66669b2ded
I don't think there was any because nobody complained. ( #13807 )
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
2026-05-08 17:32:14 -07:00
Alexis Rolland
c5ecd231a2
fix: Fix bug when mask not on same device (CORE-181) ( #13801 )
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
2026-05-08 23:06:29 +08:00
Yousef R. Gamaleldin
d3c18c1636
Add support for BiRefNet background remove model (CORE-46) ( #12747 )
2026-05-08 17:59:24 +08:00
omahs
bac6fc35fb
Fix typos ( #10986 )
2026-05-08 17:14:45 +08:00
Talmaj
ef8f25601a
Add I2V for causal forcing model. ( #13719 )
2026-05-07 18:38:36 -07:00
Jukka Seppänen
8dc3f3f209
Improve SAM3 large input handling ( #13767 )
2026-05-07 17:18:28 -07:00
Jukka Seppänen
cd8c7a2306
Throttle dynamic VRAM prepare logging ( #13704 )
2026-05-07 10:41:13 +08:00
Talmaj
78b3096bf3
Void model - pass 1 & 2 (CORE-38) ( #13403 )
2026-05-05 19:59:04 -07:00