ComfyUI/comfy/ldm/hunyuan_image_3
Yousef Rafat 7b4c1e8031 async cache revamp
Added an async loading and offloading of moe layers, having consistent memory with oom errors.
Used to give oom error after the third layer with 24 giga bytes gpu, now goes to the end with consistent memory with minimal latency
2025-11-14 09:15:16 +02:00
..
model.py async cache revamp 2025-11-14 09:15:16 +02:00