EasyAI代码托管平台

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-05-27 09:27:24 +08:00

Author	SHA1	Message	Date
Jedrzej Kosinski	f663018950	multigpu: trim _force_supported_compute_dtype docstring to one line Amp-Thread-ID: https://ampcode.com/threads/T-019e61db-ffb1-73a6-b2a8-3d23d7b05792 Co-authored-by: Amp <amp@ampcode.com>	2026-05-25 19:58:36 -07:00
Jedrzej Kosinski	8969bbbf02	multigpu: shorten _force_supported_compute_dtype docstring Amp-Thread-ID: https://ampcode.com/threads/T-019e61db-ffb1-73a6-b2a8-3d23d7b05792 Co-authored-by: Amp <amp@ampcode.com>	2026-05-25 19:56:07 -07:00
Jedrzej Kosinski	4ca4d39076	multigpu: use unet_manual_cast for SelectModelDevice compute dtype Replace the hardcoded `_force_fp32_cpu_compute` helper with`_force_supported_compute_dtype`, which delegates to`comfy.model_management.unet_manual_cast(weight_dtype, device)`. The interrogator already encodes per-device dtype support (CPU returns False for fp16/bf16, older GPUs may not support bf16, pre-14 MPS doesn't support bf16, etc.) and returns None when no cast is needed.For SelectModelDevice -> CPU on an fp16/bf16 model, behavior is unchanged: `unet_manual_cast` returns `torch.float32` and `set_model_compute_dtype` casts at use without touching peak memory. As a bonus the same code path now handles other `weight_dtype not supported on device` cases (e.g. bf16 weights on pre-Ampere NVIDIA, bf16 on pre-macOS-14 MPS) without growing the code surface, so the call site no longer needs the `if resolved.type == 'cpu':` gate. Amp-Thread-ID: https://ampcode.com/threads/T-019e61db-ffb1-73a6-b2a8-3d23d7b05792 Co-authored-by: Amp <amp@ampcode.com>	2026-05-25 19:54:22 -07:00
Jedrzej Kosinski	0a2dd86e78	MultiGPU Work Units For Accelerated Sampling (CORE-184) (#7063 )	2026-05-25 18:26:40 -07:00

4 Commits