EasyAI代码托管平台

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-05-09 00:32:31 +08:00

Author	SHA1	Message	Date
kiruno	d72908617c	Merge branch 'master' into fix/mps-Float8_e4m3fn	2026-03-18 16:13:34 +08:00
Jukka Seppänen	1c5db7397d	feat: Support mxfp8 (#12907 )	2026-03-14 18:36:29 -04:00
Kiruno-lz	1fc89ccdd9	fix: 添加MPS设备对float8数据类型的兼容性处理在MPS设备上，PyTorch不支持直接创建或转换float8数据类型。这会导致在尝试将float8张量移至MPS设备时出现运行时错误。 - 在 `stochastic_rounding` 函数中添加递归回退到CPU的逻辑，当检测到MPS设备且目标类型为float8时。 - 在 `cast_to` 函数中添加针对MPS设备的特殊处理，确保float8张量在移动到MPS前先转换为float16。 - 测试了两个workflow：templates-6-key-frames和image_z_image_turbo 相比https://github.com/Comfy-Org/ComfyUI/pull/12378 更具有维护性	2026-02-11 19:25:56 +08:00
comfyanonymous	6165c38cb5	Optimize nvfp4 lora applying. (#11866 ) Some checks are pending Python Linting / Run Ruff (push) Waiting to run Details Python Linting / Run Pylint (push) Waiting to run Details Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run Details Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run Details Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run Details Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run Details Execution Tests / test (macos-latest) (push) Waiting to run Details Execution Tests / test (ubuntu-latest) (push) Waiting to run Details Execution Tests / test (windows-latest) (push) Waiting to run Details Test server launches without errors / test (push) Waiting to run Details Unit Tests / test (macos-latest) (push) Waiting to run Details Unit Tests / test (ubuntu-latest) (push) Waiting to run Details Unit Tests / test (windows-2022) (push) Waiting to run Details This changes results a bit but it also speeds up things a lot.	2026-01-14 00:49:38 -05:00
comfyanonymous	eff2b9d412	Optimize nvfp4 lora applying. (#11856 )	2026-01-13 19:37:19 -05:00
comfyanonymous	15b312de7a	Optimize nvfp4 lora applying. (#11854 )	2026-01-13 19:23:58 -05:00
comfyanonymous	117e7a5853	Refactor to try to lower mem usage. (#11840 )	2026-01-12 21:01:52 -08:00
comfyanonymous	b3c0e4de57	Make loras work on nvfp4 models. (#11837 ) The initial applying is a bit slow but will probably be sped up in the future.	2026-01-12 22:33:54 -05:00
comfyanonymous	73e3a9e676	Clamp output when rounding weight to prevent Nan.	2024-10-19 19:07:10 -04:00
comfyanonymous	7d2467e830	Some minor cleanups.	2024-10-05 13:22:39 -04:00
comfyanonymous	00a5d08103	Lower fp8 lora memory usage.	2024-09-03 01:25:05 -04:00
comfyanonymous	2ca8f6e23d	Make the stochastic fp8 rounding reproducible.	2024-08-26 15:12:06 -04:00
comfyanonymous	7985ff88b9	Use less memory in float8 lora patching by doing calculations in fp16.	2024-08-26 14:45:58 -04:00
comfyanonymous	4506ddc86a	Better subnormal fp8 stochastic rounding. Thanks Ashen.	2024-08-19 13:38:03 -04:00
comfyanonymous	22ec02afc0	Handle subnormal numbers in float8 rounding.	2024-08-19 05:51:08 -04:00
comfyanonymous	bb222ceddb	Fix loras having a weak effect when applied on fp8.	2024-08-17 15:20:17 -04:00

16 Commits