kiruno
|
d72908617c
|
Merge branch 'master' into fix/mps-Float8_e4m3fn
|
2026-03-18 16:13:34 +08:00 |
|
Jukka Seppänen
|
1c5db7397d
|
feat: Support mxfp8 (#12907)
|
2026-03-14 18:36:29 -04:00 |
|
Kiruno-lz
|
1fc89ccdd9
|
fix: 添加MPS设备对float8数据类型的兼容性处理
在MPS设备上,PyTorch不支持直接创建或转换float8数据类型。这会导致在尝试将float8张量移至MPS设备时出现运行时错误。
- 在 `stochastic_rounding` 函数中添加递归回退到CPU的逻辑,当检测到MPS设备且目标类型为float8时。
- 在 `cast_to` 函数中添加针对MPS设备的特殊处理,确保float8张量在移动到MPS前先转换为float16。
- 测试了两个workflow:templates-6-key-frames和image_z_image_turbo
相比https://github.com/Comfy-Org/ComfyUI/pull/12378 更具有维护性
|
2026-02-11 19:25:56 +08:00 |
|
comfyanonymous
|
6165c38cb5
|
Optimize nvfp4 lora applying. (#11866)
Python Linting / Run Ruff (push) Waiting to run
Python Linting / Run Pylint (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.10, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.11, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-stable (12.1, , linux, 3.12, [self-hosted Linux], stable) (push) Waiting to run
Full Comfy CI Workflow Runs / test-unix-nightly (12.1, , linux, 3.11, [self-hosted Linux], nightly) (push) Waiting to run
Execution Tests / test (macos-latest) (push) Waiting to run
Execution Tests / test (ubuntu-latest) (push) Waiting to run
Execution Tests / test (windows-latest) (push) Waiting to run
Test server launches without errors / test (push) Waiting to run
Unit Tests / test (macos-latest) (push) Waiting to run
Unit Tests / test (ubuntu-latest) (push) Waiting to run
Unit Tests / test (windows-2022) (push) Waiting to run
This changes results a bit but it also speeds up things a lot.
|
2026-01-14 00:49:38 -05:00 |
|
comfyanonymous
|
eff2b9d412
|
Optimize nvfp4 lora applying. (#11856)
|
2026-01-13 19:37:19 -05:00 |
|
comfyanonymous
|
15b312de7a
|
Optimize nvfp4 lora applying. (#11854)
|
2026-01-13 19:23:58 -05:00 |
|
comfyanonymous
|
117e7a5853
|
Refactor to try to lower mem usage. (#11840)
|
2026-01-12 21:01:52 -08:00 |
|
comfyanonymous
|
b3c0e4de57
|
Make loras work on nvfp4 models. (#11837)
The initial applying is a bit slow but will probably be sped up in the
future.
|
2026-01-12 22:33:54 -05:00 |
|
comfyanonymous
|
73e3a9e676
|
Clamp output when rounding weight to prevent Nan.
|
2024-10-19 19:07:10 -04:00 |
|
comfyanonymous
|
7d2467e830
|
Some minor cleanups.
|
2024-10-05 13:22:39 -04:00 |
|
comfyanonymous
|
00a5d08103
|
Lower fp8 lora memory usage.
|
2024-09-03 01:25:05 -04:00 |
|
comfyanonymous
|
2ca8f6e23d
|
Make the stochastic fp8 rounding reproducible.
|
2024-08-26 15:12:06 -04:00 |
|
comfyanonymous
|
7985ff88b9
|
Use less memory in float8 lora patching by doing calculations in fp16.
|
2024-08-26 14:45:58 -04:00 |
|
comfyanonymous
|
4506ddc86a
|
Better subnormal fp8 stochastic rounding. Thanks Ashen.
|
2024-08-19 13:38:03 -04:00 |
|
comfyanonymous
|
22ec02afc0
|
Handle subnormal numbers in float8 rounding.
|
2024-08-19 05:51:08 -04:00 |
|
comfyanonymous
|
bb222ceddb
|
Fix loras having a weak effect when applied on fp8.
|
2024-08-17 15:20:17 -04:00 |
|