EasyAI Code Hosting Platform

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-02-11 22:12:33 +08:00

Author	SHA1	Message	Date
Sasbom	0ef5557d6a	Add QOL feature for changing the custom nodes folder location through cli args. bugfix: fix typo in apply_directory for custom_nodes_directory allow for PATH style ';' delimited custom_node directories. change delimiter type for separate folders per platform. feat(API-nodes): move Rodin3D nodes to new client; removed old api client.py (#10645) Fix qwen controlnet regression. (#10657) Enable pinned memory by default on Nvidia. (#10656) Removed the --fast pinned_memory flag. You can use --disable-pinned-memory to disable it. Please report if it causes any issues. Pinned mem also seems to work on AMD. (#10658) Remove environment variable. Removed environment variable fallback for custom nodes directory. Update documentation for custom nodes directory Clarified documentation on custom nodes directory argument, removed documentation on environment variable Clarify release cycle. (#10667) Tell users they need to upload their logs in bug reports. (#10671) mm: guard against double pin and unpin explicitly (#10672) As commented, if you let cuda be the one to detect double pin/unpinning it actually creates an async GPU error. Only unpin tensor if it was pinned by ComfyUI (#10677) Make ScaleROPE node work on Flux. (#10686) Add logging for model unloading. (#10692) Unload weights if vram usage goes up between runs. (#10690) ops: Put weight cast on the offload stream (#10697) This needs to be on the offload stream. This reproduced a black screen with low resolution images on a slow bus when using FP8. Update CI workflow to remove dead macOS runner. (#10704) * Update CI workflow to remove dead macOS runner. * revert * revert Don't pin tensor if not a torch.nn.parameter.Parameter (#10718) Update README.md for Intel Arc GPU installation, remove IPEX (#10729) IPEX is no longer needed for Intel Arc GPUs. Removing instruction to setup ipex. 
mm/mp: always unload re-used but modified models (#10724) The partial unloader path in model re-use flow skips straight to the actual unload without any check of the patching UUID. This means that if you do an upscale flow with a model patch on an existing model, it will not apply your patchings. Fix by delaying the partial_unload until after the uuid checks. This is done by making partial_unload a model of partial_load where extra_mem is -ve. qwen: reduce VRAM usage (#10725) Clean up a bunch of stacked and no-longer-needed tensors on the QWEN VRAM peak (currently FFN). With this I go from OOMing at B=37x1328x1328 to being able to successfully run B=47 (RTX5090). Update Python 3.14 compatibility notes in README (#10730) Quantized Ops fixes (#10715) * offload support, bug fixes, remove mixins * add readme add PR template for API-Nodes (#10736) feat: add create_time dict to prompt field in /history and /queue (#10741) flux: reduce VRAM usage (#10737) Clean up a bunch of stack tensors on Flux. This takes me from B=19 to B=22 for 1600x1600 on RTX5090. Better instructions for the portable. (#10743) Use same code for chroma and flux blocks so that optimizations are shared. (#10746) Fix custom nodes import error. (#10747) This should fix the import errors but will break if the custom nodes actually try to use the class. revert import reordering revert imports pt 2 Add left padding support to tokenizers. (#10753) chore(api-nodes): mark OpenAIDalle2 and OpenAIDalle3 nodes as deprecated (#10757) Revert "chore(api-nodes): mark OpenAIDalle2 and OpenAIDalle3 nodes as deprecated (#10757)" (#10759) This reverts commit `9a02382568`. Change ROCm nightly install command to 7.1 (#10764)	2025-11-17 06:16:21 +01:00
comfyanonymous	af4b7b5edb	More fp8 torch.compile regressions fixed. (#10625)	2025-11-03 22:14:20 -05:00
comfyanonymous	6b88478f9f	Bring back fp8 torch compile performance to what it should be. (#10622)	2025-11-03 19:22:10 -05:00
comfyanonymous	e199c8cc67	Fixes (#10621)	2025-11-03 17:58:24 -05:00
comfyanonymous	958a17199a	People should update their pytorch versions. (#10618)	2025-11-03 17:08:30 -05:00
comfyanonymous	c58c13b2ba	Fix torch compile regression on fp8 ops. (#10580)	2025-11-01 00:25:17 -04:00
comfyanonymous	906c089957	Fix small performance regression with fp8 fast and scaled fp8. (#10537)	2025-10-29 19:29:01 -04:00
comfyanonymous	1a58087ac2	Reduce memory usage for fp8 scaled op. (#10531)	2025-10-29 15:43:51 -04:00
contentis	8817f8fc14	Mixed Precision Quantization System (#10498) * Implement mixed precision operations with a registry design and metadata for quant spec in checkpoint. * Updated design using Tensor Subclasses * Fix FP8 MM * An actually functional POC * Remove CK reference and ensure correct compute dtype * Update unit tests * ruff lint * Implement mixed precision operations with a registry design and metadata for quant spec in checkpoint. * Updated design using Tensor Subclasses * Fix FP8 MM * An actually functional POC * Remove CK reference and ensure correct compute dtype * Update unit tests * ruff lint * Fix missing keys * Rename quant dtype parameter * Rename quant dtype parameter * Fix unittests for CPU build	2025-10-28 16:20:53 -04:00

9 Commits
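
Commit 0ef5557d6a mentions accepting PATH-style delimited custom node directories with a per-platform delimiter. The log does not show the actual implementation or the argument name, so the following is only a minimal sketch of that idea; `split_custom_node_dirs` is a hypothetical helper, and the only real API it relies on is Python's `os.pathsep` (`;` on Windows, `:` on POSIX).

```python
import os


def split_custom_node_dirs(raw, sep=os.pathsep):
    """Hypothetical helper: split a PATH-style string into directory entries.

    `sep` defaults to the platform's path-list separator, which is
    ';' on Windows and ':' on POSIX systems. Empty entries are dropped.
    """
    return [part.strip() for part in raw.split(sep) if part.strip()]


if __name__ == "__main__":
    # Build the example value with the platform separator so it runs anywhere.
    value = os.pathsep.join(["custom_nodes", "extra_nodes"])
    print(split_custom_node_dirs(value))
```

Defaulting to `os.pathsep` rather than hard-coding `;` keeps Windows drive letters such as `C:\nodes` intact on Windows while still using the conventional `:` on Linux and macOS.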