EasyAI代码托管平台

mirror of https://github.com/comfyanonymous/ComfyUI.git synced 2026-03-10 03:37:33 +08:00

History

lspindler c2afda16ed Add ptq readme		2025-10-28 08:35:48 +01:00
..
configs	WIP PTQ tool	2025-10-28 08:26:02 +01:00
data	WIP PTQ tool	2025-10-28 08:26:02 +01:00
dataset	WIP PTQ tool	2025-10-28 08:26:02 +01:00
models	WIP PTQ tool	2025-10-28 08:26:02 +01:00
checkpoint_merger.py	WIP PTQ tool	2025-10-28 08:26:02 +01:00
example.yml	WIP PTQ tool	2025-10-28 08:26:02 +01:00
quantize.py	WIP PTQ tool	2025-10-28 08:26:02 +01:00
quantizer.py	WIP PTQ tool	2025-10-28 08:26:02 +01:00
README.md	Add ptq readme	2025-10-28 08:35:48 +01:00
utils.py	WIP PTQ tool	2025-10-28 08:26:02 +01:00

README.md

Create PTQ Artefact / Run Calibration

Only needs to be done once per model

python -m tools.ptq.quantize \
    --model_type flux_schnell 
    --unet_path <path>/flux1-schnell.safetensors 
    --clip_path clip_l.safetensors 
    --t5_path t5xxl_fp16.safetensors  
    --output flux_schnell_debug.json 
    --calib_steps 16

Create Quantized Checkpoint

Uses the artefact from before + checkpoint to generate a quantized checkpoint based of a yml configuration. This file defines which layers to quantize and what dtype they should use using regex. See tools/ptq/configs/ for examples.

# config.yml
disable_list: ["*img_in*", "*final_layer*", "*norm*"]  # Keep these in BF16
per_layer_dtype: {"*": "float8_e4m3fn"}                # Everything else to FP8

python -m tools.ptq.checkpoint_merger 
    --artefact flux_dev_debug.json 
    --checkpoint <path>/flux1-dev.safetensors 
    --config tools/ptq/configs/flux_nvfp4.yml 
    --output <out_path>/flux1-nvfp4.safetensors
    --debug