Yu Li
|
5ba2d28b7f
|
add block-wise scaled int8 quantization based on QuantizedLayout mechanism
add more tests by comparing with manual torch implementation
add perf benchmarks
fix errors caused by merging
default no output quant
fix unittest
|
2025-12-10 12:23:05 -06:00 |
|