TensorRT-LLMs/tensorrt_llm/quantization
Quanfeng Li 839aad4d6e
fix: Add missing parameter for WeightOnlyQuantRowLinear module (#2768)
Signed-off-by: Quanfeng Li <liquanfeng7@foxmail.com>
2025-03-31 16:20:30 +08:00
..
utils Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
__init__.py Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00
functional.py Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00
image_processing.py Update TensorRT-LLM (#2582) 2024-12-16 21:50:47 -08:00
layers.py fix: Add missing parameter for WeightOnlyQuantRowLinear module (#2768) 2025-03-31 16:20:30 +08:00
mode.py Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00
quantize_by_modelopt.py fix: fix for cp > kvHeadNum (#3002) 2025-03-26 12:39:02 +08:00
quantize.py Update (#2978) 2025-03-23 16:39:35 +08:00