TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Quanfeng Li 839aad4d6e fix: Add missing parameter for WeightOnlyQuantRowLinear module (#2768 ) Signed-off-by: Quanfeng Li <liquanfeng7@foxmail.com>		2025-03-31 16:20:30 +08:00
..
utils	Update TensorRT-LLM (#2873 )	2025-03-11 21:13:42 +08:00
__init__.py	Update TensorRT-LLM (#2792 )	2025-02-18 21:27:39 +08:00
functional.py	Update TensorRT-LLM (#2792 )	2025-02-18 21:27:39 +08:00
image_processing.py	Update TensorRT-LLM (#2582 )	2024-12-16 21:50:47 -08:00
layers.py	fix: Add missing parameter for WeightOnlyQuantRowLinear module (#2768 )	2025-03-31 16:20:30 +08:00
mode.py	Update TensorRT-LLM (#2792 )	2025-02-18 21:27:39 +08:00
quantize_by_modelopt.py	fix: fix for cp > kvHeadNum (#3002 )	2025-03-26 12:39:02 +08:00
quantize.py	Update (#2978 )	2025-03-23 16:39:35 +08:00