TensorRT-LLMs/tensorrt_llm/quantization
Kaiyu Xie 75057cd036
Update TensorRT-LLM (#2333)
* Update TensorRT-LLM

---------

Co-authored-by: Puneesh Khanna <puneesh.khanna@tii.ae>
Co-authored-by: Ethan Zhang <26497102+ethnzhng@users.noreply.github.com>
2024-10-15 15:28:40 +08:00
__init__.py              Update TensorRT-LLM (#2110)  2024-08-13 22:34:33 +08:00
functional.py            Update TensorRT-LLM (#2333)  2024-10-15 15:28:40 +08:00
layers.py                Update TensorRT-LLM (#2333)  2024-10-15 15:28:40 +08:00
mode.py                  Update TensorRT-LLM (#2333)  2024-10-15 15:28:40 +08:00
quantize_by_modelopt.py  Update TensorRT-LLM (#2333)  2024-10-15 15:28:40 +08:00
quantize.py              Update TensorRT-LLM (#2333)  2024-10-15 15:28:40 +08:00