TensorRT-LLMs/tensorrt_llm/quantization
石晓伟 8f91cff22e
TensorRT-LLM Release 0.15.0 (#2529)
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2024-12-04 13:44:56 +08:00
__init__.py              TensorRT-LLM v0.12 Update (#2164)    2024-08-29 17:25:07 +08:00
functional.py            TensorRT-LLM Release 0.15.0 (#2529)  2024-12-04 13:44:56 +08:00
layers.py                TensorRT-LLM Release 0.15.0 (#2529)  2024-12-04 13:44:56 +08:00
mode.py                  TensorRT-LLM Release 0.15.0 (#2529)  2024-12-04 13:44:56 +08:00
quantize_by_modelopt.py  TensorRT-LLM Release 0.15.0 (#2529)  2024-12-04 13:44:56 +08:00
quantize.py              TensorRT-LLM Release 0.15.0 (#2529)  2024-12-04 13:44:56 +08:00