TensorRT-LLMs/tensorrt_llm/quantization
2024-11-01 19:48:44 +08:00
..
__init__.py TensorRT-LLM v0.12 Update (#2164) 2024-08-29 17:25:07 +08:00
functional.py TensorRT-LLM v0.13 Update (#2269) 2024-09-30 16:20:23 +08:00
layers.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
mode.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
quantize_by_modelopt.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00
quantize.py Update TensorRT-LLM v0.14.0 (#2401) 2024-11-01 19:48:44 +08:00