TensorRT-LLMs/tensorrt_llm/quantization
2024-04-01 16:39:43 +08:00
__init__.py          Update TensorRT-LLM (#1358)  2024-03-26 20:47:14 +08:00
functional.py        Update TensorRT-LLM (#1098)  2024-02-18 15:48:08 +08:00
layers.py            Update TensorRT-LLM (#1358)  2024-03-26 20:47:14 +08:00
mode.py              Update TensorRT-LLM (#1358)  2024-03-26 20:47:14 +08:00
quantize_by_ammo.py  Update TensorRT-LLM (#1387)  2024-04-01 16:39:43 +08:00
quantize.py          Update TensorRT-LLM (#1358)  2024-03-26 20:47:14 +08:00