TensorRT-LLMs/tensorrt_llm/quantization
2024-04-16 19:40:08 +08:00
..
__init__.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
functional.py Update TensorRT-LLM (#1427) 2024-04-09 17:03:34 +08:00
layers.py Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
mode.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
quantize_by_ammo.py Update TensorRT-LLM (#1427) 2024-04-09 17:03:34 +08:00
quantize.py Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00