TensorRT-LLMs/tensorrt_llm/quantization
石晓伟 2a115dae84
Update TensorRT-LLM (#1793)
Co-authored-by: DreamGenX <x@dreamgen.com>
Co-authored-by: Ace-RR <78812427+Ace-RR@users.noreply.github.com>
Co-authored-by: bprus <39293131+bprus@users.noreply.github.com>
Co-authored-by: janpetrov <janpetrov@icloud.com>
2024-06-18 18:18:23 +08:00
..
__init__.py Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
functional.py Update TensorRT-LLM (#1639) 2024-05-21 17:51:02 +08:00
layers.py Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
mode.py Update TensorRT-LLM (#1358) 2024-03-26 20:47:14 +08:00
quantize_by_modelopt.py Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
quantize.py Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00