TensorRT-LLMs/tensorrt_llm/models/gpt
brb-nv 727d78e785
Support prequantized fp8 ckpt for nemotron-mini-4b-instruct (#3046)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2025-04-01 14:52:09 +08:00
__init__.py  Update TensorRT-LLM (#787)  2024-01-02 17:54:32 +08:00
config.py    Update TensorRT-LLM (#2436)  2024-11-12 15:27:49 +08:00
convert.py   Update TensorRT-LLM (#2820)  2025-02-25 21:21:49 +08:00
model.py     Support prequantized fp8 ckpt for nemotron-mini-4b-instruct (#3046)  2025-04-01 14:52:09 +08:00