TensorRT-LLMs/tensorrt_llm/commands
2025-12-08 10:37:40 -08:00
..
__init__.py Update TensorRT-LLM (#613) 2023-12-08 17:49:24 +08:00
bench.py [TRTLLM-9089][chore] Port prepare_dataset into trtllm-bench (#9250) 2025-12-08 10:37:40 -08:00
build.py [TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330) 2025-10-28 09:17:26 -07:00
eval.py [None][chore] Fix trtllm-eval and move GroupedGemmInputsHelper (#9612) 2025-12-03 07:55:03 +08:00
prune.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
refit.py [TRTLLM-8684][chore] Migrate BuildConfig to Pydantic, add a Python wrapper for KVCacheType enum (#8330) 2025-10-28 09:17:26 -07:00
serve.py [https://nvbugs/5703953][fix] Preserving ip:port for trtllm-serve before initializing llm (#9646) 2025-12-06 20:13:48 -08:00