TensorRT-LLMs/tensorrt_llm/commands
2025-05-21 09:57:46 +08:00
__init__.py Update TensorRT-LLM (#613) 2023-12-08 17:49:24 +08:00
bench.py Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
build.py Update (#2978) 2025-03-23 16:39:35 +08:00
eval.py Breaking change: perf: Enable scheduling overlap by default (#4174) 2025-05-15 14:27:36 +08:00
prune.py Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
refit.py Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
serve.py feat: conditional disaggregation in disagg server (#3974) 2025-05-21 09:57:46 +08:00