TensorRT-LLMs/triton_backend/tools
Guoming Zhang 202bed4574 [None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
..
dataset [nvbug 5283506] fix: Fix spec decode triton test (#4845) 2025-06-09 08:40:17 -04:00
gpt Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
inflight_batcher_llm [None][chroe] Rename TensorRT-LLM to TensorRT LLM for source code. (#7851) 2025-09-25 21:02:35 +08:00
multimodal [https://nvbugs/5394409][feat] Support Mistral Small 3.1 multimodal in Triton Backend (#6714) 2025-08-21 18:08:38 +02:00
tests [None][chore] Add tests for non-existent and completed request cancellation (#6840) 2025-08-14 15:57:01 -07:00
utils Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
whisper Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
__init__.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
fill_template.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
llmapi_client.py feat: Add support for Triton request cancellation (#5898) 2025-07-15 20:52:43 -04:00
utils.sh Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00