TensorRT-LLMs/triton_backend/tools
Dimitrios Bariamis f49dafe0da
[https://nvbugs/5394409][feat] Support Mistral Small 3.1 multimodal in Triton Backend (#6714)
Signed-off-by: Dimitrios Bariamis <12195802+dbari@users.noreply.github.com>
Signed-off-by: Dimitrios Bariamis <dbari@users.noreply.github.com>
Co-authored-by: Dimitrios Bariamis <12195802+dbari@users.noreply.github.com>
Co-authored-by: Iman Tabrizian <10105175+Tabrizian@users.noreply.github.com>
2025-08-21 18:08:38 +02:00
..
dataset [nvbug 5283506] fix: Fix spec decode triton test (#4845) 2025-06-09 08:40:17 -04:00
gpt Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
inflight_batcher_llm feat: add dataset support for benchmark_core_model with LLMAPI (#4457) 2025-05-21 19:18:43 -07:00
multimodal [https://nvbugs/5394409][feat] Support Mistral Small 3.1 multimodal in Triton Backend (#6714) 2025-08-21 18:08:38 +02:00
tests [None][chore] Add tests for non-existent and completed request cancellation (#6840) 2025-08-14 15:57:01 -07:00
utils Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
whisper Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
__init__.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
fill_template.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
llmapi_client.py feat: Add support for Triton request cancellation (#5898) 2025-07-15 20:52:43 -04:00
utils.sh Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00