TensorRT-LLMs/tests/integration/defs/triton_server
Jhao-Ting Chen 92d90fa29a
[None][feat] Expose enable_trt_overlap in Triton_backend brings 1.05x OTPS (#10018)
Signed-off-by: Jhao-Ting Chen <jhaotingc@nvidia.com>
2025-12-23 11:41:31 -06:00
..
rcca/bug_4323566 Release 0.20 to main (#4577) 2025-05-28 16:25:33 +08:00
__init__.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
build_engines.py [https://nvbugs/5394409][feat] Support Mistral Small 3.1 multimodal in Triton Backend (#6714) 2025-08-21 18:08:38 +02:00
build_model.sh Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
common.py [https://nvbugs/5394409][feat] Support Mistral Small 3.1 multimodal in Triton Backend (#6714) 2025-08-21 18:08:38 +02:00
conftest.py [TRTLLM-5950][infra] Removing remaining turtle keywords from the code base (#7086) 2025-09-07 14:26:18 +08:00
local_venv.py [Infra]Remove some old keyword (#4552) 2025-05-31 13:50:45 +08:00
runner_interface.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00
test_list_parser.py [None][feat] add waive by sm version (#8928) 2025-11-05 19:20:43 -08:00
test_triton_llm.py [None][feat] Expose enable_trt_overlap in Triton_backend brings 1.05x OTPS (#10018) 2025-12-23 11:41:31 -06:00
test_triton_memleak.py chore: Mass integration of release/0.20 (#5082) 2025-06-17 14:32:02 +03:00
test_triton_multi_node.py Release 0.20 to main (#4577) 2025-05-28 16:25:33 +08:00
test_triton_rcca.py chore: Mass integration of release/0.20 (#4898) 2025-06-08 23:26:26 +08:00
test_triton.py [TRTLLM-6224][infra] Upgrade dependencies to DLFW 25.06 and CUDA 12.9.1 (#5678) 2025-08-03 11:18:59 +08:00
test.sh [https://nvbugs/5394409][feat] Support Mistral Small 3.1 multimodal in Triton Backend (#6714) 2025-08-21 18:08:38 +02:00
trt_test_alternative.py Move Triton backend to TRT-LLM main (#3549) 2025-05-16 07:15:23 +08:00