TensorRT-LLMs/tests/integration/test_lists/qa
ruodil 2848e012ae
test: add llama4 models for perf test (#5187)
Signed-off-by: ruodil <200874449+ruodil@users.noreply.github.com>
Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>
2025-06-16 11:24:35 +08:00
..
.gitignore Update (#2978) 2025-03-23 16:39:35 +08:00
examples_test_list.txt test: Add json_mode_eval for guided decoding evaluation (#5179) 2025-06-16 10:03:55 +08:00
llm_multinodes_function_test.txt tests: add llama 3.3 70b 2 nodes tests (#4391) 2025-05-21 12:42:45 +08:00
llm_release_gb20x.txt chore: Mass integration of release/0.20 (#4898) 2025-06-08 23:26:26 +08:00
llm_release_perf_multinode_test.txt chore: Mass integration of release/0.18 (#3421) 2025-04-16 10:03:29 +08:00
llm_release_rtx_pro_6000.txt Release 0.20 to main (#4577) 2025-05-28 16:25:33 +08:00
llm_sanity_test.txt test: waive the NIXL related tests (#5153) 2025-06-12 17:02:27 +08:00
llm_triton_integration_test.txt chore: Mass integration of release/0.20 (#4898) 2025-06-08 23:26:26 +08:00
trt_llm_integration_perf_sanity_test.yml [TRTLLM-5171] chore: Remove GptSession/V1 from TRT workflow (#4092) 2025-05-14 23:10:04 +02:00
trt_llm_integration_perf_test.yml [TRTLLM-5171] chore: Remove GptSession/V1 from TRT workflow (#4092) 2025-05-14 23:10:04 +02:00
trt_llm_release_perf_cluster_test.yml test: add llama4 models for perf test (#5187) 2025-06-16 11:24:35 +08:00
trt_llm_release_perf_l2_test.yml shorten reqs in con:1 cases and add streaming cases, and add l2 perf … (#4849) 2025-06-03 12:28:13 +08:00
trt_llm_release_perf_sanity_test.yml test: set enable_attention_dp to False for non-deepseek models and add more cases for llama_v3.1/3.3 70b fp8 models (#5149) 2025-06-12 14:59:16 +08:00
trt_llm_release_perf_test.yml test: add llama4 models for perf test (#5187) 2025-06-16 11:24:35 +08:00