TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

ruodil 2848e012ae test: add llama4 models for perf test (#5187) Signed-off-by: ruodil <200874449+ruodil@users.noreply.github.com> Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>		2025-06-16 11:24:35 +08:00
.gitignore	Update (#2978)	2025-03-23 16:39:35 +08:00
examples_test_list.txt	test: Add json_mode_eval for guided decoding evaluation (#5179)	2025-06-16 10:03:55 +08:00
llm_multinodes_function_test.txt	tests: add llama 3.3 70b 2 nodes tests (#4391)	2025-05-21 12:42:45 +08:00
llm_release_gb20x.txt	chore: Mass integration of release/0.20 (#4898)	2025-06-08 23:26:26 +08:00
llm_release_perf_multinode_test.txt	chore: Mass integration of release/0.18 (#3421)	2025-04-16 10:03:29 +08:00
llm_release_rtx_pro_6000.txt	Release 0.20 to main (#4577)	2025-05-28 16:25:33 +08:00
llm_sanity_test.txt	test: waive the NIXL related tests (#5153)	2025-06-12 17:02:27 +08:00
llm_triton_integration_test.txt	chore: Mass integration of release/0.20 (#4898)	2025-06-08 23:26:26 +08:00
trt_llm_integration_perf_sanity_test.yml	[TRTLLM-5171] chore: Remove GptSession/V1 from TRT workflow (#4092)	2025-05-14 23:10:04 +02:00
trt_llm_integration_perf_test.yml	[TRTLLM-5171] chore: Remove GptSession/V1 from TRT workflow (#4092)	2025-05-14 23:10:04 +02:00
trt_llm_release_perf_cluster_test.yml	test: add llama4 models for perf test (#5187)	2025-06-16 11:24:35 +08:00
trt_llm_release_perf_l2_test.yml	shorten reqs in con:1 cases and add streaming cases, and add l2 perf … (#4849)	2025-06-03 12:28:13 +08:00
trt_llm_release_perf_sanity_test.yml	test: set enable_attention_dp to False for non-deepseek models and add more cases for llama_v3.1/3.3 70b fp8 models (#5149)	2025-06-12 14:59:16 +08:00
trt_llm_release_perf_test.yml	test: add llama4 models for perf test (#5187)	2025-06-16 11:24:35 +08:00