Mirror of https://github.com/NVIDIA/TensorRT-LLM.git, synced 2026-01-14 06:27:45 +08:00
* test: add the llama_v3.1_8b_fp8, llama_v3.1_405b, and llama_nemotron_49b models to the perf test, and change the original llama models' dtype from float16 to bfloat16 per README.md
  Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com>
* add the llama_3.2_1B model and fix a lora script issue
  Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com>
---------
Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com>
| File |
|---|
| .gitignore |
| examples_test_list.txt |
| llm_multinodes_function_test.txt |
| llm_release_perf_multinode_test.txt |
| llm_sanity_test.txt |
| trt_llm_integration_perf_sanity_test.yml |
| trt_llm_integration_perf_test.yml |
| trt_llm_release_perf_cluster_test.yml |
| trt_llm_release_perf_sanity_test.yml |
| trt_llm_release_perf_test.yml |