TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

ruodil 9c03a7ab74 test: add llama_3.2_1B model and fix for test lora script issue (#4139 ) * test: add llama_v3.1_8b_fp8 model, llama_v3.1_405b model and llama_nemotron_49b model in perf test, and modify original llama models dtype from float16 to bfloat16 according to README.md Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com> * add llama_3.2_1B model and fix for lora script issue Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com> --------- Signed-off-by: Ruodi <200874449+ruodil@users.noreply.github.com>		2025-05-12 14:51:59 +08:00
..
dev	Update (#2978 )	2025-03-23 16:39:35 +08:00
qa	test: add llama_3.2_1B model and fix for test lora script issue (#4139 )	2025-05-12 14:51:59 +08:00
test-db	Refactor: Restructure C++ tests for better modularisation of non-shared code (#4027 )	2025-05-09 19:16:51 +01:00
waives.txt	tests: https://nvbugs/5219534 remove failed tests from test list (#4113 )	2025-05-12 14:13:40 +08:00