TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-22 03:35:00 +08:00

History

Ivy Zhang 7666bec7c4 [TRTQA-2861][test]: add nemotron and llama4 cases into qa test (#4053 ) * add MMLU, GPQADiamond check for llama-4 models Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * add nomotron cases Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * add online quant test cases Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * remove trt flow cases Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * update threshold Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * adjust parallelism strategy Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * fix fail Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * update sanity list Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * fix comment Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * skip nemotron-h test case Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> --------- Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> Co-authored-by: Larry <197874197+LarryXFly@users.noreply.github.com>		2025-05-08 18:10:41 +08:00
..
_llmapi_perf_evaluator	Update (#2978 )	2025-03-23 16:39:35 +08:00
accuracy	[TRTQA-2861][test]: add nemotron and llama4 cases into qa test (#4053 )	2025-05-08 18:10:41 +08:00
deterministic	chore: Cleanup deprecated APIs from LLM-API (part 1/2) (#3732 )	2025-05-07 13:20:25 +08:00
disaggregated	chore: bump version to 0.19.0 (#3598 ) (#3841 )	2025-04-29 16:57:22 +08:00
examples	fix:https://nvbugs/5246733 (#3989 )	2025-05-01 22:52:31 +08:00
llmapi	chore: refactor llmapi e2e tests (#3803 )	2025-05-05 07:37:24 +08:00
perf	tests: skip writing prepare_dataset output to logs, and add llama_v3.1_8b_fp8, llama_v3.3_70b_fp8, llama_v3.1_405b_fp4 models (#3864 )	2025-05-07 13:56:35 +08:00
stress_test	fix: trtllm-serve hang in stress test and ds v3 stress parameter update (#3836 )	2025-05-06 16:52:30 +08:00
sysinfo	Update (#2978 )	2025-03-23 16:39:35 +08:00
__init__.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
.test_durations	infra: Remove the WAR for test items incompletely (#3313 )	2025-05-04 11:31:59 +08:00
agg_unit_mem_df.csv	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
ci_profiler.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
common.py	[TRTLLM-4763][test] Accuracy test improvement (Part 3.6): Deprecate mmlu_llmapi.py (#3802 )	2025-04-23 23:05:13 +08:00
conftest.py	infra: Remove the WAR for test items incompletely (#3313 )	2025-05-04 11:31:59 +08:00
cpp_common.py	refactor: Move ModelSpec to core library (#3980 )	2025-05-04 01:39:09 +08:00
local_venv.py	test: Automatically clean checkpoints and engines (#3468 )	2025-04-12 09:56:29 +08:00
pytest.ini	chore: Refine attention backend interface. (#3271 )	2025-04-09 02:34:53 +08:00
runner_interface.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_cache.py	chore: clean some ci of qa test (#3083 )	2025-03-31 14:30:41 +08:00
test_cases.yml	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_cpp.py	refactor: Move ModelSpec to core library (#3980 )	2025-05-04 01:39:09 +08:00
test_e2e.py	[TRTQA-2861][test]: add nemotron and llama4 cases into qa test (#4053 )	2025-05-08 18:10:41 +08:00
test_list_parser.py	infra: Add test list name check (#3097 )	2025-04-20 23:02:16 +08:00
test_list_validation.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_mlpf_results.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_sanity.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_unittests.py	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
trt_test_alternative.py	Add thread leak check and fix thread/memory leak issues. (#3270 )	2025-04-08 19:03:18 +08:00