TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

History

Ivy Zhang c4a0d768b5 tests: add qa test mentioned in docs (#4357 ) * add nemotron-h and llama_70b cases Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * trial Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * add llm decoder quick_start case Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * update nemotron-h test case Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * add qwen3 quickstart test Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * add trtllm_decoder accuracy test Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * remove quickstart test for llm_decoder Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * fix import error Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * nemotronh fp8 trial Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * fix name Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> * remove nemotronh-fp8 Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com> --------- Signed-off-by: Ivy Zhang <25222398+crazydemo@users.noreply.github.com>		2025-05-19 10:06:51 +08:00
..
_llmapi_perf_evaluator	Update (#2978 )	2025-03-23 16:39:35 +08:00
accuracy	tests: add qa test mentioned in docs (#4357 )	2025-05-19 10:06:51 +08:00
cpp	[TRTLLM-5171] chore: Remove GptSession/V1 from TRT workflow (#4092 )	2025-05-14 23:10:04 +02:00
deterministic	chore: Cleanup deprecated APIs from LLM-API (part 1/2) (#3732 )	2025-05-07 13:20:25 +08:00
disaggregated	Breaking change: perf: Enable scheduling overlap by default (#4174 )	2025-05-15 14:27:36 +08:00
examples	test: FIX test_ptp_quickstart_advanced_deepseek_v3_2nodes_8gpus (#4283 )	2025-05-15 15:57:44 +08:00
llmapi	chore: refactor llmapi e2e tests (#3803 )	2025-05-05 07:37:24 +08:00
perf	test(perf): Add `Phi-4-mini-instruct` to perf tests (#4267 )	2025-05-15 21:27:03 +08:00
stress_test	Breaking change: perf: Enable scheduling overlap by default (#4174 )	2025-05-15 14:27:36 +08:00
sysinfo	Update (#2978 )	2025-03-23 16:39:35 +08:00
triton_server	Remove vila test (#4376 )	2025-05-19 09:02:39 +08:00
__init__.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
.test_durations	[TRTLLM-5171] chore: Remove GptSession/V1 from TRT workflow (#4092 )	2025-05-14 23:10:04 +02:00
agg_unit_mem_df.csv	test: reorganize tests folder hierarchy (#2996 )	2025-03-27 12:07:53 +08:00
ci_profiler.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
common.py	Add test case for kv memory estimation (#4158 )	2025-05-14 18:39:25 +08:00
conftest.py	Move Triton backend to TRT-LLM main (#3549 )	2025-05-16 07:15:23 +08:00
local_venv.py	tests: https://nvbugs/5219534 remove failed tests from test list (#4113 )	2025-05-12 14:13:40 +08:00
pytest.ini	Move Triton backend to TRT-LLM main (#3549 )	2025-05-16 07:15:23 +08:00
runner_interface.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_cache.py	chore: clean some ci of qa test (#3083 )	2025-03-31 14:30:41 +08:00
test_cases.yml	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_e2e.py	tests: add qa test mentioned in docs (#4357 )	2025-05-19 10:06:51 +08:00
test_list_parser.py	infra: Add test list name check (#3097 )	2025-04-20 23:02:16 +08:00
test_list_validation.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_mlpf_results.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_sanity.py	Update (#2978 )	2025-03-23 16:39:35 +08:00
test_unittests.py	[TRTLLM-4886][infra]Try another timeout opt to exit test thread directly instead of gracefully (#4341 )	2025-05-16 17:56:40 +08:00
trt_test_alternative.py	Add test case for kv memory estimation (#4158 )	2025-05-14 18:39:25 +08:00