mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-13 22:18:36 +08:00

History

Eran Geva c5d5af9e7f [#8391 ][chore] removed llama and added deepseek to AutoDeploy's L0 perf test (#10585 ) Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>		2026-01-11 16:31:24 -05:00
..
l0_a10.yml	[TRTLLM-10171][fix] Correct attention handling in ModelConfig and KVCacheManager (#10330 )	2026-01-04 06:07:30 -05:00
l0_a30.yml	[None][feat] Expose enable_trt_overlap in Triton_backend brings 1.05x OTPS (#10018 )	2025-12-23 11:41:31 -06:00
l0_a100.yml	[TRTLLM-9551][infra] Partition test_llm_pytorch.py for parallel execution (#10400 )	2026-01-05 13:58:03 -05:00
l0_b200.yml	[TRTLLM-10195][feat] K-EXAONE support (#10355 )	2026-01-12 00:29:51 +09:00
l0_b300.yml	[https://nvbugs/5640873 ][fix] Move thop tests to pre-merge (#9094 )	2025-11-13 13:08:13 +08:00
l0_dgx_b200_perf_sanity.yml	[TRTLLM-9834][feat] Transfer to TRTLLM-INFRA Database and Fail post-merge tests if regression (#10282 )	2025-12-31 21:44:59 +08:00
l0_dgx_b200.yml	[https://nvbugs/5767223 ][feat] add pp support for DeepSeek-v3.2 (#10449 )	2026-01-07 12:29:51 +08:00
l0_dgx_b300_perf_sanity.yml	[TRTLLM-9834][feat] Transfer to TRTLLM-INFRA Database and Fail post-merge tests if regression (#10282 )	2025-12-31 21:44:59 +08:00
l0_dgx_b300.yml	[TRTLLM-9622][infra] Enable DGX_B300 multi-gpu testing in pre-merge pipeline (#9699 )	2026-01-06 14:39:55 +08:00
l0_dgx_h100.yml	[#10056 ][chore] AutoDeploy: Enable Nemo SuperV3 accuracy test (#10308 )	2026-01-02 11:20:19 +02:00
l0_dgx_h200.yml	[https://nvbugs/5779534 ][fix] fix buffer reuse for CUDA graph attention metadata (#10393 )	2026-01-05 09:43:44 +08:00
l0_gb10.yml	[None][infra] Enable single-gpu CI on spark (#9304 )	2025-12-30 17:22:14 +08:00
l0_gb200_multi_gpus_perf_sanity.yml	[None][fix] Decrease Pre Merge Perf Tests (#10390 )	2026-01-04 12:21:34 -05:00
l0_gb200_multi_gpus.yml	[None][fix] Mistral large 3 few code refine (#10405 )	2026-01-08 06:38:49 -05:00
l0_gb200_multi_nodes_aggr_perf_sanity_2_nodes.yml	[TRTLLM-9834][feat] Transfer to TRTLLM-INFRA Database and Fail post-merge tests if regression (#10282 )	2025-12-31 21:44:59 +08:00
l0_gb200_multi_nodes_disagg_perf_sanity_3_nodes.yml	[TRTLLM-9834][feat] Transfer to TRTLLM-INFRA Database and Fail post-merge tests if regression (#10282 )	2025-12-31 21:44:59 +08:00
l0_gb200_multi_nodes_disagg_perf_sanity_6_nodes.yml	[TRTLLM-9834][feat] Transfer to TRTLLM-INFRA Database and Fail post-merge tests if regression (#10282 )	2025-12-31 21:44:59 +08:00
l0_gb200_multi_nodes_disagg_perf_sanity_8_nodes.yml	[TRTLLM-9834][feat] Transfer to TRTLLM-INFRA Database and Fail post-merge tests if regression (#10282 )	2025-12-31 21:44:59 +08:00
l0_gb200_multi_nodes.yml	[https://nvbugs/5715568 ][fix] Force release torch memory when LLM is destroyed (#10314 )	2026-01-05 15:30:18 +08:00
l0_gb202.yml	[TRTLLM-9831][perf] Enable 2CTA with autotune for CuteDSL MoE and Grouped GEMM optimizations (#10201 )	2025-12-25 09:04:20 -05:00
l0_gb203.yml	[TRTLLM-5277] chore: refine llmapi examples for 1.0 (part1) (#5431 )	2025-07-01 19:06:41 +08:00
l0_gb300_multi_gpus.yml	[None][feat] Enable EPLB for trtllm-gen and cutlass backend (#8886 )	2025-11-12 12:30:27 -08:00
l0_gb300.yml	[TRTLLM-8971][infra] Update gpu key for B300/GB300 (#8724 )	2025-10-29 20:36:44 -07:00
l0_gh200.yml	[None][test] Update llm_models_root to improve path handling on BareMetal environment (#7876 )	2025-09-24 17:35:57 +08:00
l0_h100.yml	[https://nvbugs/5669097 ][tests] Add MMMU test for mistral small (#10530 )	2026-01-09 16:09:28 -08:00
l0_l40s.yml	[None][fix] Enable AttentionDP on Qwen3-VL and fix test (#10435 )	2026-01-10 00:13:26 +09:00
l0_perf.yml	[#8391 ][chore] removed llama and added deepseek to AutoDeploy's L0 perf test (#10585 )	2026-01-11 16:31:24 -05:00
l0_rtx_pro_6000.yml	[TRTLLM-9831][perf] Enable 2CTA with autotune for CuteDSL MoE and Grouped GEMM optimizations (#10201 )	2025-12-25 09:04:20 -05:00
l0_sanity_check.yml	[None][fix] Fix ready signal in NIXL backend (#10000 )	2025-12-19 09:43:40 +08:00
README.md	Update (#2978 )	2025-03-23 16:39:35 +08:00

README.md

Description

This folder contains test definition which is consumed by trt-test-db tool based on system specifications.

Installation

Install trt-test-db using the following command:

pip3 install --extra-index-url https://urm.nvidia.com/artifactory/api/pypi/sw-tensorrt-pypi/simple --ignore-installed trt-test-db==1.8.5+bc6df7

Test Definition

Test definitions are stored in YAML files located in ${TRT_LLM_ROOT}/tests/integration/test_lists/test-db/. These files define test conditions and the tests to be executed.

Example YAML Structure

version: 0.0.1
l0_e2e:
  - condition:
      terms:
        supports_fp8: true
      ranges:
        system_gpu_count:
          gte: 4
          lte: 4
      wildcards:
        gpu:
          - '*h100*'
        linux_distribution_name: ubuntu*
    tests:
      - examples/test_llama.py::test_llm_llama_v3_1_1node_multi_gpus[llama-3.1-8b-enable_fp8]
      - examples/test_llama.py::test_llm_llama_v3_1_1node_multi_gpus[llama-3.1-70b-enable_fp8]

Generating Test Lists

Use trt-test-db to generate a test list based on the system configuration:

trt-test-db -d /TensorRT-LLM/src/tests/integration/test_lists/test-db \
            --context l0_e2e \
            --test-names \
            --output /TensorRT-LLM/src/l0_e2e.txt \
            --match-exact '{"chip":"ga102gl-a","compute_capability":"8.6","cpu":"x86_64","gpu":"A10","gpu_memory":"23028.0","host_mem_available_mib":"989937","host_mem_total_mib":"1031949","is_aarch64":false,"is_linux":true,"linux_distribution_name":"ubuntu","linux_version":"22.04","supports_fp8":false,"supports_int8":true,"supports_tf32":true,"sysname":"Linux","system_gpu_count":"1",...}'

This command generates a test list file (l0_e2e.txt) based on the specified context and system configuration.

Running Tests

Execute the tests using pytest with the generated test list:

pytest -v --test-list=/TensorRT-LLM/src/l0_e2e.txt --output-dir=/tmp/logs

This command runs the tests specified in the test list and outputs the results to the specified directory.

Additional Information

The --context parameter in the trt-test-db command specifies which context to search in the YAML files.
The --match-exact parameter provides system information used to filter tests based on the conditions defined in the YAML files.
Modify the YAML files to add or update test conditions and test cases as needed. For more detailed information on trt-test-db and pytest usage, refer to their respective documentation.