TensorRT-LLMs/tests/integration/test_lists/test-db
NVJiangShao a700646132
[None][fix] Add FP4 all2all unitest and fix a bug for module WideEPMoE (#6784)
Signed-off-by: Jiang Shao <91270701+StudyingShao@users.noreply.github.com>
2025-08-14 13:35:37 +08:00
l0_a10.yml [None][feat] Core Metrics Implementation (#5785) 2025-08-09 02:48:53 -04:00
l0_a30.yml [None][test] Test trtllm-bench AD vs, PT BEs on H100 single gpu (#6487) 2025-08-11 08:33:13 +03:00
l0_a100.yml fix [nvbug/5351244]: address remote mpi session submit (#5664) 2025-07-22 12:48:00 +08:00
l0_b200.yml [None][test] Test trtllm-bench AD vs, PT BEs on H100 single gpu (#6487) 2025-08-11 08:33:13 +03:00
l0_dgx_b200.yml [None][fix] Add FP4 all2all unitest and fix a bug for module WideEPMoE (#6784) 2025-08-14 13:35:37 +08:00
l0_dgx_h100.yml [TRTLLM-6675][infra] Cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/6623 (#6735) 2025-08-14 04:36:38 +00:00
l0_dgx_h200.yml [None][fix] fix same pp disagg (#6730) 2025-08-10 22:45:15 -04:00
l0_gb200_multi_nodes.yml [doc][ci][Qwen3][nvbugs 5374145] Add Qwen3 235B eagle3 CI (#6477) 2025-07-31 09:37:23 +08:00
l0_gb200.yml test: Move some of the test from post merge to pre-merge, update dgx b200 test case (#5640) 2025-07-14 17:17:30 +08:00
l0_gb202.yml [fix] WAR to fix the illegal memory access issue in moe gemm on SM120 (#5636) 2025-07-10 09:20:30 +08:00
l0_gb203.yml [TRTLLM-5277] chore: refine llmapi examples for 1.0 (part1) (#5431) 2025-07-01 19:06:41 +08:00
l0_gh200.yml [Infra] - Set default timeout to 1hr and remove some specific settings (#5667) 2025-07-02 08:37:54 -04:00
l0_h100.yml [None][feat] Add support for Hopper MLA chunked prefill (#6655) 2025-08-14 10:39:26 +08:00
l0_l40s.yml [TRTLLM-6761][refactor] Replace LogitBiasLogitsProcessor with embedding bias tensor system (#6464) 2025-08-05 07:14:24 -07:00
l0_perf.yml CI: Performance regression tests update (#3531) 2025-06-01 09:47:55 +03:00
l0_rtx_pro_6000.yml feat: Add deepseek-lite tests for RTX pro 6000 (#5903) 2025-07-16 15:51:45 +08:00
l0_sanity_check.yml [TRTLLM-6160] chore: add sampling examples for pytorch (#5951) 2025-07-14 15:28:32 +09:00
README.md Update (#2978) 2025-03-23 16:39:35 +08:00

Description

This folder contains test definitions that the trt-test-db tool consumes to select tests based on system specifications.

Installation

Install trt-test-db using the following command:

pip3 install --extra-index-url https://urm.nvidia.com/artifactory/api/pypi/sw-tensorrt-pypi/simple --ignore-installed trt-test-db==1.8.5+bc6df7

Test Definition

Test definitions are stored in YAML files located in ${TRT_LLM_ROOT}/tests/integration/test_lists/test-db/. These files define test conditions and the tests to be executed.

Example YAML Structure

version: 0.0.1
l0_e2e:
  - condition:
      terms:
        supports_fp8: true
      ranges:
        system_gpu_count:
          gte: 4
          lte: 4
      wildcards:
        gpu:
          - '*h100*'
        linux_distribution_name: ubuntu*
    tests:
      - examples/test_llama.py::test_llm_llama_v3_1_1node_multi_gpus[llama-3.1-8b-enable_fp8]
      - examples/test_llama.py::test_llm_llama_v3_1_1node_multi_gpus[llama-3.1-70b-enable_fp8]
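
To make the three clause types in the condition above concrete, here is a minimal Python sketch of how such a condition could be evaluated against a system description. It is not trt-test-db's implementation: the function name `condition_matches`, the inclusive treatment of `gte`/`lte`, and the case-insensitive glob matching are all assumptions made for illustration.

```python
from fnmatch import fnmatch


def condition_matches(condition, system):
    """Return True if `system` satisfies every clause of `condition`.

    Hypothetical helper: the real trt-test-db matching rules may differ.
    """
    # terms: assumed to require exact equality.
    for key, expected in condition.get("terms", {}).items():
        if system.get(key) != expected:
            return False

    # ranges: assumed to be inclusive numeric bounds.
    for key, bounds in condition.get("ranges", {}).items():
        if key not in system:
            return False
        value = float(system[key])
        if "gte" in bounds and value < bounds["gte"]:
            return False
        if "lte" in bounds and value > bounds["lte"]:
            return False

    # wildcards: glob patterns, given as one string or a list of strings.
    # Case-insensitive comparison is an assumption.
    for key, patterns in condition.get("wildcards", {}).items():
        if isinstance(patterns, str):
            patterns = [patterns]
        value = str(system.get(key, "")).lower()
        if not any(fnmatch(value, pattern.lower()) for pattern in patterns):
            return False

    return True


# The condition from the l0_e2e example, written as a Python dict.
condition = {
    "terms": {"supports_fp8": True},
    "ranges": {"system_gpu_count": {"gte": 4, "lte": 4}},
    "wildcards": {"gpu": ["*h100*"], "linux_distribution_name": "ubuntu*"},
}

# Two made-up system descriptions using the same keys as --match-exact.
h100_node = {
    "supports_fp8": True,
    "system_gpu_count": "4",
    "gpu": "H100",
    "linux_distribution_name": "ubuntu",
}
a10_node = {
    "supports_fp8": False,
    "system_gpu_count": "1",
    "gpu": "A10",
    "linux_distribution_name": "ubuntu",
}

print(condition_matches(condition, h100_node))
print(condition_matches(condition, a10_node))
```

Under these assumptions the four-GPU H100 description satisfies the condition, while the single-GPU A10 description already fails at the `supports_fp8` term.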

Generating Test Lists

Use trt-test-db to generate a test list based on the system configuration:

trt-test-db -d /TensorRT-LLM/src/tests/integration/test_lists/test-db \
            --context l0_e2e \
            --test-names \
            --output /TensorRT-LLM/src/l0_e2e.txt \
            --match-exact '{"chip":"ga102gl-a","compute_capability":"8.6","cpu":"x86_64","gpu":"A10","gpu_memory":"23028.0","host_mem_available_mib":"989937","host_mem_total_mib":"1031949","is_aarch64":false,"is_linux":true,"linux_distribution_name":"ubuntu","linux_version":"22.04","supports_fp8":false,"supports_int8":true,"supports_tf32":true,"sysname":"Linux","system_gpu_count":"1",...}'

This command generates a test list file (l0_e2e.txt) based on the specified context and system configuration.
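
Writing the --match-exact JSON by hand is error-prone, so the Python sketch below assembles the same command from a dictionary. It only builds and prints the command line and does not run the tool. The helper name `build_command` is hypothetical, the flags are the ones shown above, and the dictionary holds only a subset of the fields from the example. Whether trt-test-db accepts a partial system description is not confirmed here.

```python
import json
import shlex

# A subset of the system fields from the example above (values copied from it).
system_info = {
    "gpu": "A10",
    "compute_capability": "8.6",
    "is_linux": True,
    "linux_distribution_name": "ubuntu",
    "supports_fp8": False,
    "system_gpu_count": "1",
}


def build_command(test_db_dir, context, output_path, info):
    """Assemble the trt-test-db argument list (hypothetical helper)."""
    return [
        "trt-test-db",
        "-d", test_db_dir,
        "--context", context,
        "--test-names",
        "--output", output_path,
        # Compact JSON; Python booleans serialize to JSON true/false.
        "--match-exact", json.dumps(info, separators=(",", ":")),
    ]


cmd = build_command(
    "/TensorRT-LLM/src/tests/integration/test_lists/test-db",
    "l0_e2e",
    "/TensorRT-LLM/src/l0_e2e.txt",
    system_info,
)

# shlex.join quotes the JSON argument so it can be pasted into a shell.
print(shlex.join(cmd))
```

Keeping the arguments as a list means the same value can later be passed to `subprocess.run(cmd, check=True)` without any additional shell quoting.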

Running Tests

Execute the tests using pytest with the generated test list:

pytest -v --test-list=/TensorRT-LLM/src/l0_e2e.txt --output-dir=/tmp/logs

This command runs the tests specified in the test list and outputs the results to the specified directory.

Additional Information

  • The --context parameter in the trt-test-db command specifies which context to search in the YAML files.
  • The --match-exact parameter provides system information used to filter tests based on the conditions defined in the YAML files.
  • Modify the YAML files to add or update test conditions and test cases as needed.

For more detailed information on trt-test-db and pytest usage, refer to their respective documentation.