TensorRT-LLMs/tests/unittest
Yukun He 9c5b464fe0
[None][feat] Apply AutoTuner to fp8_block_scale_deep_gemm to trigger JIT ahead of time. (#7113)
Because deep_gemm.gp8_gemm_nt will trigger many JIT processes during the inference phase, we need to sweep these shapes ahead of time. Apply the AutoTuner framework to achieve this and retain the potential capability to tune the swap_ab flag.

Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-08-25 10:48:31 +08:00
..
_torch [None][feat] Apply AutoTuner to fp8_block_scale_deep_gemm to trigger JIT ahead of time. (#7113) 2025-08-25 10:48:31 +08:00
api_stability [TRTLLM-7326][feat] Add standalone multimodal encoder (#6743) 2025-08-19 21:42:50 -07:00
bindings [TRTLLM-6881][feat] Include attention dp rank info with KV cache events (#6563) 2025-08-07 14:17:07 +02:00
disaggregated feat: Dynamically remove servers in PD (#5270) 2025-06-25 09:50:04 +08:00
llmapi [None][chore] Mass integration of release/1.0 (#6864) 2025-08-22 09:25:15 +08:00
others [TRTLLM-7096][infra] Testing cache transmission functionality in Python (#7025) 2025-08-25 09:47:39 +08:00
scaffolding [https://nvbugs/5387375] fix(scaffolding): fix scaffolding aime test in test_e2e (#6140) 2025-07-18 10:34:37 +08:00
tools enh: Add script to map tests <-> jenkins stages & vice-versa (#5177) 2025-07-19 00:50:40 +08:00
trt [TRTLLM-5863][feat] Support MoE INT8 Weight-Only-Quantization in PyTorch Workflow (#6629) 2025-08-15 17:15:49 -04:00
utils [TRTLLM-4921][feat] Enable chunked prefill for Nemotron-H (#6334) 2025-08-22 12:15:20 -04:00
conftest.py [TRTLLM-5508][feat] check input tokens + improve error handling (#5170) 2025-08-05 18:27:43 +01:00
dump_checkpoint_stats.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
gc_utils.py [nvbug 5273941] fix: broken cyclic reference detect (#5417) 2025-07-01 20:12:55 +08:00
profile_utils.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
pytest.ini [ci] parallelize torch unittests (#5714) 2025-07-09 11:05:57 +03:00
test_model_runner_cpp.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
test_pip_install.py [TRTLLM-7141][infra] Use repo mirrors to avoid intermittent network failures (#6836) 2025-08-15 11:16:07 +08:00