TensorRT-LLMs/tests/unittest
Venky 61da2daeb4
[TRTLLM-6761][refactor] Replace LogitBiasLogitsProcessor with embedding bias tensor system (#6464)
Signed-off-by: Venky Ganesh <23023424+venkywonka@users.noreply.github.com>
2025-08-05 07:14:24 -07:00
..
_torch [None][chore] Update Gemma3 closeness check to mitigate flakiness (#6591) 2025-08-04 10:10:58 -04:00
api_stability [None][feat] Add support of scheduling attention dp request (#6246) 2025-08-01 20:38:01 -04:00
bindings [nvbugs/5404000] fix: waive request_perf_metrics_draft test on pre-Hopper GPUs (#6339) 2025-07-28 12:36:44 -07:00
disaggregated feat: Dynamically remove servers in PD (#5270) 2025-06-25 09:50:04 +08:00
llmapi [TRTLLM-6761][refactor] Replace LogitBiasLogitsProcessor with embedding bias tensor system (#6464) 2025-08-05 07:14:24 -07:00
others test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
scaffolding [https://nvbugs/5387375] fix(scaffolding): fix scaffolding aime test in test_e2e (#6140) 2025-07-18 10:34:37 +08:00
tools enh: Add script to map tests <-> jenkins stages & vice-versa (#5177) 2025-07-19 00:50:40 +08:00
trt [TRTLLM-6674][feat] (Breaking Change) Hopper SWA non-cyclic kernels + KV reuse + Spec Dec (#6379) 2025-08-05 07:47:41 +00:00
utils [TRTLLM-5826][feat] Support pytorch LoRA adapter eviction (#5616) 2025-07-20 08:00:14 +03:00
conftest.py [feat][test] reuse MPI pool executor across tests (#5566) 2025-06-29 17:23:12 +03:00
dump_checkpoint_stats.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
gc_utils.py [nvbug 5273941] fix: broken cyclic reference detect (#5417) 2025-07-01 20:12:55 +08:00
profile_utils.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
pytest.ini [ci] parallelize torch unittests (#5714) 2025-07-09 11:05:57 +03:00
test_model_runner_cpp.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
test_pip_install.py Deepseek R1 FP8 Support on Blackwell (#6486) 2025-08-01 10:26:28 +08:00