TensorRT-LLMs/tests/unittest
Enwei Zhu 21efb50068
[TRTLLM-6406] feat: Enable guided decoding with overlap scheduler (#6000)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-07-17 17:46:10 +08:00
..
_torch feat: TRTLLM-5574 Add phi-4-multimodal pytorch-backend support (#5644) 2025-07-17 06:30:58 +08:00
api_stability [TRTLLM-5493] Add core infrastructure to enable loading of custom checkpoint formats (#5372) 2025-07-17 00:50:30 +08:00
bindings chore:[BREAKING CHANGE] use cacheTransceiverConfig as knobs for disagg service (#5234) 2025-07-17 17:42:07 +08:00
disaggregated feat: Dynamically remove servers in PD (#5270) 2025-06-25 09:50:04 +08:00
llmapi [TRTLLM-6406] feat: Enable guided decoding with overlap scheduler (#6000) 2025-07-17 17:46:10 +08:00
others test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
scaffolding fix #4974: A thread leak issue in scaffolding unittest (#5020) 2025-07-14 20:22:03 +09:00
tools Enable trtllm-bench to run LoRA and add basic e2e perf testing capability for LoRA in PyT flow (#5130) 2025-06-15 18:54:04 +03:00
trt Update transformers to 4.53.0 (#5747) 2025-07-09 09:32:24 -07:00
utils feat: W4A16 GEMM (#4232) 2025-07-01 10:36:05 +03:00
conftest.py [feat][test] reuse MPI pool executor across tests (#5566) 2025-06-29 17:23:12 +03:00
dump_checkpoint_stats.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
gc_utils.py [nvbug 5273941] fix: broken cyclic reference detect (#5417) 2025-07-01 20:12:55 +08:00
profile_utils.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
pytest.ini [ci] parallelize torch unittests (#5714) 2025-07-09 11:05:57 +03:00
test_model_runner_cpp.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
test_pip_install.py relax the limitation of setuptools (#2992) 2025-03-24 13:36:10 +08:00