TensorRT-LLMs/tests/unittest
Kaiyu Xie 155b19e6b0 feat: Return context response immediately when stream_interval > 1 (#5836)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2025-07-10 22:50:57 -07:00
..
_torch perf: Optimize swizzle_sf, unswizzle_sf, reswizzle_sf (#5318) 2025-06-26 14:03:56 +08:00
api_stability test: Add LLGuidance test and refine guided decoding (#5348) 2025-06-25 14:12:56 +08:00
bindings refactor: remove batch_manager::KvCacheConfig and use executor::KvCacheConfig instead (#5384) 2025-06-26 19:45:52 +08:00
disaggregated feat: Dynamically remove servers in PD (#5270) 2025-06-25 09:50:04 +08:00
llmapi feat: Return context response immediately when stream_interval > 1 (#5836) 2025-07-10 22:50:57 -07:00
others test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
scaffolding [TRTLLM-4638] feat(scaffolding): update Reward Controller to PRM specific controller with step split (#4337) 2025-05-19 17:53:41 +08:00
tools Enable trtllm-bench to run LoRA and add basic e2e perf testing capability for LoRA in PyT flow (#5130) 2025-06-15 18:54:04 +03:00
trt Mxfp8xmxfp4 quant mode(#4978) 2025-06-10 22:01:37 +08:00
utils chore: fix llm_root when LLM_ROOT is not set (#4741) 2025-05-29 19:44:34 -07:00
conftest.py [fix][test] clear cuda cache before unittests automatically (#5121) 2025-06-19 00:36:53 +03:00
dump_checkpoint_stats.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
profile_utils.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
pytest.ini [TRTLLM-5053] Refactoring and Unifying the Multimodal input preparation (#4506) 2025-06-03 12:02:07 -07:00
test_model_runner_cpp.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
test_pip_install.py relax the limitation of setuptools (#2992) 2025-03-24 13:36:10 +08:00