TensorRT-LLMs/tests/unittest
Emma Qiao ff32caf4d7
[Infra] - Update dependencies with NGC PyTorch 25.05 and TRT 10.11 (#4885)
Signed-off-by: qqiao <qqiao@nvidia.com>
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Signed-off-by: Emma Qiao <qqiao@nvidia.com>
Co-authored-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
Co-authored-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
2025-06-17 23:48:34 +08:00
..
_torch Re-implement LlmResponse in Python to reduce host overhead of pybind (#5224) 2025-06-17 21:28:09 +08:00
api_stability Speculation: Draft Target in new FW (#4558) 2025-06-17 02:26:08 +08:00
bindings Solve underallocation in VSWA+/VGQA (#4667) 2025-06-12 12:12:46 +08:00
disaggregated Add disaggregated unittest (#4899) 2025-06-05 19:14:31 +08:00
llmapi test: Add json_mode_eval for guided decoding evaluation (#5179) 2025-06-16 10:03:55 +08:00
others test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
scaffolding [TRTLLM-4638] feat(scaffolding): update Reward Controller to PRM specific controller with step split (#4337) 2025-05-19 17:53:41 +08:00
tools Enable trtllm-bench to run LoRA and add basic e2e perf testing capability for LoRA in PyT flow (#5130) 2025-06-15 18:54:04 +03:00
trt Mxfp8xmxfp4 quant mode(#4978) 2025-06-10 22:01:37 +08:00
utils chore: fix llm_root when LLM_ROOT is not set (#4741) 2025-05-29 19:44:34 -07:00
conftest.py [Infra] - Update dependencies with NGC PyTorch 25.05 and TRT 10.11 (#4885) 2025-06-17 23:48:34 +08:00
dump_checkpoint_stats.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
profile_utils.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
pytest.ini [TRTLLM-5053] Refactoring and Unifying the Multimodal input preparation (#4506) 2025-06-03 12:02:07 -07:00
test_model_runner_cpp.py Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
test_pip_install.py relax the limitation of setuptools (#2992) 2025-03-24 13:36:10 +08:00