TensorRT-LLMs/tests/integration/defs
Shunkangz ea050084ad
feat: Add support of chat completion in PD (#2985)
* Add support of chat completion in PD

Add support of include_usage in PD


Reformat


* Remove redundant code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Refactor code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Add chat completion test

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Refactor code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

---------

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-04-11 17:53:28 +08:00
..
_llmapi_perf_evaluator Update (#2978) 2025-03-23 16:39:35 +08:00
accuracy test: add torch flow test case in qa test list (#3404) 2025-04-11 16:57:41 +08:00
deterministic Update (#2978) 2025-03-23 16:39:35 +08:00
disaggregated feat: Add support of chat completion in PD (#2985) 2025-04-11 17:53:28 +08:00
examples test: add cuda visible device constraint for phi_1gpu test (#3364) 2025-04-11 17:14:52 +08:00
perf Update (#2978) 2025-03-23 16:39:35 +08:00
sysinfo Update (#2978) 2025-03-23 16:39:35 +08:00
__init__.py Update (#2978) 2025-03-23 16:39:35 +08:00
_run_llmapi_llm.py Update (#2978) 2025-03-23 16:39:35 +08:00
.test_durations chore: Rename nvsmall to nemotron nas (#3447) 2025-04-10 23:16:52 +08:00
agg_unit_mem_df.csv test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
ci_profiler.py Update (#2978) 2025-03-23 16:39:35 +08:00
common.py test: Add Eagle tests with untrained heads (#2991) 2025-04-01 11:41:59 +08:00
conftest.py Add thread leak check and fix thread/memory leak issues. (#3270) 2025-04-08 19:03:18 +08:00
cpp_common.py chore : split GptExecutor tests out of gpt tests to reduce single test time (#3412) 2025-04-10 09:08:15 +08:00
local_venv.py Update (#2978) 2025-03-23 16:39:35 +08:00
pytest.ini chore: Refine attention backend interface. (#3271) 2025-04-09 02:34:53 +08:00
runner_interface.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_cache.py chore: clean some ci of qa test (#3083) 2025-03-31 14:30:41 +08:00
test_cases.yml Update (#2978) 2025-03-23 16:39:35 +08:00
test_cpp.py chore : split GptExecutor tests out of gpt tests to reduce single test time (#3412) 2025-04-10 09:08:15 +08:00
test_e2e.py feat: Add Qwen2.5-VL and refactor Qwen2-VL (#3156) 2025-04-10 04:09:03 +08:00
test_list_parser.py Feat: Variable-Beam-Width-Search (VBWS) part3 (#3338) 2025-04-08 23:51:27 +08:00
test_list_validation.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_mlpf_results.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_sanity.py Update (#2978) 2025-03-23 16:39:35 +08:00
test_unittests.py test: reorganize tests folder hierarchy (#2996) 2025-03-27 12:07:53 +08:00
trt_test_alternative.py Add thread leak check and fix thread/memory leak issues. (#3270) 2025-04-08 19:03:18 +08:00
turtle_defs.json Update (#2978) 2025-03-23 16:39:35 +08:00