TensorRT-LLMs/tests/integration/defs/disaggregated
Shunkangz ea050084ad
feat: Add support of chat completion in PD (#2985)
* Add support of chat completion in PD

Add support of include_usage in PD

Reformat

* Remove redundant code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Refactor code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Add chat completion test

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Refactor code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

---------

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-04-11 17:53:28 +08:00
test_configs | chore: Adding DS V3-lite tests with overlap + cuda graph (#3342) | 2025-04-08 09:36:09 -04:00
sanity_check.sh | feat: Add support of chat completion in PD (#2985) | 2025-04-11 17:53:28 +08:00
test_disaggregated_single_gpu.py | test: disable attention DP tests for single GPU (#3395) | 2025-04-11 01:38:17 +08:00
test_disaggregated.py | chore: Adding DS V3-lite tests with overlap + cuda graph (#3342) | 2025-04-08 09:36:09 -04:00