TensorRT-LLMs/examples/disaggregated/clients
Shunkangz ea050084ad
feat: Add support of chat completion in PD (#2985)
* Add support of chat completion in PD

Add support of include_usage in PD


Reformat


* Remove redundant code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Refactor code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Add chat completion test

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

* Refactor code

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>

---------

Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-04-11 17:53:28 +08:00
..
disagg_client.py feat: Add support of chat completion in PD (#2985) 2025-04-11 17:53:28 +08:00
prompts.json Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
run_loadgen.sh Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
template_trtllm_openai_completions.json Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00