TensorRT-LLMs/tensorrt_llm/serve
Shunkangz dda7354d1a
Refactor return of first gen token in PD (#2986)
Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-04-01 12:28:27 +08:00
__init__.py              Update TensorRT-LLM (#2820)                       2025-02-25 21:21:49 +08:00
openai_disagg_server.py  Refactor return of first gen token in PD (#2986)  2025-04-01 12:28:27 +08:00
openai_protocol.py       Update (#2978)                                    2025-03-23 16:39:35 +08:00
openai_server.py         Update TensorRT-LLM (#2936)                       2025-03-18 21:25:19 +08:00
postprocess_handlers.py  Update (#2978)                                    2025-03-23 16:39:35 +08:00