TensorRT-LLMs/tensorrt_llm/serve
JunyiXu-nv e291a834db
[TRTLLM-8462][feat] Support GET/DELETE v1/responses/{response_id} (#9937)
Signed-off-by: Junyi Xu <219237550+JunyiXu-nv@users.noreply.github.com>
2026-01-13 03:57:14 -05:00
..
scripts [None] [feat] skip batch_tokenize_prompts in CustomDataset (#10214) 2025-12-23 17:40:57 +08:00
tool_parser [None][chore] Unify DS tool parser names (#10239) 2025-12-31 14:40:07 +08:00
__init__.py
chat_utils.py [None][feat] Support custom chat template for tool calling (#9297) 2025-11-25 22:07:04 +08:00
cluster_storage.py [TRTLLM-9091] [feat] Replace GenAI-Perf with AIPerf (#9310) 2025-12-23 13:25:55 +08:00
disagg_auto_scaling.py [https://nvbugs/5726066][fix] fix auto-scaling related failures (#9845) 2025-12-18 16:37:48 -05:00
harmony_adapter.py [https://nvbugs/5633700][fix] Cache tiktoken vocab for gpt-oss (#10219) 2025-12-26 18:39:03 +08:00
metadata_server.py
openai_client.py [TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic (#9726) 2025-12-16 05:16:32 -08:00
openai_disagg_server.py [TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic (#9726) 2025-12-16 05:16:32 -08:00
openai_disagg_service.py [TRTLLM-9468][chore] Update disagg benchmarking scripts to support context parallelism (#9720) 2025-12-12 22:29:41 -08:00
openai_protocol.py [TRTLLM-9736][feat] AsyncLLM and verl integ (#9353) 2025-12-11 09:33:25 -08:00
openai_server.py [TRTLLM-8462][feat] Support GET/DELETE v1/responses/{response_id} (#9937) 2026-01-13 03:57:14 -05:00
openai_service.py [TRTLLM-8920][feat] decouple disagg service from fastapi (#8714) 2025-12-05 10:44:16 +08:00
perf_metrics.py [TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic (#9726) 2025-12-16 05:16:32 -08:00
postprocess_handlers.py [TRTLLM-7906][feat] Support multiple post process for Responses API (#9908) 2025-12-22 11:33:34 -05:00
responses_utils.py [TRTLLM-8462][feat] Support GET/DELETE v1/responses/{response_id} (#9937) 2026-01-13 03:57:14 -05:00
router.py [TRTLLM-9181][feat] improve disagg-server prometheus metrics; synchronize workers' clocks when workers are dynamic (#9726) 2025-12-16 05:16:32 -08:00