TensorRT-LLMs/cpp/tensorrt_llm/executor/cache_transmission
Patrice Castonguay b7602f7bd4 [https://nvbugs/5534837][fix] Fix KV cache split on long context (#8247)
Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
Co-authored-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
Signed-off-by: Mike Iovine <6158008+mikeiovine@users.noreply.github.com>
2025-10-16 22:46:19 +08:00
agent_utils [None][feat] Support for cancelling requests with disaggregation (#8114) 2025-10-02 11:04:26 -07:00
mpi_utils feat: NIXL interface integration (#3934) 2025-05-19 18:18:22 +08:00
nixl_utils [TRTLLM-7989][infra] Bundle UCX and NIXL libs in the TRTLLM python package (#7766) 2025-09-22 16:43:35 +08:00
ucx_utils [TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520) 2025-10-04 08:12:24 +08:00
cacheSplitConcat.cu [https://nvbugs/5534837][fix] Fix KV cache split on long context (#8247) 2025-10-16 22:46:19 +08:00
cacheSplitConcat.h [TRTLLM-7731][feat] KV cache transmission in disagg with CP on gen side (#7624) 2025-09-20 06:15:26 -07:00
transferAgent.cpp Agent interface impl for NIXL (#4125) 2025-05-22 09:09:41 +08:00