TensorRT-LLMs/cpp/tensorrt_llm/executor/cache_transmission
Patrice Castonguay fd7a11e11d
[https://nvbugs/5534837][fix] Fix KV cache split on long context (#8247)
Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
Signed-off-by: Patrice Castonguay <55748270+pcastonguay@users.noreply.github.com>
Co-authored-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
2025-10-13 11:48:49 -04:00
..
agent_utils [TRTLLM-8044][refactor] Rename data -> cache for cacheTransceiver (#7659) 2025-09-16 08:43:56 -04:00
mpi_utils feat: NIXL interface integration (#3934) 2025-05-19 18:18:22 +08:00
nixl_utils [TRTLLM-7989][infra] Bundle UCX and NIXL libs in the TRTLLM python package (#7766) 2025-09-22 16:43:35 +08:00
ucx_utils [None][fix] get Local IP by connect remote (#7719) 2025-09-19 10:01:03 +08:00
cacheSplitConcat.cu [https://nvbugs/5534837][fix] Fix KV cache split on long context (#8247) 2025-10-13 11:48:49 -04:00
cacheSplitConcat.h [TRTLLM-7731][feat] KV cache transmission in disagg with CP on gen side (#7624) 2025-09-20 06:15:26 -07:00
transferAgent.cpp Agent interface impl for NIXL (#4125) 2025-05-22 09:09:41 +08:00