TensorRT-LLMs/cpp/tensorrt_llm/executor/cache_transmission
brb-nv d798d66976
[TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2025-10-31 17:32:39 -07:00
..
agent_utils [None][doc] Facilitates the integration of the transfer agent (#7867) 2025-10-21 20:06:24 +08:00
mpi_utils feat: NIXL interface integration (#3934) 2025-05-19 18:18:22 +08:00
nixl_utils [None][doc] Facilitates the integration of the transfer agent (#7867) 2025-10-21 20:06:24 +08:00
ucx_utils [TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520) 2025-10-04 08:12:24 +08:00
cacheSplitConcat.cu [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145) 2025-10-31 17:32:39 -07:00
cacheSplitConcat.h [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145) 2025-10-31 17:32:39 -07:00
transferAgent.cpp Agent interface impl for NIXL (#4125) 2025-05-22 09:09:41 +08:00