TensorRT-LLMs/cpp/include/tensorrt_llm/executor
Zheng Duan fea5bfbda7
[None][feat] add detailed KV cache transfer time breakdown (#8521)
Signed-off-by: zhengd-nv <200704041+zhengd-nv@users.noreply.github.com>
2025-10-29 10:11:09 +08:00
..
cacheCommunicator.h Agent interface impl for NIXL (#4125) 2025-05-22 09:09:41 +08:00
dataTransceiverState.h [TRTLLM-6106][feat] Add support for KVCache transfer from KVCache reuse path (#6348) 2025-09-27 19:29:30 -04:00
disaggServerUtil.h Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00
executor.h [None][feat] Support ignored prompt length for penalties via new sampling config parameter (#8127) 2025-10-27 13:12:31 -04:00
serialization.h [TRTLLM-6106][feat] Add support for KVCache transfer from KVCache reuse path (#6348) 2025-09-27 19:29:30 -04:00
tensor.h Update TensorRT-LLM (#1918) 2024-07-09 14:42:22 +08:00
transferAgent.h [None][doc] Facilitates the integration of the transfer agent (#7867) 2025-10-21 20:06:24 +08:00
types.h [None][feat] add detailed KV cache transfer time breakdown (#8521) 2025-10-29 10:11:09 +08:00