TensorRT-LLMs/cpp/tensorrt_llm/pybind/batch_manager
Chuang Zhu 44cfd757b2
Agent interface impl for NIXL (#4125)
* agentConnection

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

recv

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

agentState

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

NIXL interfaces

Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

update cmakelists

Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

nixl improve

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

remove cppzmq

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

fix

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

transferAgent remove register

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

work for cache Test

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

reduce sleep time

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

fix test

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

intergarte

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

nixl env

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

fix rebase error

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

cpp test

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

stash for send metaData

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

loadRemoteMD after fetchRemoteMD

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

workaround for mixed gen and context

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

test_env

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

avoid port conflict in test

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

* format

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

* use std::string

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

* typo

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

* fix transferAgentTest

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>

---------

Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
2025-05-22 09:09:41 +08:00
..
algorithms.cpp refactor: Copy sequence lengths once in decoder setup (#4102) 2025-05-16 22:03:55 +08:00
algorithms.h Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
bindings.cpp refactor: Copy sequence lengths once in decoder setup (#4102) 2025-05-16 22:03:55 +08:00
bindings.h Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
buffers.cpp Update TensorRT-LLM (#2849) 2025-03-04 18:44:00 +08:00
buffers.h Update TensorRT-LLM (#2755) 2025-02-11 03:01:00 +00:00
cacheTransceiver.cpp Agent interface impl for NIXL (#4125) 2025-05-22 09:09:41 +08:00
cacheTransceiver.h Update TensorRT-LLM (#2820) 2025-02-25 21:21:49 +08:00
kvCacheManager.cpp cacheTransceiver buffer manager (#3798) 2025-04-27 11:48:15 +08:00
kvCacheManager.h Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
llmRequest.cpp feat: Add multimodal embedding field in LlmRequest (#3855) 2025-05-01 12:23:30 +08:00
llmRequest.h feat: Add multimodal embedding field in LlmRequest (#3855) 2025-05-01 12:23:30 +08:00