TensorRT-LLMs/cpp/tensorrt_llm/nanobind
Yao Yao 6f07fa81d7
[TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736)
Signed-off-by: Yao Yao <lowsfer@users.noreply.github.com>

KVCacheManagerV2 is a new Python-based implementation of the KV cache manager, featuring a cleaner API, better abstraction, and better code quality without the accumulated legacy.
2026-01-24 04:48:39 -05:00
batch_manager [TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736) 2026-01-24 04:48:39 -05:00
common [TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736) 2026-01-24 04:48:39 -05:00
executor [TRTLLM-10059][feat] Use global unique id as disagg request id (#10187) 2026-01-21 22:52:34 -05:00
process_group [TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520) 2025-10-04 08:12:24 +08:00
runtime [TRTLLM-9108][feat] Add test configurable moe module multi gpu (#10699) 2026-01-23 10:16:58 +08:00
testing feat: nanobind bindings (#6185) 2025-07-21 08:56:57 +01:00
thop [None][fix] Remove unused params in attn (#10652) 2026-01-20 03:08:59 -05:00
userbuffers [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
bindings.cpp [TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736) 2026-01-24 04:48:39 -05:00
CMakeLists.txt [TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736) 2026-01-24 04:48:39 -05:00