TensorRT-LLMs/cpp/tensorrt_llm/pybind
Yao Yao 6f07fa81d7
[TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736)
Signed-off-by: Yao Yao <lowsfer@users.noreply.github.com>

KVCacheManagerV2 is a new python-based implementation of the KV cache manager, featuring cleaner API, better abstraction and better code quality without the accumulated legacy.
2026-01-24 04:48:39 -05:00
..
batch_manager [TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736) 2026-01-24 04:48:39 -05:00
common [None][feat] Add Request specific exception (#6931) 2025-09-04 18:43:42 -04:00
executor [TRTLLM-10059][feat] Use global unique id as disagg request id (#10187) 2026-01-21 22:52:34 -05:00
process_group [TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520) 2025-10-04 08:12:24 +08:00
runtime [TRTLLM-9108][feat] Add test configurable moe module multi gpu (#10699) 2026-01-23 10:16:58 +08:00
testing fix: Improve chunking test and skip empty kernel calls (#5710) 2025-07-04 09:08:15 +02:00
thop [None][fix] Remove unused params in attn (#10652) 2026-01-20 03:08:59 -05:00
userbuffers [None][fix] Introduce inline namespace to avoid symbol collision (#9541) 2025-12-12 23:32:15 +08:00
bindings.cpp [TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736) 2026-01-24 04:48:39 -05:00
CMakeLists.txt [TRTLLM-7738][feat] Adding implementation of KVCacheManagerV2 (#10736) 2026-01-24 04:48:39 -05:00