mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-02-05 02:31:33 +08:00
Signed-off-by: Yao Yao <lowsfer@users.noreply.github.com> KVCacheManagerV2 is a new python-based implementation of the KV cache manager, featuring cleaner API, better abstraction and better code quality without the accumulated legacy. |
||
|---|---|---|
| .. | ||
| batch_manager | ||
| common | ||
| executor | ||
| process_group | ||
| runtime | ||
| testing | ||
| thop | ||
| userbuffers | ||
| bindings.cpp | ||
| CMakeLists.txt | ||