mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-02-05 02:31:33 +08:00
Signed-off-by: Yao Yao <lowsfer@users.noreply.github.com> KVCacheManagerV2 is a new python-based implementation of the KV cache manager, featuring cleaner API, better abstraction and better code quality without the accumulated legacy. |
||
|---|---|---|
| .. | ||
| algorithms.cpp | ||
| algorithms.h | ||
| bindings.cpp | ||
| bindings.h | ||
| buffers.cpp | ||
| buffers.h | ||
| cacheTransceiver.cpp | ||
| cacheTransceiver.h | ||
| kvCacheConnector.cpp | ||
| kvCacheConnector.h | ||
| kvCacheManager.cpp | ||
| kvCacheManager.h | ||
| kvCacheManagerV2Utils.cpp | ||
| kvCacheManagerV2Utils.h | ||
| llmRequest.cpp | ||
| llmRequest.h | ||