TensorRT-LLMs/cpp/tensorrt_llm/batch_manager
brb-nv d798d66976
[TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2025-10-31 17:32:39 -07:00
..
utils
allocateKvCache.cpp
assignReqSeqSlots.cpp
cacheFormatter.cpp [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145) 2025-10-31 17:32:39 -07:00
cacheFormatter.h [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145) 2025-10-31 17:32:39 -07:00
cacheTransBuffer.cpp
cacheTransBuffer.h
cacheTransceiver.cpp
capacityScheduler.cpp
CMakeLists.txt
contextProgress.cpp
createNewDecoderRequests.cpp
dataTransceiver.cpp
dataTransceiver.h
decoderBuffers.cpp
encoderBuffers.cpp
encoderBuffers.h
evictionPolicy.cpp
guidedDecoder.cpp
handleContextLogits.cpp
handleGenerationLogits.cpp
kvCacheEventManager.cpp
kvCacheManager.cpp [TRTLLM-8976][feat] Move indexer-k-cache to KVCacheManager (#8699) 2025-10-29 08:04:26 -07:00
kvCacheTransferManager.cpp
llmRequest.cpp
logitsPostProcessor.cpp
loraBuffers.cpp
loraBuffers.h
makeDecodingBatchInputOutput.cpp
medusaBuffers.cpp
microBatchScheduler.cpp
mlaCacheFormatter.cpp [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145) 2025-10-31 17:32:39 -07:00
mlaCacheFormatter.h [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145) 2025-10-31 17:32:39 -07:00
pauseRequests.cpp
peftCacheManager.cpp
promptTuningBuffers.cpp
rnnStateBuffers.cpp
rnnStateBuffers.h
rnnStateManager.cpp
runtimeBuffers.cpp
scheduledBlocksManager.h
sequenceSlotManager.cpp
transformerBuffers.cpp
trtEncoderModel.cpp
trtEncoderModel.h
trtGptModel.h
trtGptModelFactory.h
trtGptModelInflightBatching.cpp
trtGptModelInflightBatching.h
updateDecoderBuffers.cpp