kanshan/TensorRT-LLMs
mirror of https://github.com/NVIDIA/TensorRT-LLM.git, synced 2026-02-12 14:03:48 +08:00
a64693b6ce
TensorRT-LLMs/cpp/tensorrt_llm/batch_manager
brb-nv d798d66976 [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
2025-10-31 17:32:39 -07:00
utils
allocateKvCache.cpp
assignReqSeqSlots.cpp
cacheFormatter.cpp    [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145)    2025-10-31 17:32:39 -07:00
cacheFormatter.h    [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145)    2025-10-31 17:32:39 -07:00
cacheTransBuffer.cpp
cacheTransBuffer.h
cacheTransceiver.cpp
capacityScheduler.cpp
CMakeLists.txt
contextProgress.cpp
createNewDecoderRequests.cpp
dataTransceiver.cpp
dataTransceiver.h
decoderBuffers.cpp
encoderBuffers.cpp
encoderBuffers.h
evictionPolicy.cpp
guidedDecoder.cpp
handleContextLogits.cpp
handleGenerationLogits.cpp
kvCacheEventManager.cpp
kvCacheManager.cpp    [TRTLLM-8976][feat] Move indexer-k-cache to KVCacheManager (#8699)    2025-10-29 08:04:26 -07:00
kvCacheTransferManager.cpp
llmRequest.cpp
logitsPostProcessor.cpp
loraBuffers.cpp
loraBuffers.h
makeDecodingBatchInputOutput.cpp
medusaBuffers.cpp
microBatchScheduler.cpp
mlaCacheFormatter.cpp    [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145)    2025-10-31 17:32:39 -07:00
mlaCacheFormatter.h    [TRTLLM-7731][feat] Avoid over-allocation of KV cache for transmission in disagg with CP (#8145)    2025-10-31 17:32:39 -07:00
pauseRequests.cpp
peftCacheManager.cpp
promptTuningBuffers.cpp
rnnStateBuffers.cpp
rnnStateBuffers.h
rnnStateManager.cpp
runtimeBuffers.cpp
scheduledBlocksManager.h
sequenceSlotManager.cpp
transformerBuffers.cpp
trtEncoderModel.cpp
trtEncoderModel.h
trtGptModel.h
trtGptModelFactory.h
trtGptModelInflightBatching.cpp
trtGptModelInflightBatching.h
updateDecoderBuffers.cpp