TensorRT-LLMs/cpp/include/tensorrt_llm/batch_manager
milesial c1eefa8735
[https://nvbugs/5467062][fix] pass logitsPostProcessorBatched by reference (#7110)
Signed-off-by: Alexandre Milesi <30204471+milesial@users.noreply.github.com>
2025-08-22 04:42:06 -04:00
..
allocateKvCache.h
assignReqSeqSlots.h
cacheTransceiver.h chore:[BREAKING CHANGE] use cacheTransceiverConfig as knobs for disagg service (#5234) 2025-07-17 17:42:07 +08:00
capacityScheduler.h
common.h
contextProgress.h
createNewDecoderRequests.h
decoderBuffers.h refactor: Enhanced handling of decoder requests and logits within the batch manager (#6055) 2025-07-18 12:12:08 +02:00
evictionPolicy.h
guidedDecoder.h refactor: Enhanced handling of decoder requests and logits within the batch manager (#6055) 2025-07-18 12:12:08 +02:00
handleContextLogits.h
handleGenerationLogits.h
kvCacheEventManager.h feat: KV events for sliding window attention (#5580) 2025-07-05 06:05:20 +08:00
kvCacheManager.h [TRTLLM-5059][feat] Add KV cache reuse support for multimodal models (#5444) 2025-07-21 16:11:58 -07:00
kvCacheTransferManager.h
kvCacheType.h
kvCacheUtils.h
llmRequest.h [TRTLLM-6683][feat] Support LoRA reload CPU cache evicted adapter (#6786) 2025-08-11 14:31:39 -04:00
logitsPostProcessor.h [https://nvbugs/5467062][fix] pass logitsPostProcessorBatched by reference (#7110) 2025-08-22 04:42:06 -04:00
makeDecodingBatchInputOutput.h refactor: Enhanced handling of decoder requests and logits within the batch manager (#6055) 2025-07-18 12:12:08 +02:00
medusaBuffers.h
microBatchScheduler.h
pauseRequests.h
peftCacheManager.h
peftCacheManagerConfig.h
promptTuningBuffers.h
rnnStateManager.h
runtimeBuffers.h Revert "feat: nanobind bindings (#5961)" (#6160) 2025-07-18 10:12:54 +08:00
sequenceSlotManager.h
transformerBuffers.h
updateDecoderBuffers.h