TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-23 12:12:39 +08:00

Robin Kobus 37543a9ad7 [None][refactor] Simplify decoder state initialization for speculative decoding (#6869) Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com> 2025-08-22 18:44:17 +02:00
allocateKvCache.h
assignReqSeqSlots.h
cacheTransceiver.h
capacityScheduler.h
common.h
contextProgress.h
createNewDecoderRequests.h
decoderBuffers.h
evictionPolicy.h
guidedDecoder.h
handleContextLogits.h
handleGenerationLogits.h
kvCacheEventManager.h
kvCacheManager.h
kvCacheTransferManager.h
kvCacheType.h
kvCacheUtils.h
llmRequest.h
logitsPostProcessor.h
makeDecodingBatchInputOutput.h
medusaBuffers.h
microBatchScheduler.h
pauseRequests.h
peftCacheManager.h
peftCacheManagerConfig.h
promptTuningBuffers.h
rnnStateManager.h
runtimeBuffers.h
sequenceSlotManager.h
transformerBuffers.h
updateDecoderBuffers.h