This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-23 12:12:39 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
a765ee4d21
TensorRT-LLMs
/
cpp
/
include
/
tensorrt_llm
/
batch_manager
History
Robin Kobus
37543a9ad7
[None][refactor] Simplify decoder state initialization for speculative decoding (
#6869
)
...
Signed-off-by: Robin Kobus <19427718+Funatiq@users.noreply.github.com>
2025-08-22 18:44:17 +02:00
..
allocateKvCache.h
assignReqSeqSlots.h
cacheTransceiver.h
capacityScheduler.h
common.h
contextProgress.h
createNewDecoderRequests.h
decoderBuffers.h
evictionPolicy.h
guidedDecoder.h
handleContextLogits.h
handleGenerationLogits.h
kvCacheEventManager.h
kvCacheManager.h
kvCacheTransferManager.h
kvCacheType.h
kvCacheUtils.h
llmRequest.h
logitsPostProcessor.h
makeDecodingBatchInputOutput.h
medusaBuffers.h
microBatchScheduler.h
pauseRequests.h
peftCacheManager.h
peftCacheManagerConfig.h
promptTuningBuffers.h
rnnStateManager.h
runtimeBuffers.h
sequenceSlotManager.h
transformerBuffers.h
updateDecoderBuffers.h