TensorRT-LLMs/cpp/tensorrt_llm/batch_manager
Kaiyu Xie 3aa6b11d13
Update TensorRT-LLM (#2936)
* Update TensorRT-LLM

---------

Co-authored-by: changcui <cuichang147@gmail.com>
2025-03-18 21:25:19 +08:00
..
utils Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
allocateKvCache.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
assignReqSeqSlots.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
cacheFormatter.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
cacheFormatter.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
cacheTransceiver.cpp Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
capacityScheduler.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
CMakeLists.txt Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
contextProgress.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
createNewDecoderRequests.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
dataTransceiver.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
dataTransceiver.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
dataTransceiverImpl.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
dataTransceiverImpl.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
decoderBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
encoderBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
encoderBuffers.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
evictionPolicy.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
generateRequestOptions.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
guidedDecoder.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
handleContextLogits.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
handleGenerationLogits.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
kvCacheEventManager.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
kvCacheManager.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
kvCacheTransferManager.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
llmRequest.cpp Update TensorRT-LLM (#2936) 2025-03-18 21:25:19 +08:00
logitsPostProcessor.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
loraBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
loraBuffers.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
makeDecodingBatchInputOutput.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
medusaBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
microBatchScheduler.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
mlaCacheFormatter.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
mlaCacheFormatter.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
pauseRequests.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
peftCacheManager.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
promptTuningBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
promptTuningBuffers.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
rnnStateBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
rnnStateBuffers.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
rnnStateManager.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
runtimeBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
sequenceSlotManager.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
transformerBuffers.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtEncoderModel.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtEncoderModel.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtGptModel.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtGptModelFactory.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtGptModelInflightBatching.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtGptModelInflightBatching.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtGptModelV1.cpp Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
trtGptModelV1.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00