TensorRT-LLMs/cpp/include/tensorrt_llm/runtime
Kaiyu Xie 8681b3a4c0
open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297)
* Update TensorRT-LLM

---------

Co-authored-by: Bhuvanesh Sridharan <bhuvanesh.sridharan@sprinklr.com>
Co-authored-by: Qingquan Song <ustcsqq@gmail.com>
2024-10-08 12:19:19 +02:00
..
utils Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
bufferManager.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
common.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
cudaEvent.h Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
cudaStream.h Update TensorRT-LLM (#1122) 2024-02-21 21:30:55 +08:00
decodingInput.h open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297) 2024-10-08 12:19:19 +02:00
decodingOutput.h open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297) 2024-10-08 12:19:19 +02:00
explicitDraftTokensBuffers.h Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
generationInput.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
generationOutput.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
gptDecoder.h open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297) 2024-10-08 12:19:19 +02:00
gptDecoderBatched.h open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297) 2024-10-08 12:19:19 +02:00
gptJsonConfig.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
gptSession.h Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00
iBuffer.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
iGptDecoderBatched.h open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
ipcUtils.h Update TensorRT-LLM (#2215) 2024-09-10 18:21:22 +08:00
iStatefulGptDecoder.h Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
iTensor.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
lookaheadBuffers.h open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
lookaheadModule.h Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
loraCache.h Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
loraCachePageManagerConfig.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
loraModule.h open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
medusaModule.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
memoryCounters.h Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00
modelConfig.h open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
promptTuningParams.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
rawEngine.h Update TensorRT-LLM (#2215) 2024-09-10 18:21:22 +08:00
request.h Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00
samplingConfig.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
speculativeDecodingMode.h open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
speculativeDecodingModule.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
tllmLogger.h Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
worldConfig.h Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00