TensorRT-LLMs/cpp/include/tensorrt_llm/runtime
Kaiyu Xie be9cd719f7
Update TensorRT-LLM (#2094)
* Update TensorRT-LLM

---------

Co-authored-by: akhoroshev <arthoroshev@gmail.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
Co-authored-by: Tayef Shah <tayefshah@gmail.com>
Co-authored-by: lfz941 <linfanzai941@gmail.com>
2024-08-07 16:44:43 +08:00
..
utils Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
bufferManager.h Update TensorRT-LLM (#1427) 2024-04-09 17:03:34 +08:00
common.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
cudaEvent.h Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
cudaStream.h Update TensorRT-LLM (#1122) 2024-02-21 21:30:55 +08:00
decodingInput.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
decodingOutput.h Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
explicitDraftTokensBuffers.h Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
generationInput.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
generationOutput.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
gptDecoder.h Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
gptDecoderBatched.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
gptJsonConfig.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
gptSession.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
iBuffer.h Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
iGptDecoderBatched.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
ipcUtils.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
iStatefulGptDecoder.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
iTensor.h Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
lookaheadModule.h Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
loraCache.h Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
loraCachePageManagerConfig.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
loraModule.h Update TensorRT-LLM (#2016) 2024-07-24 19:50:28 +08:00
medusaModule.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
memoryCounters.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
modelConfig.h Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
promptTuningParams.h Update TensorRT-LLM (#1598) 2024-05-14 16:43:41 +08:00
rawEngine.h Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
request.h Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
samplingConfig.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
speculativeDecodingMode.h Update TensorRT-LLM (#1793) 2024-06-18 18:18:23 +08:00
speculativeDecodingModule.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
tllmLogger.h Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
worldConfig.h Update TensorRT-LLM (#1688) 2024-05-28 20:07:49 +08:00