TensorRT-LLMs/cpp/include/tensorrt_llm/batch_manager
Kaiyu Xie e153372759
Update TensorRT-LLM (#2253)
* Update TensorRT-LLM

---------

Co-authored-by: Ivan Sorokin <isorokin@nvidia.com>
Co-authored-by: lkm2835 <lkm2835@gmail.com>
2024-09-24 17:27:31 +02:00
..
BatchManager.h Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
callbacks.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
GptManager.h Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00
inferenceRequest.h Update TensorRT-LLM (#2215) 2024-09-10 18:21:22 +08:00
kvCacheConfig.h Update TensorRT-LLM (#2110) 2024-08-13 22:34:33 +08:00
kvCacheManager.h Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
kvCacheUtils.h Update TensorRT-LLM (#1954) 2024-07-16 15:30:25 +08:00
llmRequest.h Update TensorRT-LLM (#2253) 2024-09-24 17:27:31 +02:00
namedTensor.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00
peftCacheManager.h Update TensorRT-LLM (#1725) 2024-06-04 20:26:32 +08:00
peftCacheManagerConfig.h Update TensorRT-LLM (#2215) 2024-09-10 18:21:22 +08:00
trtGptModelOptionalParams.h Update TensorRT-LLM (#2215) 2024-09-10 18:21:22 +08:00