TensorRT-LLMs/cpp/tensorrt_llm/kernels/speculativeDecoding
Kaiyu Xie db4edea1e1
Update TensorRT-LLM (#1763)
* Update TensorRT-LLM

---------

Co-authored-by: Kota Tsuyuzaki <bloodeagle40234@gmail.com>
Co-authored-by: Pzzzzz <hello-cd.plus@hotmail.com>
Co-authored-by: Patrick Reiter Horn <patrick.horn@gmail.com>
2024-06-11 16:59:02 +08:00
..
common.cu Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
common.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
explicitDraftTokensKernels.cu Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
explicitDraftTokensKernels.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
externalDraftTokensKernels.cu Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
externalDraftTokensKernels.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
kvCacheUpdateKernels.cu Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
kvCacheUpdateKernels.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
medusaDecodingKernels.cu Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
medusaDecodingKernels.h Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00