TensorRT-LLMs/cpp/tensorrt_llm/kernels/weightOnlyBatchedGemv
Kaiyu Xie 728cc0044b
Update TensorRT-LLM (#1233)
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2024-03-05 18:32:53 +08:00
sm90                              Update TensorRT-LLM (#1168)  2024-02-27 17:37:34 +08:00
common.h                          Update TensorRT-LLM (#1098)  2024-02-18 15:48:08 +08:00
enabled.h                         Update TensorRT-LLM (#1098)  2024-02-18 15:48:08 +08:00
kernel.h                          Update TensorRT-LLM (#1233)  2024-03-05 18:32:53 +08:00
kernelLauncher.cu                 Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
kernelLauncher.h                  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
utility.h                         Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs1Int4b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs1Int8b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs2Int4b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs2Int8b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs3Int4b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs3Int8b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs4Int4b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00
weightOnlyBatchedGemvBs4Int8b.cu  Update TensorRT-LLM (#787)   2024-01-02 17:54:32 +08:00