TensorRT-LLMs/cpp/tensorrt_llm/plugins
Dan Blanaru 48686bca3a
open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273)
* Update TensorRT-LLM

---------
Co-authored-by: Qingquan Song <ustcsqq@gmail.com>
2024-09-30 13:51:19 +02:00
api Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
bertAttentionPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
common Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
cumsumLastDimPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
fp8RowwiseGemmPlugin Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00
gemmPlugin Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
gemmSwigluPlugin Update TensorRT-LLM (#2053) 2024-07-30 21:25:01 +08:00
gptAttentionCommon open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
gptAttentionPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
identityPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
layernormQuantizationPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
lookupPlugin Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
loraPlugin Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
lowLatencyGemmPlugin Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
lruPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
mambaConv1dPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
mixtureOfExperts Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
ncclPlugin Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
quantizePerTokenPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
quantizeTensorPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
rmsnormQuantizationPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
selectiveScanPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
smoothQuantGemmPlugin Update TensorRT-LLM (#2053) 2024-07-30 21:25:01 +08:00
weightOnlyGroupwiseQuantMatmulPlugin Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
weightOnlyQuantMatmulPlugin Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
CMakeLists.txt Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
exports.def Update 2023-10-10 23:22:17 -07:00
exports.map Update TensorRT-LLM (#1530) 2024-04-30 17:19:10 +08:00