TensorRT-LLMs/cpp/tensorrt_llm/plugins
2024-11-05 16:27:06 +08:00
..
api Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
bertAttentionPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
common Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
cumsumLastDimPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
eaglePlugin Update TensorRT-LLM (#2363) 2024-10-22 20:27:35 +08:00
fp8RowwiseGemmPlugin Update TensorRT-LLM (#2363) 2024-10-22 20:27:35 +08:00
gemmPlugin Update TensorRT-LLM 2024-08-20 18:55:15 +08:00
gemmSwigluPlugin Update TensorRT-LLM (#2053) 2024-07-30 21:25:01 +08:00
gptAttentionCommon Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
gptAttentionPlugin Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
identityPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
layernormQuantizationPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
lookupPlugin Update TensorRT-LLM (#1763) 2024-06-11 16:59:02 +08:00
loraPlugin open source 4dbf696ae9b74a26829d120b67ab8443d70c8e58 (#2297) 2024-10-08 12:19:19 +02:00
lowLatencyGemmPlugin Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
lowLatencyGemmSwigluPlugin Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
lruPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
mambaConv1dPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
mixtureOfExperts Update TensorRT-LLM (#2184) 2024-09-03 12:14:23 +02:00
ncclPlugin Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
quantizePerTokenPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
quantizeTensorPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
rmsnormQuantizationPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
selectiveScanPlugin open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273) 2024-09-30 13:51:19 +02:00
smoothQuantGemmPlugin Update TensorRT-LLM (#2053) 2024-07-30 21:25:01 +08:00
weightOnlyGroupwiseQuantMatmulPlugin Update TensorRT-LLM (#2333) 2024-10-15 15:28:40 +08:00
weightOnlyQuantMatmulPlugin Update TensorRT-LLM (#2333) 2024-10-15 15:28:40 +08:00
CMakeLists.txt Update TensorRT-LLM (#2413) 2024-11-05 16:27:06 +08:00
exports.def Update 2023-10-10 23:22:17 -07:00
exports.map Update TensorRT-LLM (#1530) 2024-04-30 17:19:10 +08:00