TensorRT-LLMs/cpp/tensorrt_llm/plugins
Kaiyu Xie 9bd15f1937
TensorRT-LLM v0.10 update
* TensorRT-LLM Release 0.10.0

---------

Co-authored-by: Loki <lokravi@amazon.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
2024-06-05 20:43:25 +08:00
..
api TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
bertAttentionPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
common TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
cumsumLastDimPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
gemmPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
gptAttentionCommon TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
gptAttentionPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
identityPlugin Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
layernormQuantizationPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
lookupPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
loraPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
lruPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
mambaConv1dPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
mixtureOfExperts TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
ncclPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
quantizePerTokenPlugin Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
quantizeTensorPlugin Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
rmsnormQuantizationPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
selectiveScanPlugin Update TensorRT-LLM Release branch (#1445) 2024-04-12 17:59:19 +08:00
smoothQuantGemmPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
weightOnlyGroupwiseQuantMatmulPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
weightOnlyQuantMatmulPlugin TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
CMakeLists.txt TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
exports.def Update 2023-10-10 23:22:17 -07:00
exports.map TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00