TensorRT-LLMs/cpp/tensorrt_llm/plugins
Kaiyu Xie 587d063e6d
Update TensorRT-LLM (#506)
* Update TensorRT-LLM

---------

Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2023-11-30 16:46:22 +08:00
..
api Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
bertAttentionPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
common Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
gemmPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
gptAttentionCommon Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
gptAttentionPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
identityPlugin Update code 2023-09-28 09:00:05 -07:00
layernormPlugin Updates for release/0.5.0 2023-10-15 21:26:20 +08:00
layernormQuantizationPlugin Update 2023-10-10 23:22:17 -07:00
lookupPlugin Update code 2023-09-28 09:00:05 -07:00
loraPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
ncclPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
quantizePerTokenPlugin Update code 2023-09-28 09:00:05 -07:00
quantizeTensorPlugin Update code 2023-09-28 09:00:05 -07:00
rmsnormPlugin Update code 2023-09-28 09:00:05 -07:00
rmsnormQuantizationPlugin Update 2023-10-10 23:22:17 -07:00
smoothQuantGemmPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
weightOnlyGroupwiseQuantMatmulPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
weightOnlyQuantMatmulPlugin Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
CMakeLists.txt Update TensorRT-LLM (#506) 2023-11-30 16:46:22 +08:00
exports.def Update 2023-10-10 23:22:17 -07:00
exports.map Update code 2023-09-28 09:00:05 -07:00