TensorRT-LLMs/cpp/tensorrt_llm/plugins/weightOnlyGroupwiseQuantMatmulPlugin
NVJiangShao 6cc3f2093a
Fix bias shape in weightOnlyGroupwiseQuantMatmulPlugin for TRT workflow (#4348)
Signed-off-by: Jiang Shao <91270701+StudyingShao@users.noreply.github.com>
Co-authored-by: AIDC-AI <AIDC-AIB@365fanyi.com>
2025-05-16 10:02:30 +08:00
..
CMakeLists.txt Initial commit 2023-09-20 00:29:41 -07:00
weightOnlyGroupwiseQuantMatmulPlugin.cpp Fix bias shape in weightOnlyGroupwiseQuantMatmulPlugin for TRT workflow (#4348) 2025-05-16 10:02:30 +08:00
weightOnlyGroupwiseQuantMatmulPlugin.h Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00