TensorRT-LLMs/cpp/tensorrt_llm/plugins/weightOnlyGroupwiseQuantMatmulPlugin
Barry Kang 26793e3569
[https://nvbugs/5289907][fix] Restore per-channel pre-quant (#4545)
* Restore per-channel pre-quant

Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>

* Update TRT test script

Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>

* Fix pre-commit

Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>

---------

Signed-off-by: Barry Kang <43644113+Barry-Delaney@users.noreply.github.com>
2025-05-23 19:46:53 +08:00
..
CMakeLists.txt Initial commit 2023-09-20 00:29:41 -07:00
weightOnlyGroupwiseQuantMatmulPlugin.cpp [https://nvbugs/5289907][fix] Restore per-channel pre-quant (#4545) 2025-05-23 19:46:53 +08:00
weightOnlyGroupwiseQuantMatmulPlugin.h Update TensorRT-LLM (#2792) 2025-02-18 21:27:39 +08:00