TensorRT-LLMs/cpp/tensorrt_llm/plugins/fp8RowwiseGemmPlugin
Tracin 6c91f1c7ac
Mxfp8xmxfp4 quant mode(#4978)
Signed-off-by: Tracin <10434017+Tracin@users.noreply.github.com>
Co-authored-by: QI JUN <22017000+QiJune@users.noreply.github.com>
2025-06-10 22:01:37 +08:00
..
CMakeLists.txt Update TensorRT-LLM (#2008) 2024-07-23 23:05:09 +08:00
fp8RowwiseGemmPlugin.cpp Mxfp8xmxfp4 quant mode(#4978) 2025-06-10 22:01:37 +08:00
fp8RowwiseGemmPlugin.h Update TensorRT-LLM (#2094) 2024-08-07 16:44:43 +08:00