TensorRT-LLMs/cpp/tensorrt_llm/kernels/trtllmGenKernels/fp8BlockScaleMoe/gemmCubins
2025-03-11 21:13:42 +08:00
..
MoE_ProjDown__BatchN_E4m3Fp32_Bfloat16_Tile128x128x128_EpiTile64x128_Mma64x128x32_Cluster1x1x1_transposeMmaOutput_DsFp8_sm100a_cubin.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
MoE_ProjDown__BatchN_E4m3Fp32_E4m3_Tile128x128x128_EpiTile64x128_Mma64x128x32_Cluster1x1x1_transposeMmaOutput_DsFp8_sm100a_cubin.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00
MoE_ProjUp__BatchN_E4m3Fp32_E4m3_Tile128x128x128_EpiTile64x128_Mma64x128x32_Cluster1x1x1_transposeMmaOutput_DsFp8_InplaceRoute_sm100a_cubin.h Update TensorRT-LLM (#2873) 2025-03-11 21:13:42 +08:00