TensorRT-LLMs/cpp/tensorrt_llm/kernels/weightOnlyBatchedGemv
Cheng Hang 656c705ff1
[None][feat] sm100 weight-only kernel (#10190)
Signed-off-by: Cheng Hang <chang@nvidia.com>
2026-01-05 09:44:36 +08:00
..
common.h
converter.h
cudaCoreGemm.cu
cudaCoreGemm.h
cudaCoreGemmNVFP4.cu
cudaCoreGemmNVFP4.h
details.h
int8SQ.cu
int8SQ.h
kernel.h
kernelDispatcher.h
kernelDispatcherBf16Int4GroupwiseColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherBf16Int4GroupwiseColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherBf16Int4GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherBf16Int4PerChannelColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherBf16Int4PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherBf16Int4PerChannelColumnMajorInterleavedTrue.cu
kernelDispatcherBf16Int8GroupwiseColumnMajoInterleavedForHopperTrue.cu
kernelDispatcherBf16Int8GroupwiseColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherBf16Int8GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherBf16Int8PerChannelColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherBf16Int8PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherBf16Int8PerChannelColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int4GroupwiseColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherFp16Int4GroupwiseColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int4GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int4PerChannelColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherFp16Int4PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int4PerChannelColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int8GroupwiseColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherFp16Int8GroupwiseColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int8GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int8PerChannelColumnMajorFalse.cu [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
kernelDispatcherFp16Int8PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int8PerChannelColumnMajorInterleavedTrue.cu
kernelLauncher.h [None][feat] sm100 weight-only kernel (#10190) 2026-01-05 09:44:36 +08:00
utility.h