This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-01-26 13:43:38 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
b4167cce68
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
kernels
/
weightOnlyBatchedGemv
History
DylanChen-NV
74dca0aa7b
[NVBUG-5304516/5319741]Qwen2.5VL FP8 support (
#5029
)
...
Signed-off-by: Dylan Chen <191843203+DylanChen-NV@users.noreply.github.com>
2025-07-09 23:16:42 +08:00
..
common.h
converter.h
cudaCoreGemm.cu
[NVBUG-5304516/5319741]Qwen2.5VL FP8 support (
#5029
)
2025-07-09 23:16:42 +08:00
cudaCoreGemm.h
[NVBUG-5304516/5319741]Qwen2.5VL FP8 support (
#5029
)
2025-07-09 23:16:42 +08:00
details.h
int8SQ.cu
int8SQ.h
kernel.h
[feat] Optimizations on weight-only batched gemv kernel (
#5420
)
2025-06-30 10:20:16 +08:00
kernelDispatcher.h
kernelDispatcherBf16Int4GroupwiseColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherBf16Int4GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherBf16Int4PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherBf16Int4PerChannelColumnMajorInterleavedTrue.cu
kernelDispatcherBf16Int8GroupwiseColumnMajoInterleavedForHopperTrue.cu
kernelDispatcherBf16Int8GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherBf16Int8PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherBf16Int8PerChannelColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int4GroupwiseColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int4GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int4PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int4PerChannelColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int8GroupwiseColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int8GroupwiseColumnMajorInterleavedTrue.cu
kernelDispatcherFp16Int8PerChannelColumnMajorInterleavedForHopperTrue.cu
kernelDispatcherFp16Int8PerChannelColumnMajorInterleavedTrue.cu
kernelLauncher.h
utility.h
[feat] Optimizations on weight-only batched gemv kernel (
#5420
)
2025-06-30 10:20:16 +08:00