This website requires JavaScript.
Explore
Help
Sign In
kanshan
/
TensorRT-LLMs
Watch
1
Star
0
Fork
0
You've already forked TensorRT-LLMs
mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced
2026-02-03 01:31:30 +08:00
Code
Issues
Actions
1
Packages
Projects
Releases
Wiki
Activity
9db769ee62
TensorRT-LLMs
/
cpp
/
tensorrt_llm
/
kernels
/
unfusedAttentionKernels
History
Daniel Stokes
3a4851b7c3
feat: Add Mixture of Experts FP8xMXFP4 support (
#4750
)
...
Signed-off-by: Daniel Stokes <40156487+djns99@users.noreply.github.com>
2025-06-09 13:25:04 +08:00
..
unfusedAttentionKernels_2_bf16_bf16.cu
unfusedAttentionKernels_2_bf16_fp4.cu
unfusedAttentionKernels_2_bf16_fp8.cu
unfusedAttentionKernels_2_bf16_int8.cu
unfusedAttentionKernels_2_float_float.cu
unfusedAttentionKernels_2_float_fp8.cu
unfusedAttentionKernels_2_float_int8.cu
unfusedAttentionKernels_2_half_fp4.cu
unfusedAttentionKernels_2_half_fp8.cu
unfusedAttentionKernels_2_half_half.cu
unfusedAttentionKernels_2_half_int8.cu
unfusedAttentionKernels_2_template.h
feat: Add Mixture of Experts FP8xMXFP4 support (
#4750
)
2025-06-09 13:25:04 +08:00