TensorRT-LLMs/cpp/tensorrt_llm/kernels/trtllmGenKernels/fmha
Perkz Zheng 1f292ff2a0
[https://jirasw.nvidia.com/browse/TRTLLM-4645] support multiCtasKvMode for high-throughput MLA kernels (#5426)
Signed-off-by: Perkz Zheng <67892460+PerkzZheng@users.noreply.github.com>
2025-06-25 16:31:10 +08:00
cubin
CMakeLists.txt
fmhaKernels.h
fmhaRunner.cpp
fmhaRunner.h
fmhaRunnerParams.h
kernelParams.h