TensorRT-LLMs/cpp/kernels
Jhao-Ting Chen 220dc01372
[None][feat] support JIT mha.cu for SPEC_DEC in runtime (#6078)
Signed-off-by: Jhao-Ting Chen <jhaotingc@nvidia.com>
2025-09-23 14:56:17 -07:00
..
fmha_v2 [TRTLLM-6577][feat] Support nano_v2_vlm in pytorch backend (#7207) 2025-09-18 16:26:20 +08:00
xqa [None][feat] support JIT mha.cu for SPEC_DEC in runtime (#6078) 2025-09-23 14:56:17 -07:00