TensorRT-LLMs/cpp/tensorrt_llm/kernels/decoderMaskedMultiheadAttention/decoderXQAImplJIT
Jhao-Ting Chen 220dc01372
[None][feat] support JIT mha.cu for SPEC_DEC in runtime (#6078)
Signed-off-by: Jhao-Ting Chen <jhaotingc@nvidia.com>
2025-09-23 14:56:17 -07:00
..
nvrtcWrapper
compileEngine.cpp
compileEngine.h
cubinObj.cpp
cubinObj.h
cubinObjRegistry.h
decoderXQAImplJIT.cpp [None][feat] support JIT mha.cu for SPEC_DEC in runtime (#6078) 2025-09-23 14:56:17 -07:00
decoderXQAImplJIT.h [None][feat] support JIT mha.cu for SPEC_DEC in runtime (#6078) 2025-09-23 14:56:17 -07:00
kernelUtils.cpp
kernelUtils.h
serializationUtils.h