TensorRT-LLMs/cpp/tensorrt_llm/kernels/decoderMaskedMultiheadAttention/decoderXQAImplJIT/nvrtcWrapper
Yao Yao 3545d59635
Support speculative decoding with Hopper XQA (#3269)
Signed-off-by: Yao Yao <lowsfer@users.noreply.github.com>
2025-04-07 17:14:34 +08:00
..
aarch64-linux-gnu Support speculative decoding with Hopper XQA (#3269) 2025-04-07 17:14:34 +08:00
include Support speculative decoding with Hopper XQA (#3269) 2025-04-07 17:14:34 +08:00
x86_64-linux-gnu Support speculative decoding with Hopper XQA (#3269) 2025-04-07 17:14:34 +08:00