TensorRT-LLMs/cpp/tensorrt_llm/kernels/decoderMaskedMultiheadAttention/cubin
2023-12-01 22:27:51 +08:00
..
xqa_kernel_cubin.h Update TensorRT-LLM (#524) 2023-12-01 22:27:51 +08:00
xqa_kernel_dt_fp16_d_128_beam_1_kvt_fp16_nqpkv_8_sm_80.cubin.cpp Update TensorRT-LLM (#524) 2023-12-01 22:27:51 +08:00
xqa_kernel_dt_fp16_d_128_beam_1_kvt_fp16_nqpkv_8_sm_86.cubin.cpp Update TensorRT-LLM (#524) 2023-12-01 22:27:51 +08:00
xqa_kernel_dt_fp16_d_128_beam_1_kvt_fp16_nqpkv_8_sm_89.cubin.cpp Update TensorRT-LLM (#524) 2023-12-01 22:27:51 +08:00
xqa_kernel_dt_fp16_d_128_beam_1_kvt_fp16_nqpkv_8_sm_90.cubin.cpp Update TensorRT-LLM (#524) 2023-12-01 22:27:51 +08:00