TensorRT-LLMs/cpp/tensorrt_llm/kernels/unfusedAttentionKernels
石晓伟 b8fc6633ba
Update TensorRT-LLM (#2156)
Co-authored-by: Bruno Magalhaes <bruno.magalhaes@synthesia.io>
2024-08-27 18:20:59 +08:00
unfusedAttentionKernels_2_bf16_bf16.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_bf16_fp8.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_bf16_int8.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_float_float.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_float_fp8.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_float_int8.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_half_fp8.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_half_half.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_half_int8.cu Update TensorRT-LLM (#1455) 2024-04-16 19:40:08 +08:00
unfusedAttentionKernels_2_template.h Update TensorRT-LLM (#2156) 2024-08-27 18:20:59 +08:00