TensorRT-LLMs/cpp/tensorrt_llm/kernels/unfusedAttentionKernels
Kaiyu Xie aaacc9bd68
Update TensorRT-LLM (#2562)
* Update TensorRT-LLM

---------

Co-authored-by: Starrick Liu <73152103+StarrickLiu@users.noreply.github.com>
2024-12-11 00:31:05 -08:00
..
unfusedAttentionKernels_2_bf16_bf16.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_bf16_fp8.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_bf16_int8.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_float_float.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_float_fp8.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_float_int8.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_half_fp8.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_half_half.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_half_int8.cu Update TensorRT-LLM (#2532) 2024-12-04 21:16:56 +08:00
unfusedAttentionKernels_2_template.h Update TensorRT-LLM (#2562) 2024-12-11 00:31:05 -08:00