TensorRT-LLMs/cpp/tensorrt_llm/kernels/unfusedAttentionKernels
Kaiyu Xie 9bd15f1937
TensorRT-LLM v0.10 update
* TensorRT-LLM Release 0.10.0

---------

Co-authored-by: Loki <lokravi@amazon.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
2024-06-05 20:43:25 +08:00
..
unfusedAttentionKernels_2_bf16_bf16.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_bf16_fp8.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_bf16_int8.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_float_float.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_float_fp8.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_float_int8.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_half_fp8.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_half_half.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_half_int8.cu TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00
unfusedAttentionKernels_2_template.h TensorRT-LLM v0.10 update 2024-06-05 20:43:25 +08:00