TensorRT-LLMs/cpp/tensorrt_llm/kernels/contextFusedMultiHeadAttention
石晓伟 548b5b7310
Update TensorRT-LLM (#2532)
* blossom-ci.yml: run vulnerability scan on blossom

* open source efb18c1256f8c9c3d47b7d0c740b83e5d5ebe0ec

---------

Co-authored-by: niukuo <6831097+niukuo@users.noreply.github.com>
Co-authored-by: pei0033 <59505847+pei0033@users.noreply.github.com>
Co-authored-by: Kyungmin Lee <30465912+lkm2835@users.noreply.github.com>
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2024-12-04 21:16:56 +08:00
Name                                Last commit                  Date
cubin                               Update TensorRT-LLM (#2532)  2024-12-04 21:16:56 +08:00
CMakeLists.txt                      Update TensorRT-LLM (#2094)  2024-08-07 16:44:43 +08:00
fmhaPackedMask.cu                   Update TensorRT-LLM (#2363)  2024-10-22 20:27:35 +08:00
fmhaPackedMask.h                    Update TensorRT-LLM (#2363)  2024-10-22 20:27:35 +08:00
fmhaRunner.cpp                      Update TensorRT-LLM (#2532)  2024-12-04 21:16:56 +08:00
fmhaRunner.h                        Update TensorRT-LLM (#2363)  2024-10-22 20:27:35 +08:00
fused_multihead_attention_common.h  Update TensorRT-LLM (#2502)  2024-11-26 16:51:34 +08:00
fused_multihead_attention_v2.h      Update TensorRT-LLM (#2502)  2024-11-26 16:51:34 +08:00
tmaDescriptor.h                     Update TensorRT-LLM (#1274)  2024-03-12 18:15:52 +08:00