TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-25 21:22:57 +08:00

Kaiyu Xie 80bc07510a Update TensorRT-LLM Release branch (#745) Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>	2023-12-26 19:42:17 +08:00
cubin	Update TensorRT-LLM (#708)	2023-12-20 16:38:28 +08:00
pagedKVCubin	Update TensorRT-LLM (#708)	2023-12-20 16:38:28 +08:00
fmhaRunner.cpp	Update TensorRT-LLM Release branch (#745)	2023-12-26 19:42:17 +08:00
fmhaRunner.h	Update TensorRT-LLM (#708)	2023-12-20 16:38:28 +08:00
fused_multihead_attention_common.h	Update TensorRT-LLM (#708)	2023-12-20 16:38:28 +08:00
fused_multihead_attention_v2.h	Update TensorRT-LLM (#708)	2023-12-20 16:38:28 +08:00
tmaDescriptor.h	Initial commit	2023-09-20 00:29:41 -07:00