TensorRT-LLMs/cpp/tensorrt_llm/plugins/gptAttentionCommon
DylanChen-NV 1ac0566a93
fix: fix for cp > kvHeadNum (#3002)
* fix for cp > kvHeadNum

Signed-off-by: Dylan Chen <191843203+DylanChen-NV@users.noreply.github.com>

* fix for None kv_head_num

Signed-off-by: Dylan Chen <191843203+DylanChen-NV@users.noreply.github.com>

---------

Signed-off-by: Dylan Chen <191843203+DylanChen-NV@users.noreply.github.com>
2025-03-26 12:39:02 +08:00
..
CMakeLists.txt Initial commit 2023-09-20 00:29:41 -07:00
gptAttentionCommon.cpp fix: fix for cp > kvHeadNum (#3002) 2025-03-26 12:39:02 +08:00
gptAttentionCommon.h fix: fix for cp > kvHeadNum (#3002) 2025-03-26 12:39:02 +08:00
gptAttentionCommonImpl.h Update TensorRT-LLM (#1274) 2024-03-12 18:15:52 +08:00