TensorRT-LLMs/tensorrt_llm/models/enc_dec
xiweny da47d5f27e
fix: nvbugs/5075538: fix cross attention mask when decoder input len > 1 (#3585)
* fix: nvbugs/5075538: fix cross attention mask when decoder input len > 1

Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>

* remove waiver

Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>

---------

Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-04-16 08:31:33 +08:00
..
__init__.py Update TensorRT-LLM (#787) 2024-01-02 17:54:32 +08:00
model.py fix: nvbugs/5075538: fix cross attention mask when decoder input len > 1 (#3585) 2025-04-16 08:31:33 +08:00