Tin-Yin Lai
|
6c52bb07ff
|
[https://nvbugs/5302040][feat] Add whisper support (Bert Attention on SM100 and GPTAttention for cross attention on SM100) (#5527)
Signed-off-by: tinyinl <tinyinl@nvidia.com>
|
2025-08-13 11:19:13 -07:00 |
|
forrestl
|
9477661f4c
|
Support RingAttention in the BertAttention plugin and the DiT model (#3661)
Signed-off-by: ChunhuanLin <lch_xdu@163.com>
|
2025-05-09 08:06:54 +08:00 |
|
Kaiyu Xie
|
77d7fe1eb2
|
Update TensorRT-LLM (#2849)
Co-authored-by: aotman <chenhangatm@gmail.com>
|
2025-03-04 18:44:00 +08:00 |
|
Kaiyu Xie
|
ab5b19e027
|
Update TensorRT-LLM (#2820)
|
2025-02-25 21:21:49 +08:00 |
|
Kaiyu Xie
|
9dbc5b38ba
|
Update TensorRT-LLM (#1891)
Co-authored-by: Marks101 <markus.schnoes@gmx.de>
Co-authored-by: lkm2835 <lkm2835@gmail.com>
|
2024-07-04 14:37:19 +08:00 |
|
Kaiyu Xie
|
4bb65f216f
|
Update TensorRT-LLM (#1274)
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-03-12 18:15:52 +08:00 |
|
Kaiyu Xie
|
a75618df24
|
Update TensorRT-LLM (#667)
Co-authored-by: 0xymoro <jerrymeng100@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-12-15 22:14:51 +08:00 |
|
Kaiyu Xie
|
6755a3f077
|
Update TensorRT-LLM (#422)
Co-authored-by: Tltin <TltinDeng01@gmail.com>
Co-authored-by: zhaohb <zhaohbcloud@126.com>
Co-authored-by: Bradley Heilbrun <brad@repl.it>
Co-authored-by: nqbao11 <nqbao11.01@gmail.com>
Co-authored-by: Nikhil Varghese <nikhil@bot-it.ai>
|
2023-11-18 00:05:54 +08:00 |
|
Kaiyu Xie
|
75b6210ff4
|
Kaiyu/update main (#5)
* Update
* Update
|
2023-10-18 22:38:53 +08:00 |
|
Kevin Xie
|
027cd518e3
|
Update
|
2023-10-10 23:22:17 -07:00 |
|
Kevin Xie
|
6e9e318e91
|
Update code
|
2023-09-28 09:00:05 -07:00 |
|
Kaiyu Xie
|
23bc5b7c49
|
Initial commit
|
2023-09-20 00:29:41 -07:00 |
|