DylanChen-NV
|
1ac0566a93
|
fix: fix for cp > kvHeadNum (#3002)
* fix for cp > kvHeadNum
Signed-off-by: Dylan Chen <191843203+DylanChen-NV@users.noreply.github.com>
* fix for None kv_head_num
Signed-off-by: Dylan Chen <191843203+DylanChen-NV@users.noreply.github.com>
---------
Signed-off-by: Dylan Chen <191843203+DylanChen-NV@users.noreply.github.com>
|
2025-03-26 12:39:02 +08:00 |
|
Kaiyu Xie
|
9b931c0f63
|
Update TensorRT-LLM (#2873)
|
2025-03-11 21:13:42 +08:00 |
|
石晓伟
|
548b5b7310
|
Update TensorRT-LLM (#2532)
* blossom-ci.yml: run vulnerability scan on blossom
* open source efb18c1256f8c9c3d47b7d0c740b83e5d5ebe0ec
---------
Co-authored-by: niukuo <6831097+niukuo@users.noreply.github.com>
Co-authored-by: pei0033 <59505847+pei0033@users.noreply.github.com>
Co-authored-by: Kyungmin Lee <30465912+lkm2835@users.noreply.github.com>
Co-authored-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2024-12-04 21:16:56 +08:00 |
|
Kaiyu Xie
|
385626572d
|
Update TensorRT-LLM (#2502)
* Update TensorRT-LLM
---------
Co-authored-by: 岑灿 <yunyi.hyy@alibaba-inc.com>
|
2024-11-26 16:51:34 +08:00 |
|
Kaiyu Xie
|
535c9cc673
|
Update TensorRT-LLM (#2460)
|
2024-11-19 18:30:34 +08:00 |
|
Kaiyu Xie
|
b7868dd1bd
|
Update TensorRT-LLM (#2413)
|
2024-11-05 16:27:06 +08:00 |
|
Kaiyu Xie
|
31ac30e928
|
Update TensorRT-LLM (#2215)
* Update TensorRT-LLM
---------
Co-authored-by: Sherlock Xu <65327072+Sherlock113@users.noreply.github.com>
|
2024-09-10 18:21:22 +08:00 |
|
Kaiyu Xie
|
78f5c2936b
|
Update TensorRT-LLM (#2184)
|
2024-09-03 12:14:23 +02:00 |
|
石晓伟
|
32ed92e449
|
Update TensorRT-LLM
Co-authored-by: Rong Zhou <130957722+ReginaZh@users.noreply.github.com>
Co-authored-by: Onur Galoglu <33498883+ogaloglu@users.noreply.github.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
|
2024-08-20 18:55:15 +08:00 |
|
Kaiyu Xie
|
bca9a33b02
|
Update TensorRT-LLM (#2008)
* Update TensorRT-LLM
---------
Co-authored-by: Timur Abishev <abishev.timur@gmail.com>
Co-authored-by: MahmoudAshraf97 <hassouna97.ma@gmail.com>
Co-authored-by: Saeyoon Oh <saeyoon.oh@furiosa.ai>
Co-authored-by: hattizai <hattizai@gmail.com>
|
2024-07-23 23:05:09 +08:00 |
|
Kaiyu Xie
|
b777bd6475
|
Update TensorRT-LLM (#1725)
* Update TensorRT-LLM
---------
Co-authored-by: RunningLeon <mnsheng@yeah.net>
Co-authored-by: Tlntin <TlntinDeng01@Gmail.com>
Co-authored-by: ZHENG, Zhen <zhengzhen.z@qq.com>
Co-authored-by: Pham Van Ngoan <ngoanpham1196@gmail.com>
Co-authored-by: Nathan Price <nathan@abridge.com>
Co-authored-by: Tushar Goel <tushar.goel.ml@gmail.com>
Co-authored-by: Mati <132419219+matichon-vultureprime@users.noreply.github.com>
|
2024-06-04 20:26:32 +08:00 |
|
Kaiyu Xie
|
f430a4b447
|
Update TensorRT-LLM (#1688)
* Update TensorRT-LLM
---------
Co-authored-by: IbrahimAmin <ibrahimamin532@gmail.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
Co-authored-by: Pzzzzz <hello-cd.plus@hotmail.com>
Co-authored-by: CoderHam <hemant@cohere.com>
Co-authored-by: Konstantin Lopuhin <kostia.lopuhin@gmail.com>
|
2024-05-28 20:07:49 +08:00 |
|
Kaiyu Xie
|
89ba1b1a67
|
Update TensorRT-LLM (#1554)
|
2024-05-07 23:34:28 +08:00 |
|
Kaiyu Xie
|
71d8d4d3dc
|
Update TensorRT-LLM (#1455)
|
2024-04-16 19:40:08 +08:00 |
|
Kaiyu Xie
|
4bb65f216f
|
Update TensorRT-LLM (#1274)
* Update TensorRT-LLM
---------
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-03-12 18:15:52 +08:00 |
|
Kaiyu Xie
|
b57221b764
|
Update TensorRT-LLM (#941)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-23 23:22:35 +08:00 |
|
Kaiyu Xie
|
c89653021e
|
Update TensorRT-LLM (20240116) (#891)
* Update TensorRT-LLM
---------
Co-authored-by: Eddie-Wang1120 <81598289+Eddie-Wang1120@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-16 20:03:11 +08:00 |
|
Kaiyu Xie
|
deaae40bd7
|
Update TensorRT-LLM (#787)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-02 17:54:32 +08:00 |
|
Kaiyu Xie
|
d37b507f41
|
Update TensorRT-LLM main branch (#754)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-12-27 17:41:24 +08:00 |
|
Kaiyu Xie
|
f7eca56161
|
Update TensorRT-LLM (#613)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
Co-authored-by: zhang-ge-hao <842720660@qq.com>
|
2023-12-08 17:49:24 +08:00 |
|
Kaiyu Xie
|
6755a3f077
|
Update TensorRT-LLM (#422)
* Update TensorRT-LLM
---------
Co-authored-by: Tltin <TltinDeng01@gmail.com>
Co-authored-by: zhaohb <zhaohbcloud@126.com>
Co-authored-by: Bradley Heilbrun <brad@repl.it>
Co-authored-by: nqbao11 <nqbao11.01@gmail.com>
Co-authored-by: Nikhil Varghese <nikhil@bot-it.ai>
|
2023-11-18 00:05:54 +08:00 |
|
Kaiyu Xie
|
b2fd493c16
|
Update TensorRT-LLM (#349)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-11-10 22:30:31 +08:00 |
|
Kaiyu Xie
|
75b6210ff4
|
Kaiyu/update main (#5)
* Update
* Update
|
2023-10-18 22:38:53 +08:00 |
|
Kevin Xie
|
027cd518e3
|
Update
|
2023-10-10 23:22:17 -07:00 |
|
Kevin Xie
|
6e9e318e91
|
Update code
|
2023-09-28 09:00:05 -07:00 |
|
Kaiyu Xie
|
23bc5b7c49
|
Initial commit
|
2023-09-20 00:29:41 -07:00 |
|