Aurelien Chartier
|
1389f5a4d3
|
feat: Add support for fp8 rowwise quantization (#4876)
Signed-off-by: Aurelien Chartier <2567591+achartier@users.noreply.github.com>
Co-authored-by: aikitoria <151776613+aikitoria@users.noreply.github.com>
|
2025-06-14 06:37:48 -07:00 |
|
nv-guomingz
|
e76cf9d9fe
|
fix:https://nvbugs/5234033 enable starcoder trt-flow with transforme… (#3909)
fix:https://nvbugs/5234033 enable starcoder trt-flow with transformer 4.51.3.
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
|
2025-05-15 11:16:45 +08:00 |
|
brb-nv
|
727d78e785
|
Support prequantized fp8 ckpt for nemotron-mini-4b-instruct (#3046)
Signed-off-by: Balaram Buddharaju <169953907+brb-nv@users.noreply.github.com>
|
2025-04-01 14:52:09 +08:00 |
|
Kaiyu Xie
|
ab5b19e027
|
Update TensorRT-LLM (#2820)
|
2025-02-25 21:21:49 +08:00 |
|
Kaiyu Xie
|
aaacc9bd68
|
Update TensorRT-LLM (#2562)
* Update TensorRT-LLM
---------
Co-authored-by: Starrick Liu <73152103+StarrickLiu@users.noreply.github.com>
|
2024-12-11 00:31:05 -08:00 |
|
Kaiyu Xie
|
c629546ce4
|
Update TensorRT-LLM (#2436)
|
2024-11-12 15:27:49 +08:00 |
|
Kaiyu Xie
|
f14d1d433c
|
Update TensorRT-LLM (#2389)
* Update TensorRT-LLM
---------
Co-authored-by: Alessio Netti <netti.alessio@gmail.com>
|
2024-10-29 22:24:38 +08:00 |
|
Dan Blanaru
|
48686bca3a
|
open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273)
* Update TensorRT-LLM
---------
Co-authored-by: Qingquan Song <ustcsqq@gmail.com>
|
2024-09-30 13:51:19 +02:00 |
|
Kaiyu Xie
|
31ac30e928
|
Update TensorRT-LLM (#2215)
* Update TensorRT-LLM
---------
Co-authored-by: Sherlock Xu <65327072+Sherlock113@users.noreply.github.com>
|
2024-09-10 18:21:22 +08:00 |
|
Kaiyu Xie
|
78f5c2936b
|
Update TensorRT-LLM (#2184)
|
2024-09-03 12:14:23 +02:00 |
|
石晓伟
|
b8fc6633ba
|
Update TensorRT-LLM (#2156)
Co-authored-by: Bruno Magalhaes <bruno.magalhaes@synthesia.io>
|
2024-08-27 18:20:59 +08:00 |
|
Kaiyu Xie
|
74b324f667
|
Update TensorRT-LLM (#2110)
|
2024-08-13 22:34:33 +08:00 |
|
Kaiyu Xie
|
be9cd719f7
|
Update TensorRT-LLM (#2094)
* Update TensorRT-LLM
---------
Co-authored-by: akhoroshev <arthoroshev@gmail.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
Co-authored-by: Tayef Shah <tayefshah@gmail.com>
Co-authored-by: lfz941 <linfanzai941@gmail.com>
|
2024-08-07 16:44:43 +08:00 |
|
Kaiyu Xie
|
93293aa46d
|
open source 315e9f5ccd286e906d4c0d402fefbf2f69a1febe (#2033)
|
2024-07-26 16:19:24 +08:00 |
|
Kaiyu Xie
|
2d234357c6
|
Update TensorRT-LLM (#1954)
* Update TensorRT-LLM
---------
Co-authored-by: Altair-Alpha <62340011+Altair-Alpha@users.noreply.github.com>
|
2024-07-16 15:30:25 +08:00 |
|
Kaiyu Xie
|
a96cccafcf
|
Update TensorRT-LLM (#1918)
|
2024-07-09 14:42:22 +08:00 |
|
Kaiyu Xie
|
9dbc5b38ba
|
Update TensorRT-LLM (#1891)
* Update TensorRT-LLM
---------
Co-authored-by: Marks101 <markus.schnoes@gmx.de>
Co-authored-by: lkm2835 <lkm2835@gmail.com>
|
2024-07-04 14:37:19 +08:00 |
|
石晓伟
|
2a115dae84
|
Update TensorRT-LLM (#1793)
Co-authored-by: DreamGenX <x@dreamgen.com>
Co-authored-by: Ace-RR <78812427+Ace-RR@users.noreply.github.com>
Co-authored-by: bprus <39293131+bprus@users.noreply.github.com>
Co-authored-by: janpetrov <janpetrov@icloud.com>
|
2024-06-18 18:18:23 +08:00 |
|
Kaiyu Xie
|
b777bd6475
|
Update TensorRT-LLM (#1725)
* Update TensorRT-LLM
---------
Co-authored-by: RunningLeon <mnsheng@yeah.net>
Co-authored-by: Tlntin <TlntinDeng01@Gmail.com>
Co-authored-by: ZHENG, Zhen <zhengzhen.z@qq.com>
Co-authored-by: Pham Van Ngoan <ngoanpham1196@gmail.com>
Co-authored-by: Nathan Price <nathan@abridge.com>
Co-authored-by: Tushar Goel <tushar.goel.ml@gmail.com>
Co-authored-by: Mati <132419219+matichon-vultureprime@users.noreply.github.com>
|
2024-06-04 20:26:32 +08:00 |
|
Kaiyu Xie
|
f430a4b447
|
Update TensorRT-LLM (#1688)
* Update TensorRT-LLM
---------
Co-authored-by: IbrahimAmin <ibrahimamin532@gmail.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
Co-authored-by: Pzzzzz <hello-cd.plus@hotmail.com>
Co-authored-by: CoderHam <hemant@cohere.com>
Co-authored-by: Konstantin Lopuhin <kostia.lopuhin@gmail.com>
|
2024-05-28 20:07:49 +08:00 |
|
Kaiyu Xie
|
bf0a5afc92
|
Update TensorRT-LLM (#1598)
* Update TensorRT-LLM
|
2024-05-14 16:43:41 +08:00 |
|
Kaiyu Xie
|
89ba1b1a67
|
Update TensorRT-LLM (#1554)
|
2024-05-07 23:34:28 +08:00 |
|
Kaiyu Xie
|
71d8d4d3dc
|
Update TensorRT-LLM (#1455)
|
2024-04-16 19:40:08 +08:00 |
|
石晓伟
|
850b6fa1e7
|
Update TensorRT-LLM (#1358)
Co-authored-by: Kaiyu <26294424+kaiyux@users.noreply.github.com>
|
2024-03-26 20:47:14 +08:00 |
|
Kaiyu Xie
|
66ca3378c6
|
Update TensorRT-LLM (#1315)
|
2024-03-19 17:36:42 +08:00 |
|
Kaiyu Xie
|
4bb65f216f
|
Update TensorRT-LLM (#1274)
* Update TensorRT-LLM
---------
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-03-12 18:15:52 +08:00 |
|
Kaiyu Xie
|
728cc0044b
|
Update TensorRT-LLM (#1233)
* Update TensorRT-LLM
---------
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-03-05 18:32:53 +08:00 |
|
Kaiyu Xie
|
655524dd82
|
Update TensorRT-LLM (#1168)
* Update TensorRT-LLM
---------
Co-authored-by: Bhuvanesh Sridharan <bhuvan.sridharan@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-02-27 17:37:34 +08:00 |
|
Kaiyu Xie
|
b57221b764
|
Update TensorRT-LLM (#941)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-23 23:22:35 +08:00 |
|
Kaiyu Xie
|
c89653021e
|
Update TensorRT-LLM (20240116) (#891)
* Update TensorRT-LLM
---------
Co-authored-by: Eddie-Wang1120 <81598289+Eddie-Wang1120@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-16 20:03:11 +08:00 |
|
Kaiyu Xie
|
d879430b04
|
Update TensorRT-LLM (#846)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-09 21:03:35 +08:00 |
|
Kaiyu Xie
|
deaae40bd7
|
Update TensorRT-LLM (#787)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-02 17:54:32 +08:00 |
|
Kaiyu Xie
|
d37b507f41
|
Update TensorRT-LLM main branch (#754)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-12-27 17:41:24 +08:00 |
|
Kaiyu Xie
|
a75618df24
|
Update TensorRT-LLM (#667)
* Update TensorRT-LLM
---------
Co-authored-by: 0xymoro <jerrymeng100@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-12-15 22:14:51 +08:00 |
|
Kaiyu Xie
|
f7eca56161
|
Update TensorRT-LLM (#613)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
Co-authored-by: zhang-ge-hao <842720660@qq.com>
|
2023-12-08 17:49:24 +08:00 |
|
Kaiyu Xie
|
71f60f6df0
|
Update TensorRT-LLM (#524)
|
2023-12-01 22:27:51 +08:00 |
|
Kaiyu Xie
|
711a28d9bf
|
Update TensorRT-LLM (#465)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-11-24 22:12:26 +08:00 |
|
Kaiyu Xie
|
6755a3f077
|
Update TensorRT-LLM (#422)
* Update TensorRT-LLM
---------
Co-authored-by: Tltin <TltinDeng01@gmail.com>
Co-authored-by: zhaohb <zhaohbcloud@126.com>
Co-authored-by: Bradley Heilbrun <brad@repl.it>
Co-authored-by: nqbao11 <nqbao11.01@gmail.com>
Co-authored-by: Nikhil Varghese <nikhil@bot-it.ai>
|
2023-11-18 00:05:54 +08:00 |
|
Kaiyu Xie
|
b2fd493c16
|
Update TensorRT-LLM (#349)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-11-10 22:30:31 +08:00 |
|
Kaiyu Xie
|
f044eb8d94
|
Update TensorRT-LLM (#302)
* Update TensorRT-LLM
---------
Co-authored-by: wangruohui <12756472+wangruohui@users.noreply.github.com>
|
2023-11-07 19:51:58 +08:00 |
|
Kaiyu Xie
|
d8b408e6dc
|
Update TensorRT-LLM (#148)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-10-27 12:10:00 +08:00 |
|
Kaiyu Xie
|
75b6210ff4
|
Kaiyu/update main (#5)
* Update
* Update
|
2023-10-18 22:38:53 +08:00 |
|
Kevin Xie
|
027cd518e3
|
Update
|
2023-10-10 23:22:17 -07:00 |
|
Kevin Xie
|
6e9e318e91
|
Update code
|
2023-09-28 09:00:05 -07:00 |
|
Kaiyu Xie
|
23bc5b7c49
|
Initial commit
|
2023-09-20 00:29:41 -07:00 |
|