Kaiyu Xie
|
f430a4b447
|
Update TensorRT-LLM (#1688)
* Update TensorRT-LLM
---------
Co-authored-by: IbrahimAmin <ibrahimamin532@gmail.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
Co-authored-by: Pzzzzz <hello-cd.plus@hotmail.com>
Co-authored-by: CoderHam <hemant@cohere.com>
Co-authored-by: Konstantin Lopuhin <kostia.lopuhin@gmail.com>
|
2024-05-28 20:07:49 +08:00 |
|
Kaiyu Xie
|
5d8ca2faf7
|
Update TensorRT-LLM (#1639)
* Update TensorRT-LLM
---------
Co-authored-by: vonjackustc <fga@mail.ustc.edu.cn>
|
2024-05-21 17:51:02 +08:00 |
|
Minwoo Lee
|
b189b61312
|
Fix mistral v0.1 build instructions (#1373)
|
2024-05-20 18:16:02 +08:00 |
|
Samriddha Sinha
|
309ab33db0
|
Update dead links in perf-best-practices.md (#1545)
|
2024-05-20 18:05:14 +08:00 |
|
Seongjong Bae
|
2759e628d9
|
Update customAllReduceKernels.cu (#1558)
|
2024-05-20 17:56:57 +08:00 |
|
Kaiyu Xie
|
bf0a5afc92
|
Update TensorRT-LLM (#1598)
* Update TensorRT-LLM
|
2024-05-14 16:43:41 +08:00 |
|
Kaiyu Xie
|
89ba1b1a67
|
Update TensorRT-LLM (#1554)
|
2024-05-07 23:34:28 +08:00 |
|
Kaiyu Xie
|
06c0e9b1ec
|
Update TensorRT-LLM (#1530)
|
2024-04-30 17:19:10 +08:00 |
|
Kaiyu Xie
|
66ef1df492
|
Update TensorRT-LLM (#1492)
* Update TensorRT-LLM
---------
Co-authored-by: Loki <lokravi@amazon.com>
|
2024-04-24 14:44:22 +08:00 |
|
Kaiyu Xie
|
71d8d4d3dc
|
Update TensorRT-LLM (#1455)
|
2024-04-16 19:40:08 +08:00 |
|
Kaiyu Xie
|
035b99e0d0
|
Update TensorRT-LLM (#1427)
* Update TensorRT-LLM
---------
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
|
2024-04-09 17:03:34 +08:00 |
|
Kaiyu Xie
|
118b3d7e7b
|
Update TensorRT-LLM (#1387)
|
2024-04-01 16:39:43 +08:00 |
|
石晓伟
|
850b6fa1e7
|
Update TensorRT-LLM (#1358)
Co-authored-by: Kaiyu <26294424+kaiyux@users.noreply.github.com>
|
2024-03-26 20:47:14 +08:00 |
|
Kaiyu Xie
|
66ca3378c6
|
Update TensorRT-LLM (#1315)
|
2024-03-19 17:36:42 +08:00 |
|
Kaiyu Xie
|
4bb65f216f
|
Update TensorRT-LLM (#1274)
* Update TensorRT-LLM
---------
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-03-12 18:15:52 +08:00 |
|
Kaiyu Xie
|
728cc0044b
|
Update TensorRT-LLM (#1233)
* Update TensorRT-LLM
---------
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-03-05 18:32:53 +08:00 |
|
Ashwinkumar J S
|
b7c309d1c9
|
Update requirements.txt (#1146)
|
2024-02-27 22:09:31 +08:00 |
|
Kaiyu Xie
|
655524dd82
|
Update TensorRT-LLM (#1168)
* Update TensorRT-LLM
---------
Co-authored-by: Bhuvanesh Sridharan <bhuvan.sridharan@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-02-27 17:37:34 +08:00 |
|
HeAndres
|
e4e09dafea
|
Substitute deprecated nvidia-docker (#1096)
|
2024-02-26 11:08:49 +08:00 |
|
Tejaswin Parthasarathy
|
3c373ebc5b
|
fix : remove mentions of context plugin (#1128)
Update examples/gemma/README.md
|
2024-02-22 09:26:03 +08:00 |
|
byshiue_NV
|
8f4b4df27e
|
Update README.md (#1126)
update ammo branch from 0.7.0 to 0.7.3
|
2024-02-21 21:59:16 +08:00 |
|
Kaiyu Xie
|
eb8f26c7e4
|
Update TensorRT-LLM (#1122)
* Update TensorRT-LLM
---------
Co-authored-by: Eddie-Wang1120 <wangjinheng1120@163.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-02-21 21:30:55 +08:00 |
|
Kaiyu Xie
|
0f041b7b57
|
Update TensorRT-LLM (#1098)
* Update TensorRT-LLM
* update submodule
* Remove unused binaries
|
2024-02-18 15:48:08 +08:00 |
|
Kaiyu Xie
|
0ab9d17a59
|
Update TensorRT-LLM (#1055)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-02-06 18:38:07 +08:00 |
|
Karl Lok (Zhaokai Luo)
|
3d56a445e8
|
Fix typo in cutlass preprocessors (#859)
Co-authored-by: karll <karlluo@tencent.com>
|
2024-02-01 14:27:58 +08:00 |
|
BasicCoder
|
b310ec6751
|
Fix typo in perf_best_practices.md (#857)
Fix typo.
|
2024-02-01 09:58:39 +08:00 |
|
Kaiyu Xie
|
e06f537e08
|
Update TensorRT-LLM (#1019)
* Update TensorRT-LLM
---------
Co-authored-by: erenup <ping.nie@pku.edu.cn>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-31 21:55:32 +08:00 |
|
石晓伟
|
da79354b8e
|
Update TensorRT-LLM (#1017)
|
2024-01-31 17:48:46 +08:00 |
|
juney-nvidia
|
744bab6855
|
fix a typo (#1014)
|
2024-01-31 15:03:41 +08:00 |
|
juney-nvidia
|
a40dbae30d
|
Doc update 20240130 (#1009)
* doc updates
|
2024-01-31 03:40:22 +08:00 |
|
Kaiyu Xie
|
b57221b764
|
Update TensorRT-LLM (#941)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-23 23:22:35 +08:00 |
|
Kaiyu Xie
|
c89653021e
|
Update TensorRT-LLM (20240116) (#891)
* Update TensorRT-LLM
---------
Co-authored-by: Eddie-Wang1120 <81598289+Eddie-Wang1120@users.noreply.github.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-16 20:03:11 +08:00 |
|
Yan Chunwei
|
12e82e30b0
|
init (#848)
|
2024-01-09 22:48:48 +08:00 |
|
Kaiyu Xie
|
d879430b04
|
Update TensorRT-LLM (#846)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-09 21:03:35 +08:00 |
|
juney-nvidia
|
6cc5e177ff
|
Update issue templates
|
2024-01-03 16:22:51 +08:00 |
|
juney-nvidia
|
a413d132b8
|
Update issue templates
|
2024-01-03 16:22:03 +08:00 |
|
Kaiyu Xie
|
deaae40bd7
|
Update TensorRT-LLM (#787)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2024-01-02 17:54:32 +08:00 |
|
Kaiyu Xie
|
d37b507f41
|
Update TensorRT-LLM main branch (#754)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-12-27 17:41:24 +08:00 |
|
Kaiyu Xie
|
a75618df24
|
Update TensorRT-LLM (#667)
* Update TensorRT-LLM
---------
Co-authored-by: 0xymoro <jerrymeng100@gmail.com>
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-12-15 22:14:51 +08:00 |
|
Kaiyu Xie
|
f7eca56161
|
Update TensorRT-LLM (#613)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
Co-authored-by: zhang-ge-hao <842720660@qq.com>
|
2023-12-08 17:49:24 +08:00 |
|
Kaiyu Xie
|
42af740db5
|
Update badge (#551)
|
2023-12-04 15:27:40 +01:00 |
|
juney-nvidia
|
6cff6e6058
|
Quick doc update (#550)
|
2023-12-04 22:15:23 +08:00 |
|
石晓伟
|
e093e48459
|
Update latest news (#549)
|
2023-12-04 22:04:00 +08:00 |
|
Kaiyu Xie
|
71f60f6df0
|
Update TensorRT-LLM (#524)
|
2023-12-01 22:27:51 +08:00 |
|
Kaiyu Xie
|
711a28d9bf
|
Update TensorRT-LLM (#465)
* Update TensorRT-LLM
---------
Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2023-11-24 22:12:26 +08:00 |
|
Kaiyu Xie
|
6755a3f077
|
Update TensorRT-LLM (#422)
* Update TensorRT-LLM
---------
Co-authored-by: Tltin <TltinDeng01@gmail.com>
Co-authored-by: zhaohb <zhaohbcloud@126.com>
Co-authored-by: Bradley Heilbrun <brad@repl.it>
Co-authored-by: nqbao11 <nqbao11.01@gmail.com>
Co-authored-by: Nikhil Varghese <nikhil@bot-it.ai>
|
2023-11-18 00:05:54 +08:00 |
|
Kaiyu Xie
|
ab7b4614b8
|
Update latest news (#378)
* Update latest news
* Minor modification
|
2023-11-14 17:15:36 +08:00 |
|
石晓伟
|
c1718bbddf
|
Add Latest News section (#367)
|
2023-11-13 21:10:01 +08:00 |
|
石晓伟
|
ec769d63f9
|
Add Latest News section (#365)
|
2023-11-13 20:56:22 +08:00 |
|
石晓伟
|
24cf8de078
|
Add Latest News section (#362)
Co-authored-by: Shi Xiaowei <xiaoweis@nvidia.com>
|
2023-11-13 15:17:23 +08:00 |
|