Commit Graph

53 Commits

Author SHA1 Message Date
Kaiyu Xie
9bd15f1937
TensorRT-LLM v0.10 update
* TensorRT-LLM Release 0.10.0

---------

Co-authored-by: Loki <lokravi@amazon.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
2024-06-05 20:43:25 +08:00
Kaiyu Xie
a9356d4b76
Fix document (#1462) 2024-04-17 13:00:20 +08:00
石晓伟
6533c4e779
Update documents for release 0.9 (#1461) 2024-04-17 11:51:50 +08:00
Kaiyu Xie
250d9c293d
Update TensorRT-LLM Release branch (#1445)
* Update TensorRT-LLM

---------

Co-authored-by: Bhuvanesh Sridharan <bhuvan.sridharan@gmail.com>
Co-authored-by: Morgan Funtowicz <funtowiczmo@gmail.com>
Co-authored-by: Eddie-Wang1120 <wangjinheng1120@163.com>
Co-authored-by: meghagarwal <16129366+megha95@users.noreply.github.com>
2024-04-12 17:59:19 +08:00
Tejaswin Parthasarathy
37aee91e1a
Add 0.8 batch manager static lib for Windows (#1202)
Signed-off-by: tejaswinp <tejaswinp@nvidia.com>
2024-03-01 10:44:05 +08:00
Kaiyu Xie
5955b8afba
Update TensorRT-LLM Release branch (#1192)
* Update TensorRT-LLM

---------

Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2024-02-29 17:20:55 +08:00
Sean Stephens
2f169d17d5
Add batch manager static lib for Windows (#814) 2024-01-05 09:49:56 +08:00
Kaiyu Xie
80bc07510a
Update TensorRT-LLM Release branch (#745)
* Update TensorRT-LLM

---------

Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2023-12-26 19:42:17 +08:00
石晓伟
a8018c14e6
Fix a docker build error (#719) 2023-12-22 08:00:11 +08:00
石晓伟
59f41c067d
Update TensorRT-LLM (#708)
* Update TensorRT-LLM

* update

* Bump version to 0.7.0
2023-12-20 16:38:28 +08:00
Sean Stephens
0268914005
Add batch manager static lib for Windows (#569)
Signed-off-by: Sean Stephens <149515905+sestephens-nv@users.noreply.github.com>
2023-12-06 13:43:30 +08:00
Kaiyu Xie
b40cfac22b
Update badge (#552) 2023-12-04 22:35:48 +08:00
石晓伟
9b3e12dbc8
Update TensorRT-LLM (#546) 2023-12-04 18:59:41 +08:00
Kaiyu Xie
8dd9c91470
Update TensorRT-LLM (#539)
* Update TensorRT-LLM

---------

Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2023-12-04 18:06:59 +08:00
石晓伟
119e21606d
update aarch64 libraries (#525) 2023-12-01 16:27:03 +08:00
Kaiyu Xie
587d063e6d
Update TensorRT-LLM (#506)
* Update TensorRT-LLM

---------

Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
2023-11-30 16:46:22 +08:00
石晓伟
a21e2f8517
Fix an issue of mpi4py (#475) 2023-11-27 15:30:14 +08:00
Kaiyu Xie
6837c8141a
Update the latest news (#379) 2023-11-14 17:15:05 +08:00
石晓伟
73a9ee4b1c
Add Latest News section (#368) 2023-11-13 21:11:10 +08:00
石晓伟
7ce7e1dc47
Add Latest News section (#366) 2023-11-13 20:56:37 +08:00
石晓伟
1f3a421528
Add Latest News section (#361) 2023-11-13 15:15:23 +08:00
石晓伟
71a5b97b9c
Add Latest News section (#314) 2023-11-08 15:04:30 +08:00
Sean Stephens
93174abd9a
build: Update Windows torch versions (#309) 2023-11-08 11:56:38 +08:00
石晓伟
d8ebeee2f6
patch for commit f84d5fe (#245) 2023-11-02 14:50:51 +08:00
Sean Stephens
f84d5fea2b
Add batch manager lib (#221) 2023-11-02 13:49:41 +08:00
石晓伟
11e14500f3
update the batch manager (#152) 2023-10-27 12:15:49 +08:00
nv-guomingz
d0b56df751
fix doc typo. (#114)
Co-authored-by: Guoming Zhang <37257613+nv-guomingz@users.noreply.github.com>
2023-10-25 19:31:50 +08:00
juney-nvidia
1c4a0ee24c
Update windows related documentation (#59) 2023-10-22 09:24:52 +08:00
Kaiyu Xie
26f797c732
Update docs/source/batch_manager.md (#40) 2023-10-20 12:08:45 +02:00
Yi Wang
7295ce25aa
Fix two deadlinks in README.md (#21)
* Fix deadlines in README.md

* Update README.md
2023-10-20 10:13:20 +08:00
juney-nvidia
9a5da6dbe9
Fix small doc issue (#19) 2023-10-20 08:52:02 +08:00
Julien Demouth
84fd0ddbae
Fix the link to the documentation (#15)
Co-authored-by: Julien Demouth <jdemouth@nvidia.com>
2023-10-19 23:46:57 +08:00
石晓伟
ffd5af342a revise the homepage (#14)
Co-authored-by: Shi Xiaowei <xiaoweis@nvidia.com>
2023-10-19 18:04:24 +08:00
石晓伟
395f85088e add git-lfs dependency for binaries (#11)
Co-authored-by: Shi Xiaowei <xiaoweis@nvidia.com>
2023-10-19 18:04:24 +08:00
June Yang
a42dc2bad8 Minor doc updates 2023-10-19 18:04:24 +08:00
石晓伟
4926a921eb update aarch64 batch manager libraries to release/0.5.0 (#10) 2023-10-19 18:04:17 +08:00
Kaiyu Xie
b4af28ca35
Fix memory leak in falcon weight loader (#8) 2023-10-18 23:58:33 +08:00
June Yang
6b32b407b8 refresh 0.5.0 release branch with the latest revision 2023-10-18 22:18:19 +08:00
June Yang
dcd773ea98 Updates for release/0.5.0 2023-10-15 21:26:20 +08:00
Kaiyu Xie
4941ad29d2
Merge pull request #4 from NVIDIA/kaiyu/update
Update TensorRT-LLM
2023-10-11 16:03:32 +08:00
Kevin Xie
cf3028dbd1 Update .a libs 2023-10-11 00:42:47 -07:00
Kevin Xie
39d574ae66 Update 2023-10-11 00:42:09 -07:00
Kevin Xie
3a87f272bb Update 2023-10-10 23:32:03 -07:00
Kevin Xie
6c3bfd26fc Update 2023-10-10 23:24:07 -07:00
Kevin Xie
027cd518e3 Update 2023-10-10 23:22:17 -07:00
Kaiyu Xie
279e329b22
Merge pull request #3 from NVIDIA/kaiyu/update
Update TRT-LLM code
2023-09-29 01:46:55 +08:00
Kevin Xie
6111f5210b Update submodule 2023-09-28 10:28:36 -07:00
Kevin Xie
496456efec Update submodule 2023-09-28 10:00:48 -07:00
Kevin Xie
766926cfd4 Add .a libs 2023-09-28 09:36:15 -07:00
Kevin Xie
6e9e318e91 Update code 2023-09-28 09:00:05 -07:00