heyuhhh
|
7395ca93b6
|
[None][doc] Add Sparse Attention feature doc (#9648)
Signed-off-by: yuhangh <58161490+heyuhhh@users.noreply.github.com>
Signed-off-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
Co-authored-by: Fanrong Li <23290157+lfr-0531@users.noreply.github.com>
|
2025-12-25 00:26:18 -05:00 |
|
Yi Zhang
|
496b419791
|
[None][doc] Add doc for torch.compile & piecewise cuda graph (#8527)
Signed-off-by: yizhang-nv <187001205+yizhang-nv@users.noreply.github.com>
|
2025-10-29 21:15:46 -07:00 |
|
Shi Xiaowei
|
a0024f4d34
|
[None][doc] Facilitates the integration of the transfer agent (#7867)
Signed-off-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>
|
2025-10-21 20:06:24 +08:00 |
|
Jonas Yang CN
|
88ea2c4ee9
|
[TRTLLM-7349][feat] Adding new orchestrator type -- ray (#7520)
Signed-off-by: Erin Ho <14718778+hchings@users.noreply.github.com>
Co-authored-by: Yuan Tong <13075180+tongyuantongyu@users.noreply.github.com>
Co-authored-by: Erin Ho <14718778+hchings@users.noreply.github.com>
|
2025-10-04 08:12:24 +08:00 |
|
Guoming Zhang
|
f53fb4c803
|
[TRTLLM-5930][doc] 1.0 Documentation. (#6696)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
|
2025-09-09 12:16:03 +08:00 |
|
Fridah-nv
|
cc0f4c87d4
|
[None][doc] Move AutoDeploy README.md to torch docs (#6528)
Signed-off-by: Frida Hou <201670829+Fridah-nv@users.noreply.github.com>
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
Co-authored-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
|
2025-08-08 19:11:45 -04:00 |
|
Adamz-nvidia
|
b1878eabeb
|
Add Wechat_Group_QR_Code.png to docs/source/media and main page of TR… (#5142)
Signed-off-by: AdamZ
|
2025-06-20 03:28:00 +08:00 |
|
Nick Comly
|
4735b87f1f
|
L4 added to readme (#3301)
* Add L4 chart
Signed-off-by: Nick Comly <85702008+ncomly-nvidia@users.noreply.github.com>
* Add L4 to readme
Signed-off-by: Nick Comly <85702008+ncomly-nvidia@users.noreply.github.com>
* Add files via upload
Signed-off-by: Nick Comly <85702008+ncomly-nvidia@users.noreply.github.com>
* Update README.md
Signed-off-by: Nick Comly <85702008+ncomly-nvidia@users.noreply.github.com>
* Add files via upload
Signed-off-by: Nick Comly <85702008+ncomly-nvidia@users.noreply.github.com>
* Add files via upload
Signed-off-by: Nick Comly <85702008+ncomly-nvidia@users.noreply.github.com>
---------
Signed-off-by: Nick Comly <85702008+ncomly-nvidia@users.noreply.github.com>
|
2025-04-06 19:09:28 +08:00 |
|
Laikh Tewari
|
d2b7b64b25
|
Add R1 perf data to latest news page (#2823)
* Update README.md
Signed-off-by: Laikh Tewari <laikhtewari1@gmail.com>
* add r1 perf chart to repo
Signed-off-by: Laikh Tewari <laikhtewari1@gmail.com>
* Delete docs/source/blogs/media/r1-perf.jpeg
Signed-off-by: Laikh Tewari <laikhtewari1@gmail.com>
* add file to correct media dir
Signed-off-by: Laikh Tewari <laikhtewari1@gmail.com>
* Update README.md with local img + remove old img
Signed-off-by: Laikh Tewari <laikhtewari1@gmail.com>
---------
Signed-off-by: Laikh Tewari <laikhtewari1@gmail.com>
|
2025-02-25 16:50:19 -08:00 |
|
Dan Blanaru
|
16d2467ea8
|
Update TensorRT-LLM (#2755)
* Update TensorRT-LLM
---------
Co-authored-by: Denis Kayshev <topenkoff@gmail.com>
Co-authored-by: akhoroshev <arthoroshev@gmail.com>
Co-authored-by: Patrick Reiter Horn <patrick.horn@gmail.com>
Update
|
2025-02-11 03:01:00 +00:00 |
|
Kaiyu Xie
|
535c9cc673
|
Update TensorRT-LLM (#2460)
|
2024-11-19 18:30:34 +08:00 |
|
Kaiyu Xie
|
b7868dd1bd
|
Update TensorRT-LLM (#2413)
|
2024-11-05 16:27:06 +08:00 |
|
Kaiyu Xie
|
f6821ee393
|
Update the latest news (#2391)
|
2024-10-29 23:23:02 +08:00 |
|
Kaiyu Xie
|
75057cd036
|
Update TensorRT-LLM (#2333)
* Update TensorRT-LLM
---------
Co-authored-by: Puneesh Khanna <puneesh.khanna@tii.ae>
Co-authored-by: Ethan Zhang <26497102+ethnzhng@users.noreply.github.com>
|
2024-10-15 15:28:40 +08:00 |
|
Dan Blanaru
|
48686bca3a
|
open source 7f370deb0090d885d7518c2b146399ba3933c004 (#2273)
* Update TensorRT-LLM
---------
Co-authored-by: Qingquan Song <ustcsqq@gmail.com>
|
2024-09-30 13:51:19 +02:00 |
|
Kaiyu Xie
|
78f5c2936b
|
Update TensorRT-LLM (#2184)
|
2024-09-03 12:14:23 +02:00 |
|
Kaiyu Xie
|
d335ef9797
|
Fix README (#2175)
|
2024-09-02 08:48:45 +02:00 |
|
石晓伟
|
b8fc6633ba
|
Update TensorRT-LLM (#2156)
Co-authored-by: Bruno Magalhaes <bruno.magalhaes@synthesia.io>
|
2024-08-27 18:20:59 +08:00 |
|
石晓伟
|
32ed92e449
|
Update TensorRT-LLM
Co-authored-by: Rong Zhou <130957722+ReginaZh@users.noreply.github.com>
Co-authored-by: Onur Galoglu <33498883+ogaloglu@users.noreply.github.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
|
2024-08-20 18:55:15 +08:00 |
|
Kaiyu Xie
|
74b324f667
|
Update TensorRT-LLM (#2110)
|
2024-08-13 22:34:33 +08:00 |
|
Kaiyu Xie
|
be9cd719f7
|
Update TensorRT-LLM (#2094)
* Update TensorRT-LLM
---------
Co-authored-by: akhoroshev <arthoroshev@gmail.com>
Co-authored-by: Fabian Joswig <fjosw@users.noreply.github.com>
Co-authored-by: Tayef Shah <tayefshah@gmail.com>
Co-authored-by: lfz941 <linfanzai941@gmail.com>
|
2024-08-07 16:44:43 +08:00 |
|
Kaiyu Xie
|
5fa9436e17
|
Update TensorRT-LLM (#2016)
|
2024-07-24 19:50:28 +08:00 |
|
Kaiyu Xie
|
0d5ffae9a7
|
Update README (#2012)
|
2024-07-24 09:31:27 +08:00 |
|
Kaiyu Xie
|
bca9a33b02
|
Update TensorRT-LLM (#2008)
* Update TensorRT-LLM
---------
Co-authored-by: Timur Abishev <abishev.timur@gmail.com>
Co-authored-by: MahmoudAshraf97 <hassouna97.ma@gmail.com>
Co-authored-by: Saeyoon Oh <saeyoon.oh@furiosa.ai>
Co-authored-by: hattizai <hattizai@gmail.com>
|
2024-07-23 23:05:09 +08:00 |
|
石晓伟
|
5ddb6bf218
|
Update the latest news (#1966)
|
2024-07-17 20:39:41 +08:00 |
|
Kaiyu Xie
|
9dbc5b38ba
|
Update TensorRT-LLM (#1891)
* Update TensorRT-LLM
---------
Co-authored-by: Marks101 <markus.schnoes@gmx.de>
Co-authored-by: lkm2835 <lkm2835@gmail.com>
|
2024-07-04 14:37:19 +08:00 |
|
Kaiyu Xie
|
71d8d4d3dc
|
Update TensorRT-LLM (#1455)
|
2024-04-16 19:40:08 +08:00 |
|