QI JUN
6624cc293a
[None][doc] remove nano-vl-v2 model support in release notes ( #9887 )
...
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
2025-12-10 18:09:54 -08:00
QI JUN
67ffa90d62
[ https://nvbugs/5729847 ][doc] fix broken links to modelopt ( #9868 )
...
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
2025-12-10 02:57:11 -08:00
QI JUN
df8d2310c8
[None][doc] Update release notes ( #9739 )
...
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
Signed-off-by: QI JUN <22017000+QiJune@users.noreply.github.com>
Co-authored-by: Laikh Tewari <laikhtewari1@gmail.com>
2025-12-09 18:46:55 -08:00
Zac Patel
cfaa13a98a
[IB-1920][doc] Update Perf_Overview.md with Benchmarking Results for Release 1.1 ( #9723 )
...
Signed-off-by: Zachary Patel <22306219+zbpatel@users.noreply.github.com>
2025-12-09 15:36:13 -08:00
xiweny
82e5a4cad8
[TRTLLM-4629][doc] Add B300 & GB300 in documents ( #9663 )
...
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-12-03 20:06:50 +08:00
Iman Tabrizian
ac5aa63b11
[TRTLLM-9082][doc] Address Dynamo Example feedback ( #9619 )
...
Signed-off-by: Iman Tabrizian <10105175+tabrizian@users.noreply.github.com>
2025-12-02 17:25:55 +08:00
Kaiyu Xie
6f7804ff50
[TRTLLM-9090] [doc] Update online benchmarking docs ( #9611 )
...
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
2025-12-02 15:58:43 +08:00
QI JUN
ec223a11c9
[TRTLLM-9092][doc] link to modelopt checkpoints in quick start guide ( #9571 )
...
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
2025-12-01 10:51:31 +08:00
QI JUN
534609b5a6
[TRTLLM-9093][doc] update hyper links in overview ( #9568 )
...
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
2025-12-01 10:43:03 +08:00
Yiqing Yan
6339e76b6d
[None][infra] Updated Linux installation guide ( #9485 )
...
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
2025-11-29 22:55:47 +08:00
Enwei Zhu
4263108ebe
[TRTLLM-9157][doc] Guided decoding doc improvement ( #9359 )
...
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-11-27 14:14:43 +08:00
QI JUN
267c850792
[TRTLLM-9086][doc] Clean up TODOs in documentation ( #9292 )
...
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
2025-11-27 14:13:00 +08:00
Pengyun Lin
41c903d6a7
[None][doc] VDR 1.0 trtllm-serve doc enhancement ( #9443 )
...
Signed-off-by: Pengyun Lin <81065165+LinPoly@users.noreply.github.com>
2025-11-27 13:08:26 +08:00
jthomson04
b9d92380da
[TRTLLM-9199][docs] KV Connector Docs ( #9325 )
...
Signed-off-by: jthomson04 <jwillthomson19@gmail.com>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
2025-11-24 18:07:50 +01:00
QI JUN
a49fdb36df
[TRTLLM-9092][doc] Add a pre-quantized example in quick start guide ( #9223 )
...
Signed-off-by: junq <22017000+QiJune@users.noreply.github.com>
2025-11-18 17:36:01 -08:00
Chang Liu
4661820d05
[TRTLLM-7971][doc] Doc update for multimodal in v1.1 ( #9015 )
...
Signed-off-by: Chang Liu (Enterprise Products) <9713593+chang-l@users.noreply.github.com>
2025-11-13 14:58:14 -08:00
Guoming Zhang
5192af14ea
[TRTLLM-9073][doc] Add the missing content for model support section and fix… ( #9033 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-11-10 13:44:16 +08:00
Yiqing Yan
572f9be06f
[None][chore] Lock onnx version <1.20.0 and remove WAR for TRT 10.13 ( #9007 )
...
Signed-off-by: Yiqing Yan <yiqingy@nvidia.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
2025-11-10 12:50:37 +08:00
Guoming Zhang
b941d7acbb
[ https://nvbugs/5634220 ][fix] Add developer guide back and fix some i… ( #8911 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-11-05 10:17:01 +08:00
Zhanrui Sun
776bb25bfd
[TRTLLM-8658][infra] upgrade to DLFW 25.10 and pytorch 2.9.0 / triton 3.5.0 ( #8621 )
...
Signed-off-by: ZhanruiSunCh <184402041+ZhanruiSunCh@users.noreply.github.com>
Signed-off-by: Zhanrui Sun <184402041+ZhanruiSunCh@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
2025-11-03 09:24:58 +08:00
xiweny
6545d541bb
[ https://nvbugs/5532789 ] [doc] Add documents about CUDA 12.9 ( #8192 )
...
Signed-off-by: Xiwen Yu <13230610+VALLIS-NERIA@users.noreply.github.com>
2025-10-13 00:35:36 -07:00
Guoming Zhang
0c47925600
[None][doc] Refine perf overview.md and correct the error link in per… ( #8036 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
2025-09-28 16:14:31 +08:00
Chuang Zhu
f98fa0cf8b
[None][feat] Optimize kv cache transfer TEP ( #7613 )
...
Signed-off-by: Chuang Zhu <111838961+chuangz0@users.noreply.github.com>
2025-09-25 20:20:04 -07:00
Yanchao Lu
7e2521a7f0
[None][chore] Some clean-ups for CUDA 13.0 dependencies ( #7979 )
...
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
2025-09-26 08:46:11 +08:00
WeiHaocheng
4b0570a0d6
[None][doc] Add acknowledgements in scaffolding tech blog ( #7983 )
...
Signed-off-by: Fred Wei <20514172+WeiHaocheng@users.noreply.github.com>
2025-09-25 08:07:13 -07:00
Yan Chunwei
40c6103ef8
[None][doc] add Llama PP known issue to release note ( #7959 )
...
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
Guoming Zhang
663ce3a4de
[None][doc] fix invalid links in perf benchmarking. ( #7933 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
Zac Patel
c38d4cf6a6
[None][doc] Update Perf-Overview.md for release/1.0 ( #7848 )
...
Signed-off-by: zpatel <22306219+zbpatel@users.noreply.github.com>
Signed-off-by: Yanchao Lu <yanchaol@nvidia.com>
Co-authored-by: Yanchao Lu <yanchaol@nvidia.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
Yan Chunwei
57c098956e
[None][doc] add a guide for modifying APIs ( #7866 )
...
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
Guoming Zhang
9f0f52249e
[None][doc] Rename TensorRT-LLM to TensorRT LLM for homepage and the … ( #7850 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
Guoming Zhang
5ecc8d0ee2
[None][doc] Replace the main in the examples' link with commit id. ( #7837 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
Guoming Zhang
4a09be40f0
[None][doc] Update docker cmd in quick start guide and trtllm-serve … ( #7787 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-25 21:02:35 +08:00
WeiHaocheng
259cc66c34
[None][doc] scaffolding tech blog part one ( #7835 )
...
Signed-off-by: Fred Wei <20514172+WeiHaocheng@users.noreply.github.com>
Signed-off-by: zheyuf <zheyuf@NVIDIA.com>
Co-authored-by: zheyuf <zheyuf@NVIDIA.com>
2025-09-25 14:41:59 +08:00
Aurelien Chartier
98726a3bed
[None][chore] Update trtllm-bench documentation on setting FP8 KV cache ( #7885 )
...
Signed-off-by: Aurelien Chartier <2567591+achartier@users.noreply.github.com>
2025-09-25 09:28:53 +08:00
Leslie Fang
342014069e
[None][chore] Validate features combination ( #7630 )
...
Signed-off-by: leslie-fang25 <leslief@nvidia.com>
2025-09-25 08:01:13 +08:00
xxi
d471655242
[TRTLLM-7831][feat] Cherry-pick from #7423 Support fp8 block wide ep cherry pick ( #7712 )
2025-09-23 08:41:38 +08:00
Guoming Zhang
edbe270198
[TRTLLM-7958][doc] add 1.0 release notes ( #7605 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: pcastonguay <55748270+pcastonguay@users.noreply.github.com>
Signed-off-by: Sharan Chetlur <116769508+schetlur-nv@users.noreply.github.com>
Co-authored-by: pcastonguay <55748270+pcastonguay@users.noreply.github.com>
Co-authored-by: Sharan Chetlur <116769508+schetlur-nv@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Yan Chunwei
ba2864a2c6
[None][doc] Enhance api reference doc by labeling stable APIs ( #7751 )
...
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Guoming Zhang
e8a3e21b87
[ https://nvbugs/5519525 ][fix] fix doc invalid link for bug 5519525 ( #7753 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Guoming Zhang
bc7b50334c
[None][doc] Add labels description note into llm api section ( #7696 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Guoming Zhang
ab915fb333
[None][doc] Use hash id for external link ( #7641 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Guoming Zhang
5c54173054
[None][doc] Fix a invalid link and a typo. ( #7634 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Guoming Zhang
8fed8ee066
[None][doc] add blackwell information into support matrix ( #6740 )
...
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Yan Chunwei
2ffc33921f
[ https://nvbugs/5416501 ][doc] add known issues to llmapi doc ( #7560 )
...
Signed-off-by: Yan Chunwei <328693+Superjomn@users.noreply.github.com>
Co-authored-by: Ryan McCormick <mccormick.codes@gmail.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
2025-09-22 14:28:38 +08:00
Enwei Zhu
e943a39cbd
[None][doc] Update tech blog12 ( #7884 )
...
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-09-20 18:15:39 +08:00
Kanghwan
8fcd11515d
[ #7704 ][chore] Enable MathJax to fix formulas in documentation ( #7744 )
...
Signed-off-by: Kanghwan Jang <861393+karljang@users.noreply.github.com>
2025-09-19 08:42:26 -07:00
Enwei Zhu
c8cc16d38d
[None][doc] Tech blog: Combining Guided Decoding and Speculative Decoding: Making CPU and GPU Cooperate Seamlessly ( #7864 )
...
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2025-09-19 18:38:12 +08:00
dongfengy
026f22eb50
[None][doc] Cherry-pick deployment guide update from 1.1.0rc2 branch to main branch ( #7774 )
...
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
2025-09-18 22:50:26 +08:00
Wanli Jiang
fe104dc20d
[TRTLLM-7918][feat] Support kvcache reuse and chunk prefill for phi4mm ( #7723 )
...
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
2025-09-18 17:37:16 +08:00
Wanli Jiang
a7ca0fff54
[TRTLLM-6577][feat] Support nano_v2_vlm in pytorch backend ( #7207 )
...
Signed-off-by: Wanli Jiang <35160485+Wanli-Jiang@users.noreply.github.com>
2025-09-18 16:26:20 +08:00