xxi
|
d471655242
|
[TRTLLM-7831][feat] Cherry-pick from #7423 Support fp8 block wide ep cherry pick (#7712)
|
2025-09-23 08:41:38 +08:00 |
|
dongfengy
|
026f22eb50
|
[None][doc] Cherry-pick deployment guide update from 1.1.0rc2 branch to main branch (#7774)
Signed-off-by: Dongfeng Yu <dongfengy@nvidia.com>
|
2025-09-18 22:50:26 +08:00 |
|
Guoming Zhang
|
7f3f658d5f
|
[None][doc] Rename TensorRT-LLM to TensorRT LLM. (#7554)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
|
2025-09-09 12:16:03 +08:00 |
|
Guoming Zhang
|
35dac55716
|
[None][doc] Update kvcache part (#7549)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
|
2025-09-09 12:16:03 +08:00 |
|
Guoming Zhang
|
f53fb4c803
|
[TRTLLM-5930][doc] 1.0 Documentation. (#6696)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
Signed-off-by: Wangshanshan <30051912+dominicshanshan@users.noreply.github.com>
|
2025-09-09 12:16:03 +08:00 |
|
binghanc
|
14ee43e254
|
[None][docs] refine docs for accuracy evaluation of gpt-oss models (#7252)
Signed-off-by: 176802681+binghanc@users.noreply.github.com
|
2025-09-08 09:56:23 +08:00 |
|
dongfengy
|
48155f52bf
|
[TRTLLM-7321][doc] Refine GPT-OSS doc (#7180)
Signed-off-by: Dongfeng Yu
|
2025-08-24 08:53:53 -04:00 |
|
dongfengy
|
d94cc3fa3c
|
[TRTLLM-7321][doc] Add GPT-OSS Deployment Guide into official doc site (#7143)
Signed-off-by: Dongfeng Yu
|
2025-08-22 16:17:01 +08:00 |
|
Kaiyu Xie
|
9a74ee9dae
|
[None] [doc] Add more documents for large scale EP (#7029)
Signed-off-by: Kaiyu Xie <26294424+kaiyux@users.noreply.github.com>
|
2025-08-19 19:04:39 +08:00 |
|
Daniel Cámpora
|
53312eeebd
|
[TRTLLM-7157][feat] BREAKING CHANGE Introduce sampler_type, detect sampler according to options (#6831)
Signed-off-by: Daniel Campora <961215+dcampora@users.noreply.github.com>
|
2025-08-16 00:27:24 -04:00 |
|
JunyiXu-nv
|
70e352a6f7
|
[https://nvbugs/5437106][fix] Add L4 Scout benchmarking WAR option in deploy guide (#6829)
Signed-off-by: Junyi Xu <junyix@nvidia.com>
|
2025-08-15 08:53:13 +08:00 |
|
Tao Li @ NVIDIA
|
345d3d3524
|
[None][doc] update moe support matrix for DS R1 (#6883)
Signed-off-by: taoli <litaotju@users.noreply.github.com>
Co-authored-by: taoli <litaotju@users.noreply.github.com>
|
2025-08-14 13:55:11 +08:00 |
|
Zhenhua Wang
|
868c5d166e
|
[None][chore] fix markdown format for the deployment guide (#6879)
Signed-off-by: Zhenhua Wang <zhenhuaw@nvidia.com>
|
2025-08-13 22:19:11 -04:00 |
|
Zhenhua Wang
|
8416d7fea8
|
[https://nvbugs/5412885][doc] Add the workaround doc for H200 OOM (#6853)
Signed-off-by: Zhenhua Wang <4936589+zhenhuaw-me@users.noreply.github.com>
|
2025-08-13 19:51:38 +08:00 |
|
Guoming Zhang
|
0223de0727
|
[None][doc] Add deployment guide section for VDR task (#6669)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
|
2025-08-07 10:30:47 -04:00 |
|