Guoming Zhang
|
93ac0bc1dc
|
[TRTLLM-10126][feat] Increase topk upper limit to 22 for NVLinkOneSid… (#10229)
Signed-off-by: nv-guomingz <137257613+nv-guomingz@users.noreply.github.com>
|
2025-12-27 22:48:10 +08:00 |
|
Bo Li
|
cc1323be24
|
[None][fix] Fix the bug for top_k=10 in NVLinkOneSided AlltoAll. (#10197)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2025-12-23 02:13:37 -05:00 |
|
Bo Li
|
472fe497dc
|
[None][chore] NVLinkOneSided AlltoAll Support zero local_num_tokens. (#9822)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2025-12-22 05:57:12 -05:00 |
|
Yihan Wang
|
9df4dad3b6
|
[None][fix] Introduce inline namespace to avoid symbol collision (#9541)
Signed-off-by: Yihan Wang <yihwang@nvidia.com>
|
2025-12-12 23:32:15 +08:00 |
|
Bo Li
|
9d3c675a0b
|
[None][chore] Support larger topK for NVLinkOneSided AlltoAll. (#9816)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2025-12-10 11:10:55 +08:00 |
|
Bo Li
|
fcfec93cad
|
[TRTLLM-9389][chore] Rename AlltoAll backend names (#9329)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2025-11-23 13:52:57 -08:00 |
|
Bo Li
|
d8b05894ee
|
[None][perf] Adjust select_alltoall_method_type. (#8950)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2025-11-19 07:43:55 -08:00 |
|
Bo Li
|
e4bf29bc66
|
[None][feat] Integrate MnnvlThroughput into TRTLLM MoE. (#8728)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
|
2025-11-04 21:36:29 +08:00 |
|
Bo Li
|
9c4432f8a4
|
[TRTLLM-7318][feat] MnnvlThroughput AlltoAll implementation. (#7499)
Signed-off-by: Bo Li <22713281+bobboli@users.noreply.github.com>
Co-authored-by: Jin Li <59594262+liji-nv@users.noreply.github.com>
|
2025-10-27 13:23:06 -04:00 |
|