Eran Geva
|
4da3121363
|
[#8921][chore] AutoDeploy NanoV3 to use SYMM_MEM allreduce strategy (#9797)
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
|
2025-12-09 13:05:38 -08:00 |
|
Eran Geva
|
98db262a67
|
[None][fix] Switch AutoDeploy's default allreduce strategy to NCCL (#9666)
Signed-off-by: Eran Geva <19514940+MrGeva@users.noreply.github.com>
|
2025-12-08 03:26:21 -08:00 |
|
Lucas Liebenwein
|
a1964bcbbc
|
[#9643][fix] AutoDeploy: fix nano sharding config (#9668)
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
|
2025-12-04 03:10:25 +08:00 |
|
Chenghao Zhang
|
18fbda5cdb
|
[None][feat] AutoDeploy: Add A_log fusion for Mamba layers (#9422)
Signed-off-by: Chenghao Zhang <211069071+nvchenghaoz@users.noreply.github.com>
|
2025-11-26 14:39:20 -08:00 |
|
Suyog Gupta
|
efd503751f
|
[#9271][perf] Enable multi-stream MOE optimization in AutoDeploy (#9322)
Signed-off-by: Suyog Gupta <41447211+suyoggupta@users.noreply.github.com>
|
2025-11-24 19:50:10 -08:00 |
|
Lucas Liebenwein
|
6bf4e59267
|
[#8763][feature] AutoDeploy: configurable dtype for caching (#8812)
Signed-off-by: Lucas Liebenwein <11156568+lucaslie@users.noreply.github.com>
|
2025-11-10 22:17:14 -08:00 |
|