yunruis
|
b99c5ce8c1
|
Feat/ds r1 min latency opt round3, add router gemm, fused a gemm, PDL (#4560)
Signed-off-by: yunruis <yunruis@nvidia.com>
Signed-off-by: kduan <176893526+Kefeng-Duan@users.noreply.github.com>
Signed-off-by: Kefeng-Duan <176893526+Kefeng-Duan@users.noreply.github.com>
Co-authored-by: kduan <176893526+Kefeng-Duan@users.noreply.github.com>
|
2025-06-14 17:36:22 +08:00 |
|
Zongfei Jing
|
6d1f2d0fd7
|
[TRTLLM-3927] [feat] Finalize + Allreduce + add + rmsnorm fusion (#4756)
Signed-off-by: Zongfei Jing <20381269+zongfeijing@users.noreply.github.com>
|
2025-06-10 19:55:16 +08:00 |
|
Daniel Stokes
|
3a4851b7c3
|
feat: Add Mixture of Experts FP8xMXFP4 support (#4750)
Signed-off-by: Daniel Stokes <40156487+djns99@users.noreply.github.com>
|
2025-06-09 13:25:04 +08:00 |
|
Void
|
62bb7f9286
|
fix potential issues in allreduce fusion kernel and ut (#4226)
fix allreduce fuison kernels and ut
Signed-off-by: Yilin Zhang <18275976+yilin-void@users.noreply.github.com>
---------
Co-authored-by: AIDC-AI <AIDC-AIB@365fanyi.com>
|
2025-05-19 17:38:29 +08:00 |
|
Void
|
316e5c3be3
|
feat: fix and improve allreduce and fusion kernels (#3064)
Signed-off-by: Yilin Zhang <18275976+yilin-void@users.noreply.github.com>
|
2025-04-08 19:33:52 +08:00 |
|