Commit Graph

3 Commits

Author SHA1 Message Date
Bo Deng
910c070e88
[None][fix] fix accuracy issue(cherry-pick https://github.com/NVIDIA/TensorRT-LLM/pull/11157 and https://github.com/NVIDIA/TensorRT-LLM/pull/9530) (#11222)
Signed-off-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
Signed-off-by: Bo Deng <deemod@nvidia.com>
Co-authored-by: Enwei Zhu <21126786+syuoni@users.noreply.github.com>
2026-02-04 11:24:21 +08:00
ChristinaZ
be576a3152
[None] [feat] Enable run_post_quant_allgather for MoE TRTLLM backend (#6794)
Signed-off-by: Christina Zhang <83400082+ChristinaZ@users.noreply.github.com>
2025-09-23 08:24:21 +08:00
ChristinaZ
c7269ea93a
[https://nvbugs/5392414] [fix] Add customized default routing method (#6818)
Signed-off-by: Christina Zhang <83400082+ChristinaZ@users.noreply.github.com>
2025-08-21 16:58:41 +08:00