TensorRT-LLMs/tensorrt_llm/_torch/distributed
Yukun He bb7bcc75c2
feat: Fallback to NCCL for various patterns when input size is large. (#4080)
* Fallback to NCCL for various patterns when input size is large.
Move the previous implementation to cpp side.

Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>

* Revising.

Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>

---------

Signed-off-by: Yukun He <23156053+hyukn@users.noreply.github.com>
2025-05-08 11:13:13 -07:00
..
__init__.py Clean up allreduce op in Deepseek V3 model. (#3829) 2025-05-01 07:56:36 +08:00
communicator.py chore: bump version to 0.19.0 (#3598) (#3841) 2025-04-29 16:57:22 +08:00
ops.py feat: Fallback to NCCL for various patterns when input size is large. (#4080) 2025-05-08 11:13:13 -07:00