mirror of
https://github.com/NVIDIA/nccl-tests.git
synced 2026-04-23 16:08:20 +08:00
commit
3a3f790efd
@ -140,5 +140,6 @@ To obtain a bus bandwidth which should be independent of the number of ranks _n_
|
||||
* AllGather : (_n_-1)/_n_
|
||||
* Broadcast : 1
|
||||
* Reduce : 1
|
||||
* AlltoAll: (_n_-1)/_n_
|
||||
|
||||
The bus bandwidth should reflect the speed of the hardware bottleneck : NVLink, PCI, QPI, or network.
|
||||
|
||||
Loading…
Reference in New Issue
Block a user