29 Commits

Author SHA1 Message Date
David Addison
ae1ce98e69 Add boot_id to the hostname hash due to collisions on Azure
Fixes #60
2021-01-04 11:38:45 -08:00
Sylvain Jeaugey
464f038106 Merge pull request #61 from jithinjosepkl/master
Use DJB2a hash algorithm in getHostHash()
2020-12-18 10:39:43 -08:00
Jithin Jose
da67a81c8e Use DJB2a hash algorithm in getHostHash() 2020-12-18 10:12:54 -08:00
Sylvain Jeaugey
bd0755c95c Merge pull request #48 from NVIDIA/fix-makefile-typo
Fix typo in src/Makefile
2020-06-24 14:52:55 -07:00
Luke Yeager
afdaf59b3b Fix typo in src/Makefile 2020-06-24 14:39:22 -07:00
Sylvain Jeaugey
b2603a2e85 Add gencode for CUDA11 2020-06-23 18:16:46 -07:00
Sylvain Jeaugey
ec1b5e22e6 Change all_gather/reduce_scatter algbw to match the documentation.
Fix #45 : All_gather and reduce_scatter algorithm bandwidth was
computed as time/count*(nranks-1) which is not consistent with the
way we compute it for other collectives.

This change makes algbw higher; busbw is unchanged.
2020-06-19 10:42:19 -07:00
Sylvain Jeaugey
07ac716c1a Fix #47 : compilation error on NCCL<2.7
Return an error when trying to run alltoall test when compiled
against NCCL<2.7.
2020-06-18 15:02:51 -07:00
Sylvain Jeaugey
a7b304dde5 Merge pull request #46 from NVIDIA/p2p
Add alltoall perf test
2020-06-17 10:45:29 -07:00
Luke Yeager
af4fa0f4cf Fix some memory leaks 2020-06-17 10:44:32 -07:00
Sylvain Jeaugey
7a833631b2 Remove sm_30 2020-06-15 08:54:21 -07:00
Sylvain Jeaugey
ba924dac95 Fix #43 : Add .gitignore for build dir 2020-06-03 15:10:38 -07:00
Sylvain Jeaugey
119a0ecf60 Add alltoall perf test 2020-03-17 12:00:19 -07:00
Sylvain Jeaugey
c864b73a27 Merge pull request #31 from wzamazon/fix_makefile
Add -L$(MPI_HOME)/lib64 to NVLDFLAGS
2020-01-06 10:38:40 -08:00
Wei Zhang
0f173234bb Add -L$(MPI_HOME)/lib64 to NVLDFLAGS
In some cases, the MPI library is not in $(MPI_HOME)/lib but
in $(MPI_HOME)/lib64. For example, on RedHat like Linux system
(CentOS, Amazon Linux), and MPI is installed by yum or rpm.

Under such circumstance, the current make file will cause failure.
This patch address this issue by adding -L$(MPI_HOME)/lib64 to
NVLDFLAGS in src/Makefile.

Signed-off-by: Wei Zhang <wzam@amazon.com>
2019-12-16 16:18:22 -08:00
Sylvain Jeaugey
a2af1d959d Update README.md
Checks are now fully local, no need to disable them at scale.
2019-10-10 10:51:05 -07:00
Sylvain Jeaugey
ca7a565236 Update README.md 2019-08-16 09:06:28 -07:00
David Addison
cbe7f65400 Resync all tests with test code from NCCL 2.4
Major rework to merge most of the changes from the NCCL internal
tests into the public ones

Added "-m <agg_iters>" operation aggregation option.
Data integrity checking is now much more performant at scale.
Startup times at scale are improved.
Test latency units are now displayed in usec.
2019-04-05 13:42:15 -07:00
Sylvain Jeaugey
dcf818955f Added a precision for AllGather and ReduceScatter sizes since NCCL uses the size per rank. 2018-08-17 14:58:44 -07:00
Sylvain Jeaugey
eb4c43ff3d Clarification 2018-01-30 09:17:29 -08:00
Sylvain Jeaugey
e00cb1f1c4 Typos/Clarifications 2018-01-30 09:15:58 -08:00
Sylvain Jeaugey
db39a88f8a Fix link to performance page 2018-01-30 09:14:49 -08:00
Sylvain Jeaugey
222f94f949 Added explanation about performance numbers 2018-01-30 09:13:52 -08:00
Sylvain Jeaugey
925a70576e Print NCCL version at start 2017-12-21 15:10:09 -08:00
Sylvain Jeaugey
25016c8eeb Fix NCCL_HOME to be consistent with README 2017-08-09 10:41:31 -07:00
Sylvain Jeaugey
9ec3e35276 Fix typo in Readme 2017-08-08 16:29:25 -07:00
Sylvain Jeaugey
a15599f5cf Improve Readme 2017-08-08 16:28:46 -07:00
Sylvain Jeaugey
caede2fbd6 Improve Readme 2017-08-08 16:25:07 -07:00
Sylvain Jeaugey
b188a15299 Initial commit 2017-08-08 16:18:34 -07:00
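
Note on da67a81c8e: DJB2a is a well-known string hash that starts from 5381 and, for each input byte, multiplies the running value by 33 and XORs in the byte. The following is a minimal sketch of that algorithm only. The function name `djb2a_hash` and the 64-bit accumulator width are assumptions made for illustration; this is not the repository's getHostHash() code, and it does not show how ae1ce98e69 combines the boot_id with the hostname.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* DJB2a string hash: h = (h * 33) XOR byte, seeded with 5381.
 * Hypothetical helper for illustration; not taken from the repository. */
static uint64_t djb2a_hash(const char *s) {
  assert(s != NULL);
  uint64_t h = 5381;
  for (const unsigned char *p = (const unsigned char *)s; *p != '\0'; ++p) {
    /* (h << 5) + h equals h * 33; then XOR in the current byte. */
    h = ((h << 5) + h) ^ (uint64_t)*p;
  }
  return h;
}
```

Hashing a longer input string, for example a hostname with another per-boot identifier appended, only changes what is passed as `s`; the loop itself stays the same.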