Commit Graph

22 Commits

Author SHA1 Message Date
David Addison
7278698c1b Clarified use of Mebibytes and Gibibytes for sizes 2026-01-06 14:59:17 -08:00
David Addison
4bc314aa27 Add README.md text for -J option 2025-11-21 11:31:48 -08:00
David Addison
e2af90af76 Add new report_timestamps option to README.md 2025-10-28 10:21:58 -07:00
David Addison
f2015cbe82 Modified warmup to run for more message sizes
Loops between minBytes and maxBytes doubling size each time

Reduced default warmup iteration count to 1 (was 5)
2025-08-25 13:57:51 -07:00
David Addison
0c60e6a8e4 Fix formatting errors in README.md 2025-05-30 17:43:30 -07:00
David Addison
a5c539e68b Add support for Symmetric Memory Registration
From NCCL 2.27.x we can now use the Symmetric Memory APIs (-R 2)
2025-05-30 17:31:34 -07:00
David Addison
1021260ca9 Make verifiable a DSO and add NAME_SUFFIX support
Build option DSO=1 generates libverifiable.so which can be
used to reduce the combined binary size.

Build option NAME_SUFFIX can be used to a add suffix to all
generated binaries. e.g. NAME_SUFFIX=_mpi

Added new make target: clean_intermediates
2025-04-23 17:07:24 -07:00
Sylvain Jeaugey
903918fc54
Add NCCL_TESTS_SPLIT documentation in the README 2025-02-06 14:10:07 +01:00
David Addison
0d86b5a6e7 Added some missing command line options to README.md
Also updated single and multi-node examples.
2024-07-30 14:50:45 -07:00
David Addison
6c46206a47 Make the -c option be a datacheck iteration count parameter
Default is 1
2023-09-13 14:03:38 -07:00
Felix Abecassis
17d0a42d5a
Update README.md 2023-03-23 09:05:41 -07:00
Sylvain Jeaugey
2cbb968101
Update README.md
Improve MPI example to avoid confusion of number of processes / total number of GPUs.

https://github.com/NVIDIA/nccl-tests/issues/54#issuecomment-1212023369
2023-01-03 08:47:43 +01:00
David Addison
11cff17a04 Updated with new command line arguments 2021-07-06 16:27:45 -07:00
Sylvain Jeaugey
a2af1d959d
Update README.md
Checks are now fully local, no need to disable them at scale.
2019-10-10 10:51:05 -07:00
Sylvain Jeaugey
ca7a565236
Update README.md 2019-08-16 09:06:28 -07:00
David Addison
cbe7f65400 Resync all tests with test code from NCCL 2.4
Major rework to merge most of the changes from the NCCL internal
tests into the public ones

Added "-m <agg_iters>" operation aggregation option.
Data integrity checking is now much more performant at scale.
Startup times at scale are improved.
Test latency units are now displayed in usec.
2019-04-05 13:42:15 -07:00
Sylvain Jeaugey
db39a88f8a Fix link to performance page 2018-01-30 09:14:49 -08:00
Sylvain Jeaugey
222f94f949 Added explanation about performance numbers 2018-01-30 09:13:52 -08:00
Sylvain Jeaugey
9ec3e35276 Fix typo in Readme 2017-08-08 16:29:25 -07:00
Sylvain Jeaugey
a15599f5cf Improve Readme 2017-08-08 16:28:46 -07:00
Sylvain Jeaugey
caede2fbd6 Improve Readme 2017-08-08 16:25:07 -07:00
Sylvain Jeaugey
b188a15299 Initial commit 2017-08-08 16:18:34 -07:00