David Addison
7278698c1b
Clarified use of Mebibytes and Gibibytes for sizes
2026-01-06 14:59:17 -08:00
David Addison
4bc314aa27
Add README.md text for -J option
2025-11-21 11:31:48 -08:00
David Addison
e2af90af76
Add new report_timestamps option to README.md
2025-10-28 10:21:58 -07:00
David Addison
f2015cbe82
Modified warmup to run for more message sizes
...
Loops between minBytes and maxBytes doubling size each time
Reduced default warmup iteration count to 1 (was 5)
2025-08-25 13:57:51 -07:00
David Addison
0c60e6a8e4
Fix formatting errors in README.md
2025-05-30 17:43:30 -07:00
David Addison
a5c539e68b
Add support for Symmetric Memory Registration
...
From NCCL 2.27.x we can now use the Symmetric Memory APIs (-R 2)
2025-05-30 17:31:34 -07:00
David Addison
1021260ca9
Make verifiable a DSO and add NAME_SUFFIX support
...
Build option DSO=1 generates libverifiable.so which can be
used to reduce the combined binary size.
Build option NAME_SUFFIX can be used to a add suffix to all
generated binaries. e.g. NAME_SUFFIX=_mpi
Added new make target: clean_intermediates
2025-04-23 17:07:24 -07:00
Sylvain Jeaugey
903918fc54
Add NCCL_TESTS_SPLIT documentation in the README
2025-02-06 14:10:07 +01:00
David Addison
0d86b5a6e7
Added some missing command line options to README.md
...
Also updated single and multi-node examples.
2024-07-30 14:50:45 -07:00
David Addison
6c46206a47
Make the -c option be a datacheck iteration count parameter
...
Default is 1
2023-09-13 14:03:38 -07:00
Felix Abecassis
17d0a42d5a
Update README.md
2023-03-23 09:05:41 -07:00
Sylvain Jeaugey
2cbb968101
Update README.md
...
Improve MPI example to avoid confusion of number of processes / total number of GPUs.
https://github.com/NVIDIA/nccl-tests/issues/54#issuecomment-1212023369
2023-01-03 08:47:43 +01:00
David Addison
11cff17a04
Updated with new command line arguments
2021-07-06 16:27:45 -07:00
Sylvain Jeaugey
a2af1d959d
Update README.md
...
Checks are now fully local, no need to disable them at scale.
2019-10-10 10:51:05 -07:00
Sylvain Jeaugey
ca7a565236
Update README.md
2019-08-16 09:06:28 -07:00
David Addison
cbe7f65400
Resync all tests with test code from NCCL 2.4
...
Major rework to merge most of the changes from the NCCL internal
tests into the public ones
Added "-m <agg_iters>" operation aggregation option.
Data integrity checking is now much more performant at scale.
Startup times at scale are improved.
Test latency units are now displayed in usec.
2019-04-05 13:42:15 -07:00
Sylvain Jeaugey
db39a88f8a
Fix link to performance page
2018-01-30 09:14:49 -08:00
Sylvain Jeaugey
222f94f949
Added explanation about performance numbers
2018-01-30 09:13:52 -08:00
Sylvain Jeaugey
9ec3e35276
Fix typo in Readme
2017-08-08 16:29:25 -07:00
Sylvain Jeaugey
a15599f5cf
Improve Readme
2017-08-08 16:28:46 -07:00
Sylvain Jeaugey
caede2fbd6
Improve Readme
2017-08-08 16:25:07 -07:00
Sylvain Jeaugey
b188a15299
Initial commit
2017-08-08 16:18:34 -07:00