From a5c539e68bb7263304997012498b0cd0667b99e8 Mon Sep 17 00:00:00 2001 From: David Addison Date: Mon, 19 May 2025 18:20:22 -0700 Subject: [PATCH] Add support for Symmetric Memory Registration From NCCL 2.27.x we can now use the Symmetric Memory APIs (-R 2) --- README.md | 2 +- src/common.cu | 58 +++++++++++++++++++++++++++++++++++++++++---------- 2 files changed, 48 insertions(+), 12 deletions(-) diff --git a/README.md b/README.md index bdafbe5..22687d4 100644 --- a/README.md +++ b/README.md @@ -78,7 +78,7 @@ All tests support the same set of arguments : * `-z,--blocking <0/1>` Make NCCL collective blocking, i.e. have CPUs wait and sync after each collective. Default : 0. * `-G,--cudagraph ` Capture iterations as a CUDA graph and then replay specified number of times. Default : 0. * `-C,--report_cputime <0/1>]` Report CPU time instead of latency. Default : 0. - * `-R,--local_register <1/0>` enable local buffer registration on send/recv buffers. Default : 0. + * `-R,--local_register <0/1/2> enable local (1) or symmetric (2) buffer registration on send/recv buffers. Default : 0. * `-T,--timeout