1. CUDA


https://developer.download.nvidia.com/CUDA/training/StreamsAndConcurrencyWebinar.pdf



2. NCCL


NVIDIA Collective Communications Library

Multi-GPU and multi-node collective communication primitives (e.g. all-reduce, broadcast, all-gather)
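
As a rough illustration of what a collective primitive does, the sketch below simulates the semantics of an all-reduce (sum) in plain Python. It does not call NCCL: the function name `all_reduce_sum` and the list-of-lists stand-in for per-rank buffers are assumptions made for this example only.

```python
from typing import List


def all_reduce_sum(rank_buffers: List[List[float]]) -> List[List[float]]:
    """Simulate all-reduce (sum) semantics across ranks.

    rank_buffers[i] is the hypothetical input buffer held by rank i.
    After the collective, every rank holds the element-wise sum
    of all input buffers.
    """
    if not rank_buffers:
        return []
    length = len(rank_buffers[0])
    if any(len(buf) != length for buf in rank_buffers):
        raise ValueError("all ranks must contribute buffers of equal length")

    # Element-wise reduction over the rank dimension.
    reduced = [sum(values) for values in zip(*rank_buffers)]

    # Every rank receives its own copy of the reduced result.
    return [list(reduced) for _ in rank_buffers]


if __name__ == "__main__":
    # Three simulated ranks, each contributing a two-element buffer.
    print(all_reduce_sum([[1, 2], [3, 4], [5, 6]]))
```

A real NCCL program would perform the same reduction on device buffers, one per GPU, rather than on Python lists.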
