Untitled

Roofline model, based on [49]. The x-axis represents the operational or computational intensity (CI) and y-axis represents the attainable performance (AP) or throughput. Computational roof and I/O bandwidth roof limit the achievable AP. On the right (yellow area), the algorithms are compute-bound, while on the left (orange area), they are memory-bound

Roofline model, based on [49]. The x-axis represents the operational or computational intensity (CI) and y-axis represents the attainable performance (AP) or throughput. Computational roof and I/O bandwidth roof limit the achievable AP. On the right (yellow area), the algorithms are compute-bound, while on the left (orange area), they are memory-bound

COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training

COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training