Roofline model, based on [49]. The x-axis represents the operational (or computational) intensity (CI), and the y-axis represents the attainable performance (AP), or throughput. The computational roof and the I/O bandwidth roof limit the achievable AP. On the right (yellow area), algorithms are compute-bound, while on the left (orange area), they are memory-bound.
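The two roofs in the caption can be written as a single bound, AP = min(peak compute, CI x bandwidth). The following is a minimal sketch of that relation, not code from [49]. The function names and the hardware numbers (`PEAK_FLOPS`, `MEM_BANDWIDTH`) are hypothetical placeholders chosen for round arithmetic; they do not describe any real GPU.

```python
def attainable_performance(ci, peak_flops, mem_bandwidth):
    """Roofline bound in FLOP/s.

    ci            -- computational intensity in FLOP per byte moved
    peak_flops    -- computational roof in FLOP/s
    mem_bandwidth -- I/O bandwidth roof in bytes/s
    """
    return min(peak_flops, ci * mem_bandwidth)


def ridge_point(peak_flops, mem_bandwidth):
    """CI (FLOP/byte) at which the bandwidth roof meets the computational roof."""
    return peak_flops / mem_bandwidth


def classify(ci, peak_flops, mem_bandwidth):
    """Left of the ridge point is memory-bound, right of it is compute-bound."""
    if ci >= ridge_point(peak_flops, mem_bandwidth):
        return "compute-bound"
    return "memory-bound"


# Hypothetical device, values assumed for illustration only.
PEAK_FLOPS = 100e12      # 100 TFLOP/s
MEM_BANDWIDTH = 1e12     # 1 TB/s

if __name__ == "__main__":
    for ci in (10, 100, 200):
        ap = attainable_performance(ci, PEAK_FLOPS, MEM_BANDWIDTH)
        label = classify(ci, PEAK_FLOPS, MEM_BANDWIDTH)
        print(f"CI={ci:>4} FLOP/byte  AP={ap / 1e12:.0f} TFLOP/s  {label}")
```

With these placeholder numbers the ridge point sits at 100 FLOP/byte: a kernel below that intensity is limited by the bandwidth roof, and one above it is limited by the computational roof.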
COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training
[x] How should one go about optimizing deployments, and which GPU should be used: A100, A10g, or 4xA10g?