The Roofline model is a performance model used to estimate the performance of compute kernels or applications running on multi-core, many-core, or accelerator processor architectures. It combines locality, bandwidth, and parallelization paradigms into a single performance figure, providing insights on both implementation and inherent performance limitations. The most basic Roofline model can be visualized by plotting floating-point performance against machine peak performance and bandwidth.
UC Berkeley
Spring 2020
The course addresses programming parallel computers to solve complex scientific and engineering problems. It covers an array of parallelization strategies for numerical simulation, data analysis, and machine learning, and provides experience with popular parallel programming tools.
No concepts data
+ 36 more concepts