Description
Loop kernels can be analyzed and modeled to estimate performance on target hardware.
Such analysis is useful for HPC developers and researchers tuning numerical kernels or memory-bound code. Because performance models are approximations, validate their predictions against real measurements on the intended machine.