An Approach for Realistically Simulating the Performance of Scientific Applications on High Performance Computing Systems
Ali Mohammed, Ahmed Eleliemy, Florina M. Ciorba, Franziska Kasielke,, Ioana Banicescu

TL;DR
This paper presents a realistic simulation approach for scientific applications with irregular, stochastic tasks on HPC systems, enabling better selection of dynamic loop self-scheduling techniques through cost-effective exploratory experiments.
Contribution
It introduces a novel simulation strategy that transforms native application code into a simulative version, accurately predicting performance characteristics for HPC applications with load imbalance.
Findings
Simulation approach accurately captures most performance characteristics.
Transformative method compares native and simulated application performance.
Highlights importance of realistic simulations for scheduling technique selection.
Abstract
Scientific applications often contain large, computationally-intensive, and irregular parallel loops or tasks that exhibit stochastic characteristics. Applications may suffer from load imbalance during their execution on high-performance computing (HPC) systems due to such characteristics. Dynamic loop self-scheduling (DLS) techniques are instrumental in improving the performance of scientific applications on HPC systems via load balancing. Selecting a DLS technique that results in the best performance for different problems and system sizes requires a large number of exploratory experiments. A theoretical model that can be used to predict the scheduling technique that yields the best performance for a given problem and system has not yet been identified. Therefore, simulation is the most appropriate approach for conducting such exploratory experiments with reasonable costs. This work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
