Design Strategies and Approximation Methods for High-Performance Computing Variability Management
Yueyao Wang, Li Xu, Yili Hong, Rong Pan, Tyler Chang and, Thomas Lux, Jon Bernard, Layne Watson, Kirk Cameron

TL;DR
This paper evaluates the effectiveness of space-filling designs and surrogate models like Gaussian processes for managing I/O performance variability in high-performance computing, emphasizing design efficiency, accuracy, and scalability.
Contribution
It customizes and investigates the application of SFDs and Gaussian processes in HPC variability prediction, providing guidance on their use based on data collection methods and surface properties.
Findings
GP with SFDs generally outperforms other methods
SFDs require fewer points than GBDs for similar accuracy
Tailored SFDs are effective for high-dimensional, non-smooth surfaces
Abstract
Performance variability management is an active research area in high-performance computing (HPC). We focus on input/output (I/O) variability. To study the performance variability, computer scientists often use grid-based designs (GBDs) to collect I/O variability data, and use mathematical approximation methods to build a prediction model. Mathematical approximation models could be biased particularly if extrapolations are needed. Space-filling designs (SFDs) and surrogate models such as Gaussian process (GP) are popular for data collection and building predictive models. The applicability of SFDs and surrogates in the HPC variability needs investigation. We investigate their applicability in the HPC setting in terms of design efficiency, prediction accuracy, and scalability. We first customize the existing SFDs so that they can be applied in the HPC setting. We conduct a comprehensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSimulation Techniques and Applications · Advanced Multi-Objective Optimization Algorithms · Manufacturing Process and Optimization
