Scaling Structured Multigrid to 500K+ Cores through Coarse-Grid Redistribution
Andrew Reisner, Luke N. Olson, J. David Moulton

TL;DR
This paper presents a novel coarse-grid redistribution algorithm for structured multigrid methods, significantly improving scalability on supercomputers with over 500,000 cores by reducing communication overhead.
Contribution
It introduces a predictive, performance-model-guided redistribution algorithm for coarse-grid problems in structured multigrid, enhancing parallel scalability on large-scale systems.
Findings
Achieved scalable multigrid solutions on 500K+ cores.
Demonstrated performance gains over previous agglomeration methods.
Validated scalability on two large supercomputing systems.
Abstract
The efficient solution of sparse, linear systems resulting from the discretization of partial differential equations is crucial to the performance of many physics-based simulations. The algorithmic optimality of multilevel approaches for common discretizations makes them a good candidate for an efficient parallel solver. Yet, modern architectures for high-performance computing systems continue to challenge the parallel scalability of multilevel solvers. While algebraic multigrid methods are robust for solving a variety of problems, the increasing importance of data locality and cost of data movement in modern architectures motivates the need to carefully exploit structure in the problem. Robust logically structured variational multigrid methods, such as Black Box Multigrid (BoxMG), maintain structure throughout the multigrid hierarchy. This avoids indirection and increased coarse-grid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
