Tight Bounds for Low Dimensional Star Stencils in the Parallel External Memory Model
Philipp Hupp, Riko Jacob

TL;DR
This paper establishes tight bounds for the minimum non-compulsory I/O operations in low-dimensional star stencil computations within the external memory model, significantly narrowing the gap between theoretical limits and practical algorithms.
Contribution
It provides the first tight bounds for star stencil computations in the external memory model across various dimensions, improving previous results and closing the performance gap.
Findings
Matching lower and upper bounds in 2D for star stencils.
Improved bounds in 3D, reducing the gap by a factor of $2 \, \sqrt{3}\sqrt{B}$.
First analysis of the constant of the leading term of non-compulsory I/Os for arbitrary dimensions.
Abstract
Stencil computations on low dimensional grids are kernels of many scientific applications including finite difference methods used to solve partial differential equations. On typical modern computer architectures, such stencil computations are limited by the performance of the memory subsystem, namely by the bandwidth between main memory and the cache. This work considers the computation of star stencils, like the 5-point and 7-point stencil, in the external memory model and parallel external memory model and analyses the constant of the leading term of the non-compulsory I/Os. While optimizing stencil computations is an active field of research, there has been a significant gap between the lower bounds and the performance of the algorithms so far. In two dimensions, this work provides matching constants for lower and upper bounds closing a multiplicative gap of 4. In three dimensions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
