GPU Optimizations for the Hierarchical Poincar\'e-Steklov Scheme
Anna Yesypenko, Per-Gunnar Martinsson

TL;DR
This paper introduces GPU optimizations for the 2D Hierarchical Poincaré-Steklov scheme, significantly enhancing computational speed while maintaining high accuracy for solving high-frequency Helmholtz equations.
Contribution
The work develops straightforward GPU optimizations for the HPS method, enabling faster local static condensation on high-order discretizations with minimal implementation complexity.
Findings
GPU optimizations reduce local static condensation cost.
Numerical experiments show high accuracy of HPS with GPU acceleration.
Significant speedup in solving high wavenumber Helmholtz problems.
Abstract
This manuscript presents GPU optimizations for the 2D Hierarchical Poincar\'e-Steklov (HPS) discretization scheme. HPS is a multi-domain spectral collocation method that combines high-order discretizations with direct solvers to accurately resolve highly oscillatory solutions. The domain decomposition approach of HPS connects domains directly via a sparse direct solver. The proposed optimizations exploit batched linear algebra on modern hybrid architectures, are straightforward to implement, and improve the solver's practical speed. The manuscript demonstrates that GPU optimizations can significantly reduce the traditionally high cost of performing local static condensation for discretizations with very high local order . Numerical experiments for the Helmholtz equation with high wavenumbers on curved and rectangular domains confirm the high accuracy achieved by the HPS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Methods in Computational Mathematics · Numerical methods for differential equations · Matrix Theory and Algorithms
