Simple and efficient GPU parallelization of existing H-Matrix accelerated BEM code
Kerstin Vater, Timo Betcke, Boris Dilba

TL;DR
This paper presents a simple GPU acceleration method for Hierarchical Matrix-based Boundary Element Method computations, achieving up to 5.5 times faster performance with minimal code modifications.
Contribution
It introduces a GPU offloading strategy for ACA-based BEM routines that can be integrated into existing codebases with minimal changes.
Findings
Speed-up of up to 5.5 times in BEM simulations.
Efficient GPU utilization with minimal code modifications.
Successful application to high-frequency sound scattering problem.
Abstract
In this paper, we demonstrate how GPU-accelerated BEM routines can be used in a simple black-box fashion to accelerate fast boundary element formulations based on Hierarchical Matrices (H-Matrices) with ACA (Adaptive Cross Approximation). In particular, we focus on the expensive evaluation of the discrete weak form of boundary operators associated with the Laplace and the Helmholtz equation in three space dimensions. The method is based on offloading the CPU assembly of elements during the ACA assembly onto a GPU device and to use threading strategies across ACA blocks to create sufficient workload for the GPU. The proposed GPU strategy is designed such that it can be implemented in existing code with minimal changes to the surrounding application structure. This is in particular interesting for existing legacy code that is not from the ground-up designed with GPU computing in mind. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsElectromagnetic Scattering and Analysis · Electromagnetic Simulation and Numerical Methods · Advanced Numerical Methods in Computational Mathematics
