GPGPU Acceleration of All-Electron Electronic Structure Theory Using Localized Numeric Atom-Centered Basis Functions
William Huhn, Bj\"orn Lange, Victor Wen-zhe Yu, Mina Yoon and, Volker Blum

TL;DR
This paper introduces a GPGPU-accelerated all-electron density-functional theory implementation using localized basis functions, achieving significant speedups in large-scale electronic structure calculations on parallel architectures.
Contribution
It presents a novel GPGPU-based implementation of all-electron DFT with domain decomposition for parallel real-space operations, enabling efficient large-scale simulations.
Findings
Speedups of 4.5 to 6.6 in SCF iterations with GPGPU acceleration
Overall speedup of 3-4 for entire calculations on multi-atom systems
Scalability demonstrated for large 375-atom Bi2Se3 bilayer
Abstract
We present an implementation of all-electron density-functional theory for massively parallel GPGPU-based platforms, using localized atom-centered basis functions and real-space integration grids. Special attention is paid to domain decomposition of the problem on non-uniform grids, which enables compute- and memory-parallel execution across thousands of nodes for real-space operations, e.g. the update of the electron density, the integration of the real-space Hamiltonian matrix, and calculation of Pulay forces. To assess the performance of our GPGPU implementation, we performed benchmarks on three different architectures using a 103-material test set. We find that operations which rely on dense serial linear algebra show dramatic speedups from GPGPU acceleration: in particular, SCF iterations including force and stress calculations exhibit speedups ranging from 4.5 to 6.6. For the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
