Accelerating finite-element-based projector augmented-wave density functional theory calculations with scalable GPU-centric computational methods
Kartick Ramakrishnan, Phani Motamarri

TL;DR
This paper introduces GPU-optimized finite-element PAW methods enabling scalable, accurate large-scale DFT calculations on exascale systems, significantly reducing computation time compared to traditional approaches.
Contribution
The authors develop a GPU-centric finite-element PAW DFT implementation with novel algorithms and mixed-precision strategies for scalable, accurate electronic-structure calculations on exascale architectures.
Findings
Achieves up to 8x and 20x CPU-GPU speedups on Intel and AMD GPUs.
Reduces time-to-solution by nearly 8x for 10,000-electron systems compared to plane-wave PAW methods.
Demonstrates scalability to 130,000-electron systems, suitable for exascale computing.
Abstract
Accurate large-scale Kohn-Sham density functional theory (DFT) calculations are essential for modeling complex material systems, including interfaces, defects, nanoclusters, and twisted two-dimensional heterostructures. Achieving chemical accuracy at scales of - electrons with practical time-to-solution, however, remains challenging for existing DFT implementations. We present GPU-centric computational methods and algorithmic innovations within a finite-element (FE) discretized projector augmented-wave (PAW) formulation (PAW-FE) for accurate, efficient, and scalable electronic-structure calculations on modern exascale systems. The FE discretization, developed within a collinear spin formalism, accommodates generic boundary conditions and employs multi-resolution quadrature for accurate evaluation of atom-centered PAW integrals on coarse grids. The resulting generalized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
