Phaseless Auxiliary-Field Quantum Monte Carlo on Graphical Processing Units
James Shee, Evan J. Arthur, Shiwei Zhang, David R. Reichman, Richard, A. Friesner

TL;DR
This paper introduces a GPU-accelerated implementation of phaseless Auxiliary-Field Quantum Monte Carlo, achieving significant speed-ups and enabling the study of larger, more complex electronic systems with improved efficiency and accuracy.
Contribution
The paper presents a novel GPU-based implementation of ph-AFQMC with algorithmic innovations that drastically reduce computation time and extend applicability to larger systems.
Findings
Achieved roughly two orders of magnitude speed-up with a single GPU.
Demonstrated near-perfect parallel efficiency across 8 GPUs.
Successfully applied to hydrogen chains and transition metal atoms for ionization potentials.
Abstract
We present an implementation of phaseless Auxiliary-Field Quantum Monte Carlo (ph-AFQMC) utilizing graphical processing units (GPUs). The AFQMC method is recast in terms of matrix operations which are spread across thousands of processing cores and are executed in batches using custom Compute Unified Device Architecture kernels and the hardware-optimized cuBLAS matrix library. Algorithmic advances include a batched Sherman-Morrison-Woodbury algorithm to quickly update matrix determinants and inverses, density-fitting of the two-electron integrals, an energy algorithm involving a high-dimensional precomputed tensor, and the use of single-precision floating point arithmetic. These strategies result in dramatic reductions in wall-times for both single- and multi-determinant trial wavefunctions. For typical calculations we find speed-ups of roughly two orders of magnitude using just a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
