cuGUGA: Operator-Direct Graphical Unitary Group Approach Accelerated with CUDA
Zihan Pengmei

TL;DR
cuGUGA is a GPU-accelerated solver for configuration interaction calculations that achieves significant speedups over CPU implementations by leveraging CUDA kernels and optimized data structures, enabling efficient quantum chemistry computations.
Contribution
The paper introduces cuGUGA, a novel GPU-accelerated GUGA CI solver with constant-time CSF ranking and evaluation, improving computational efficiency in quantum chemistry.
Findings
Reproduces reference energies at 10^{-11} Eh accuracy.
Achieves up to 10x speedup on RTX 4090 for small active spaces.
CPU backend outperforms PySCF determinant and CSF backends.
Abstract
We present cuGUGA, an operator-direct graphical unitary group approach (GUGA) configuration interaction (CI) solver in a spin-adapted configuration state function (CSF) basis. Dynamic-programming walk counts provide constant-time CSF ranking/unranking, and pretabulated segment factors enable constant-time evaluation of coupling coefficients. Two-electron contributions are organized through an intermediate-weight formulation that separates sparse generator enumeration from integral contraction and supports both dense and density-fitted/Cholesky backends. We further map the same primitives to GPUs by implementing the irregular DRT traversal and accumulation in custom CUDA kernels while delegating contractions to CUDA libraries. The implementation reproduces reference energies at the 10^{-11} Eh level and matches CPU/GPU sigma-vectors to 10^{-14}. On an RTX 4090, the GPU backend provides…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Machine Learning in Materials Science · Protein Structure and Dynamics
