Designing quantum chemistry algorithms with just-in-time compilation
Xiaojie Wu, Qiming Sun, Yuanheng Wang

TL;DR
This paper introduces just-in-time compilation for Gaussian-type orbital integral kernels, significantly improving the efficiency of quantum chemistry calculations on GPUs with notable speedups and a compact implementation.
Contribution
It presents a novel JIT compilation approach for integral kernels in quantum chemistry, enabling faster computations and a compact CUDA implementation with support for single-precision arithmetic.
Findings
2x speedup for small basis sets on GPU
Up to 4x efficiency improvement for large basis sets
3x speedup with single-precision implementation
Abstract
We introduce just-in-time (JIT) compilation to the integral kernels for Gaussian-type orbitals (GTOs) to enhance the efficiency of electron repulsion integral computations. For Coulomb and exchange (JK) matrices, JIT-based algorithms yield a 2x speedup for the small 6-31G* basis set over GPU4PySCF v1.4 on an NVIDIA A100-80G GPU. By incorporating a novel algorithm designed for orbitals with high angular momentum, the efficiency of JK evaluations with the large def2-TZVPP basis set is improved by up to 4x. The core CUDA implementation is compact, comprising only ~1,000 lines of code, including support for single-precision arithmetic. Furthermore, the single-precision implementation achieves a 3x speedup over the previous state-of-the-art.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
