Efficient Mixed-Precision Matrix Factorization of the Inverse Overlap Matrix in Electronic Structure Calculations with AI-Hardware and GPUs
Adela Habib, Joshua Finkelstein, Anders M. N. Niklasson

TL;DR
This paper introduces a mixed-precision iterative refinement algorithm leveraging Nvidia Tensor cores to efficiently compute the inverse overlap matrix in electronic structure calculations, significantly improving performance on AI hardware.
Contribution
It develops a novel mixed-precision approach using Tensor cores for matrix factorization of the inverse overlap matrix, enhancing computational efficiency in electronic structure theory.
Findings
Tensor core-based implementation outperforms GPU-only single/double precision methods
The algorithm maintains accuracy with a robust non-parametric stopping criterion
Significant speedup in matrix factorization for electronic structure calculations
Abstract
In recent years, a new kind of accelerated hardware has gained popularity in the Artificial Intelligence (AI) and Machine Learning (ML) communities which enables extremely high-performance tensor contractions in reduced precision for deep neural network calculations. In this article, we exploit Nvidia Tensor cores, a prototypical example of such AI/ML hardware, to develop a mixed precision approach for computing a dense matrix factorization of the inverse overlap matrix in electronic structure theory, . This factorization of , written as , is used to transform the general matrix eigenvalue problem into a standard matrix eigenvalue problem. Here we present a mixed precision iterative refinement algorithm where is given recursively using matrix-matrix multiplications and can be computed with high performance on Tensor cores. To understand the performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Advanced Measurement and Metrology Techniques · Manufacturing Process and Optimization
