Deflation as a Method of Variance Reduction for Estimating the Trace of a Matrix Inverse
Arjun Singh Gambhir, Andreas Stathopoulos, Kostas Orginos

TL;DR
This paper investigates how deflation affects variance reduction in trace estimation of matrix inverses, revealing that deflation can increase variance for Hermitian matrices but not for non-Hermitian ones, and demonstrates significant variance reduction in Lattice QCD applications.
Contribution
The paper provides a theoretical analysis of deflation's impact on variance in trace estimation and combines deflation with Hierarchical Probing for large-scale Lattice QCD computations.
Findings
Deflation may increase variance for Hermitian matrices but not for non-Hermitian matrices.
Combining deflation with Hierarchical Probing reduces variance by over 60 times in Lattice QCD.
Precomputing smallest singular values enables efficient large-scale eigenvalue computations.
Abstract
Many fields require computing the trace of the inverse of a large, sparse matrix. The typical method used for such computations is the Hutchinson method which is a Monte Carlo (MC) averaging over matrix quadratures. To improve its convergence, several variance reductions techniques have been proposed. In this paper, we study the effects of deflating the near null singular value space. We make two main contributions. First, we analyze the variance of the Hutchinson method as a function of the deflated singular values and vectors. Although this provides good intuition in general, by assuming additionally that the singular vectors are random unitary matrices, we arrive at concise formulas for the deflated variance that include only the variance and mean of the singular values. We make the remarkable observation that deflation may increase variance for Hermitian matrices but not for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
