Lower bounds for trace estimation via Block Krylov and other methods
Shi Jie Yu

TL;DR
This paper establishes theoretical lower bounds for trace estimation of matrix functions, analyzing the number of Krylov steps needed and connecting it to polynomial approximation limits, especially for inverse and Wishart matrices.
Contribution
It introduces fundamental lower bounds on the number of queries and Krylov steps required for trace estimation, linking these to polynomial approximation theory.
Findings
Derived upper bounds on Krylov steps for specific matrix functions.
Established lower bounds on the number of queries for trace estimation.
Clarified the relationship between Krylov methods and polynomial approximation degree.
Abstract
This paper studies theoretical lower bounds for estimating the trace of a matrix function, , focusing on methods that use Hutchinson's method along with Block Krylov techniques. These methods work by approximating matrix-vector products like using a Block Krylov subspace. This is closely related to approximating functions with polynomials. We derive theoretical upper bounds on how many Krylov steps are needed for functions such as and by analyzing the upper bounds from the polynomial approximation of their scalar equivalent. In addition, we also develop lower limits on the number of queries needed for trace estimation, specifically for where is a Wishart matrix. Our study clarifies the connection between the number of steps in Block Krylov methods and the degree of the polynomial used for approximation. This links the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Matrix Theory and Algorithms · Advanced Optimization Algorithms Research
