Entropic Determinants
Diego Granziol, Stephen Roberts

TL;DR
This paper demonstrates that Maximum Entropy methods provide an optimal approach for approximating computationally expensive log determinant calculations in large-scale machine learning, supported by theoretical proofs and empirical validation.
Contribution
It establishes the theoretical optimality of Maximum Entropy methods for log determinant approximation and links moment information to eigenvalue distributions in large data limits.
Findings
Maximum Entropy methods are optimal for log determinant approximation.
Eigenvalue distributions can be fully characterized by moments.
Adding more moments reduces divergence between proposal and true eigenvalue distributions.
Abstract
The ability of many powerful machine learning algorithms to deal with large data sets without compromise is often hampered by computationally expensive linear algebra tasks, of which calculating the log determinant is a canonical example. In this paper we demonstrate the optimality of Maximum Entropy methods in approximating such calculations. We prove the equivalence between mean value constraints and sample expectations in the big data limit, that Covariance matrix eigenvalue distributions can be completely defined by moment information and that the reduction of the self entropy of a maximum entropy proposal distribution, achieved by adding more moments reduces the KL divergence between the proposal and true eigenvalue distribution. We empirically verify our results on a variety of SparseSuite matrices and establish best practices.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Gaussian Processes and Bayesian Inference · Advanced Thermodynamics and Statistical Mechanics
