Preconditioning Hamiltonian Monte Carlo by minimizing Fisher Divergence
Adrian Seyboldt, Eliot L. Carlson, Bob Carpenter

TL;DR
This paper introduces a novel Fisher divergence minimization approach for preconditioning Hamiltonian Monte Carlo, significantly improving efficiency over traditional variance-based methods across various models.
Contribution
It proposes a new estimator based on minimizing Fisher divergence, with three forms, that outperforms existing variance-based preconditioning methods in HMC.
Findings
Diagonal Fisher divergence minimizer outperforms standard estimators by median factor of 1.3.
Low-rank plus diagonal Fisher divergence minimizer outperforms standard estimators by median factor of 4.
Method tested on 114 models from posteriordb showing consistent improvements.
Abstract
Although Hamiltonian Monte Carlo (HMC) scales as O(d^(1/4)) in dimension, there is a large constant factor determined by the curvature of the target density. This constant factor can be reduced in most cases through preconditioning, the state of the art for which uses diagonal or dense penalized maximum likelihood estimation of (co)variance based on a sample of warmup draws. These estimates converge slowly in the diagonal case and scale poorly when expanded to the dense case. We propose a more effective estimator based on minimizing the sample Fisher divergence from a linearly transformed density to a standard normal distribution. We present this estimator in three forms, (a) diagonal, (b) dense, and (c) low-rank plus diagonal. Using a collection of 114 models from posteriordb, we demonstrate that the diagonal minimizer of Fisher divergence outperforms the industry-standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference
