A Non-asymptotic Analysis for Learning and Applying a Preconditioner in MCMC
Max Hird, Florian Maire, Jeffrey Negrea

TL;DR
This paper provides a non-asymptotic analysis of learning and applying preconditioners in MCMC algorithms, demonstrating efficiency improvements and theoretical guarantees for preconditioned Langevin methods.
Contribution
It introduces non-asymptotic bounds for preconditioned MCMC schemes that learn their preconditioners, bridging modern theory with classical heuristics.
Findings
Preconditioned ULA with learned preconditioner has non-asymptotic guarantees.
Learning preconditioners reduces finite-time computational costs.
Results extend to underdamped Langevin algorithms with contraction conditions.
Abstract
Preconditioning is a common method applied to modify Markov chain Monte Carlo algorithms with the goal of making them more efficient. In practice it is often extremely effective, even when the preconditioner is learned from the chain. We analyse and compare the finite-time computational costs of schemes which learn a preconditioner based on the target covariance or the expected Hessian of the target potential with that of a corresponding scheme that does not use preconditioning. We apply our results to the Unadjusted Langevin Algorithm (ULA) for an appropriately regular target, establishing non-asymptotic guarantees for preconditioned ULA which learns its preconditioner. Our results are also applied to the unadjusted underdamped Langevin algorithm in the supplementary material. To do so, we establish non-asymptotic guarantees on the time taken to collect approximately independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Generative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference
