Improving the stability of the covariance-controlled adaptive Langevin thermostat for large-scale Bayesian sampling
Jiani Wei, Xiaocheng Shang

TL;DR
This paper introduces a modified CCAdL thermostat with enhanced numerical stability for large-scale Bayesian sampling, improving accuracy and stability over existing methods.
Contribution
It proposes a new mCCAdL thermostat using advanced numerical methods and symmetric splitting, significantly enhancing stability and accuracy in Bayesian sampling.
Findings
mCCAdL outperforms original CCAdL in stability and accuracy
The new method allows for larger stepsizes in numerical integration
Experimental results show superior performance in large-scale applications
Abstract
Stochastic gradient Langevin dynamics and its variants approximate the likelihood of an entire dataset, via random (and typically much smaller) subsets, in the setting of Bayesian sampling. Due to the (often substantial) improvement of the computational efficiency, they have been widely used in large-scale machine learning applications. It has been demonstrated that the so-called covariance-controlled adaptive Langevin (CCAdL) thermostat, which incorporates an additional term involving the covariance matrix of the noisy force, outperforms popular alternative methods. A moving average is used in CCAdL to estimate the covariance matrix of the noisy force, in which case the covariance matrix will converge to a constant matrix in long-time limit. Moreover, it appears in our numerical experiments that the use of a moving average could reduce the stability of the numerical integrators,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis
