Convergence of Contrastive Divergence with Annealed Learning Rate in Exponential Family
Bai Jiang, Tung-yu Wu, Wing H. Wong

TL;DR
This paper proves the consistency and convergence rate of contrastive divergence with annealed learning rate in exponential family models, extending previous fixed-rate results and providing theoretical guarantees.
Contribution
It establishes the convergence properties of CD with annealed learning rate, including the rate of convergence and the impact of MCMC steps, using a novel super-martingale approach.
Findings
Convergence rate of 1/n^{1/3} for parameter estimates.
Number of MCMC steps affects convergence coefficient.
Experimental validation on a Boltzmann Machine.
Abstract
In our recent paper, we showed that in exponential family, contrastive divergence (CD) with fixed learning rate will give asymptotically consistent estimates \cite{wu2016convergence}. In this paper, we establish consistency and convergence rate of CD with annealed learning rate . Specifically, suppose CD- generates the sequence of parameters using an i.i.d. data sample of size , then converges in probability to 0 at a rate of . The number () of MCMC transitions in CD only affects the coefficient factor of convergence rate. Our proof is not a simple extension of the one in \cite{wu2016convergence}. which depends critically on the fact that is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference
