Learning Non-Convergent Non-Persistent Short-Run MCMC Toward Energy-Based Model
Erik Nijkamp, Mitch Hill, Song-Chun Zhu, Ying Nian Wu

TL;DR
This paper investigates the use of non-convergent, short-run MCMC in training energy-based models, demonstrating that it can generate realistic images and perform image reconstruction and interpolation, challenging traditional assumptions.
Contribution
It introduces a novel approach of using non-convergent short-run MCMC as a learned generator, showing its effectiveness in image synthesis and reconstruction.
Findings
Learned short-run MCMC can generate realistic images.
It can reconstruct and interpolate observed images.
The approach challenges traditional views on MCMC convergence in EBMs.
Abstract
This paper studies a curious phenomenon in learning energy-based model (EBM) using MCMC. In each learning iteration, we generate synthesized examples by running a non-convergent, non-mixing, and non-persistent short-run MCMC toward the current model, always starting from the same initial distribution such as uniform noise distribution, and always running a fixed number of MCMC steps. After generating synthesized examples, we then update the model parameters according to the maximum likelihood learning gradient, as if the synthesized examples are fair samples from the current model. We treat this non-convergent short-run MCMC as a learned generator model or a flow model. We provide arguments for treating the learned non-convergent short-run MCMC as a valid model. We show that the learned short-run MCMC is capable of generating realistic images. More interestingly, unlike traditional EBM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Neural Networks and Applications
Methodsenergy-based model
