Learning Energy-Based Prior Model with Diffusion-Amortized MCMC
Peiyu Yu, Yaxuan Zhu, Sirui Xie, Xiaojian Ma, Ruiqi Gao, Song-Chun, Zhu, Ying Nian Wu

TL;DR
This paper introduces a diffusion-based amortization technique to improve long-run MCMC sampling in latent space Energy-Based Models, enhancing generation quality and training stability in high-dimensional, multi-modal distributions.
Contribution
It proposes a novel diffusion-amortized MCMC method for latent space EBMs, addressing sampling quality issues and enabling stable, high-quality generative modeling.
Findings
Superior performance on image benchmarks
Effective long-run MCMC sampling demonstrated
Improved training stability and generation quality
Abstract
Latent space Energy-Based Models (EBMs), also known as energy-based priors, have drawn growing interests in the field of generative modeling due to its flexibility in the formulation and strong modeling power of the latent space. However, the common practice of learning latent space EBMs with non-convergent short-run MCMC for prior and posterior sampling is hindering the model from further progress; the degenerate MCMC sampling quality in practice often leads to degraded generation quality and instability in training, especially with highly multi-modal and/or high-dimensional target distributions. To remedy this sampling issue, in this paper we introduce a simple but effective diffusion-based amortization method for long-run MCMC sampling and develop a novel learning algorithm for the latent space EBM based on it. We provide theoretical evidence that the learned amortization of MCMC is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare
Methodsenergy-based model
