Learning Latent Space Hierarchical EBM Diffusion Models

Jiali Cui; Tian Han

arXiv:2405.13910·cs.LG·May 29, 2024

Learning Latent Space Hierarchical EBM Diffusion Models

Jiali Cui, Tian Han

PDF

Open Access

TL;DR

This paper introduces a novel approach combining diffusion probabilistic schemes with energy-based models to improve learning and sampling in hierarchical latent space models, addressing prior limitations and enhancing performance.

Contribution

It proposes using diffusion schemes to effectively learn and sample from multi-modal EBM priors in hierarchical latent space models, overcoming prior sampling challenges.

Findings

01

Superior performance on challenging tasks

02

Effective mitigation of EBM sampling issues

03

Enhanced expressivity of hierarchical models

Abstract

This work studies the learning problem of the energy-based prior model and the multi-layer generator model. The multi-layer generator model, which contains multiple layers of latent variables organized in a top-down hierarchical structure, typically assumes the Gaussian prior model. Such a prior model can be limited in modelling expressivity, which results in a gap between the generator posterior and the prior model, known as the prior hole problem. Recent works have explored learning the energy-based (EBM) prior model as a second-stage, complementary model to bridge the gap. However, the EBM defined on a multi-layer latent space can be highly multi-modal, which makes sampling from such marginal EBM prior challenging in practice, resulting in ineffectively learned EBM. To tackle the challenge, we propose to leverage the diffusion probabilistic scheme to mitigate the burden of EBM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing · Natural Language Processing Techniques

MethodsDiffusion · energy-based model