Exploring bidirectional bounds for minimax-training of Energy-based models

Cong Geng; Jia Wang; Li Chen; Zhiyong Gao; Jes Frellsen; and S{\o}ren Hauberg

arXiv:2506.04609·cs.LG·June 6, 2025

Exploring bidirectional bounds for minimax-training of Energy-based models

Cong Geng, Jia Wang, Li Chen, Zhiyong Gao, Jes Frellsen, and S{\o}ren Hauberg

PDF

Open Access

TL;DR

This paper introduces a novel approach to training Energy-based models using bidirectional bounds, which improves stability and quality in density estimation and sample generation.

Contribution

It proposes the use of bidirectional bounds for EBM training, including new bounds based on Jacobian singular values, mutual information, and diffusion processes.

Findings

01

Bidirectional bounds stabilize EBM training.

02

Enhanced density estimation quality.

03

Improved sample generation results.

Abstract

Energy-based models (EBMs) estimate unnormalized densities in an elegant framework, but they are generally difficult to train. Recent work has linked EBMs to generative adversarial networks, by noting that they can be trained through a minimax game using a variational lower bound. To avoid the instabilities caused by minimizing a lower bound, we propose to instead work with bidirectional bounds, meaning that we maximize a lower bound and minimize an upper bound when training the EBM. We investigate four different bounds on the log-likelihood derived from different perspectives. We derive lower bounds based on the singular values of the generator Jacobian and on mutual information. To upper bound the negative log-likelihood, we consider a gradient penalty-like bound, as well as one based on diffusion processes. In all cases, we provide algorithms for evaluating the bounds. We compare the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques

MethodsDiffusion · energy-based model