A Near-Optimal Gradient Flow for Learning Neural Energy-Based Models

Yang Wu; Pengxu Wei; Liang Lin

arXiv:1910.14216·cs.CV·May 1, 2023

A Near-Optimal Gradient Flow for Learning Neural Energy-Based Models

Yang Wu, Pengxu Wei, Liang Lin

PDF

Open Access

TL;DR

This paper introduces a new Wasserstein gradient flow-based numerical scheme for training neural energy-based models, improving the approximation of data distributions and generating high-quality high-dimensional data.

Contribution

It proposes a second-order Wasserstein gradient flow approach for EBMs, addressing limitations of previous methods and enhancing distribution fitting and data generation.

Findings

01

Outperforms existing schemes in fitting complex distributions

02

Produces higher quality high-dimensional data

03

Demonstrates practical superiority through extensive experiments

Abstract

In this paper, we propose a novel numerical scheme to optimize the gradient flows for learning energy-based models (EBMs). From a perspective of physical simulation, we redefine the problem of approximating the gradient flow utilizing optimal transport (i.e. Wasserstein) metric. In EBMs, the learning process of stepwise sampling and estimating data distribution performs the functional gradient of minimizing the global relative entropy between the current and target real distribution, which can be treated as dynamic particles moving from disorder to target manifold. Previous learning schemes mainly minimize the entropy concerning the consecutive time KL divergence in each learning step. However, they are prone to being stuck in the local KL divergence by projecting non-smooth information within smooth manifold, which is against the optimal transport principle. To solve this problem, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications

MethodsGraph Convolutional Network