Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured Spaces
Tobias Schr\"oder, Zijing Ou, Yingzhen Li, Andrew B. Duncan

TL;DR
This paper introduces a novel method for training energy-based models on discrete and mixed data spaces using heat equations and diffusion processes, avoiding complex sampling methods.
Contribution
It proposes a new training approach with Energy Discrepancy that leverages data perturbations informed by graph structures, enabling efficient learning without MCMC.
Findings
Effective in estimating discrete densities with non-binary vocabularies
Successful binary image modeling using the proposed method
Applicable to tabular data for synthetic data generation and classification
Abstract
Energy-based models (EBMs) offer a flexible framework for probabilistic modelling across various data domains. However, training EBMs on data in discrete or mixed state spaces poses significant challenges due to the lack of robust and fast sampling methods. In this work, we propose to train discrete EBMs with Energy Discrepancy, a loss function which only requires the evaluation of the energy function at data points and their perturbed counterparts, thus eliminating the need for Markov chain Monte Carlo. We introduce perturbations of the data distribution by simulating a diffusion process on the discrete state space endowed with a graph structure. This allows us to inform the choice of perturbation from the structure of the modelled discrete variable, while the continuous time parameter enables fine-grained control of the perturbation. Empirically, we demonstrate the efficacy of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNeural Networks and Applications
MethodsDiffusion
