Training Discrete Energy-Based Models with Energy Discrepancy
Tobias Schr\"oder, Zijing Ou, Yingzhen Li, Andrew B. Duncan

TL;DR
This paper introduces a new contrastive loss called energy discrepancy for training discrete energy-based models, eliminating the need for sampling methods like MCMC and providing theoretical guarantees across various perturbation processes.
Contribution
The paper proposes energy discrepancy as a novel training method for discrete EBMs that avoids sampling and offers theoretical guarantees for different perturbation types.
Findings
Energy discrepancy performs well on lattice Ising models.
It effectively trains models on binary synthetic data.
It shows promising results on discrete image datasets.
Abstract
Training energy-based models (EBMs) on discrete spaces is challenging because sampling over such spaces can be difficult. We propose to train discrete EBMs with energy discrepancy (ED), a novel type of contrastive loss functional which only requires the evaluation of the energy function at data points and their perturbed counter parts, thus not relying on sampling strategies like Markov chain Monte Carlo (MCMC). Energy discrepancy offers theoretical guarantees for a broad class of perturbation processes of which we investigate three types: perturbations based on Bernoulli noise, based on deterministic transforms, and based on neighbourhood structures. We demonstrate their relative performance on lattice Ising models, binary synthetic data, and discrete image data sets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Electron and X-Ray Spectroscopy Techniques
