Your Classifier is Secretly an Energy Based Model and You Should Treat   it Like One

Will Grathwohl; Kuan-Chieh Wang; J\"orn-Henrik Jacobsen; David; Duvenaud; Mohammad Norouzi; Kevin Swersky

arXiv:1912.03263·cs.LG·September 16, 2020·86 cites

Your Classifier is Secretly an Energy Based Model and You Should Treat it Like One

Will Grathwohl, Kuan-Chieh Wang, J\"orn-Henrik Jacobsen, David, Duvenaud, Mohammad Norouzi, Kevin Swersky

PDF

Open Access 5 Repos

TL;DR

This paper reinterprets standard classifiers as energy-based models for joint distribution, enabling improved calibration, robustness, and generative capabilities while maintaining training efficiency.

Contribution

It introduces a novel framework that treats classifiers as energy-based models for joint distribution, unifying discriminative and generative learning.

Findings

01

Improves calibration, robustness, and out-of-distribution detection.

02

Generates high-quality samples comparable to GANs.

03

Achieves state-of-the-art performance in both discriminative and generative tasks.

Abstract

We propose to reinterpret a standard discriminative classifier of p(y|x) as an energy based model for the joint distribution p(x,y). In this setting, the standard class probabilities can be easily computed as well as unnormalized values of p(x) and p(x|y). Within this framework, standard discriminative architectures may beused and the model can also be trained on unlabeled data. We demonstrate that energy based training of the joint distribution improves calibration, robustness, andout-of-distribution detection while also enabling our models to generate samplesrivaling the quality of recent GAN approaches. We improve upon recently proposed techniques for scaling up the training of energy based models and presentan approach which adds little overhead compared to standard classification training. Our approach is the first to achieve performance rivaling the state-of-the-artin both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection

MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729