Towards Bridging the Performance Gaps of Joint Energy-based Models
Xiulong Yang, Qing Su, Shihao Ji

TL;DR
This paper introduces SADA-JEM, a training method for Joint Energy-based Models that significantly improves classification accuracy and image generation quality, bridging existing performance gaps and enhancing robustness.
Contribution
The paper proposes a novel training approach using sharpness-aware minimization and data augmentation mitigation to enhance JEM performance across multiple metrics.
Findings
SADA-JEM outperforms previous JEM in classification accuracy.
SADA-JEM achieves superior image generation quality.
The method improves calibration, OOD detection, and adversarial robustness.
Abstract
Can we train a hybrid discriminative-generative model within a single network? This question has recently been answered in the affirmative, introducing the field of Joint Energy-based Model (JEM), which achieves high classification accuracy and image generation quality simultaneously. Despite recent advances, there remain two performance gaps: the accuracy gap to the standard softmax classifier, and the generation quality gap to state-of-the-art generative models. In this paper, we introduce a variety of training techniques to bridge the accuracy gap and the generation quality gap of JEM. 1) We incorporate a recently proposed sharpness-aware minimization (SAM) framework to train JEM, which promotes the energy landscape smoothness and the generalizability of JEM. 2) We exclude data augmentation from the maximum likelihood estimate pipeline of JEM, and mitigate the negative impact of data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Cell Image Analysis Techniques
MethodsSoftmax · Sharpness-Aware Minimization
