Expanding the Role of Diffusion Models for Robust Classifier Training

Pin-Han Huang; Shang-Tse Chen; Hsuan-Tien Lin

arXiv:2602.19931·cs.LG·February 24, 2026

Expanding the Role of Diffusion Models for Robust Classifier Training

Pin-Han Huang, Shang-Tse Chen, Hsuan-Tien Lin

PDF

Open Access

TL;DR

This paper explores how diffusion models' internal representations can enhance adversarial training for robust image classifiers, showing that combining these representations with synthetic data improves robustness and feature disentanglement across multiple datasets.

Contribution

It introduces the novel idea of using diffusion model representations as auxiliary signals in adversarial training, beyond synthetic data generation.

Findings

01

Diffusion representations are diverse and partially robust.

02

Incorporating diffusion representations improves adversarial robustness.

03

Diffusion models encourage more disentangled features.

Abstract

Incorporating diffusion-generated synthetic data into adversarial training (AT) has been shown to substantially improve the training of robust image classifiers. In this work, we extend the role of diffusion models beyond merely generating synthetic data, examining whether their internal representations, which encode meaningful features of the data, can provide additional benefits for robust classifier training. Through systematic experiments, we show that diffusion models offer representations that are both diverse and partially robust, and that explicitly incorporating diffusion representations as an auxiliary learning signal during AT consistently improves robustness across settings. Furthermore, our representation analysis indicates that incorporating diffusion models into AT encourages more disentangled features, while diffusion representations and diffusion-generated synthetic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning