DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models
Khawar Islam, Muhammad Zaigham Zaheer, Arif Mahmood, Karthik, Nandakumar

TL;DR
DiffuseMix introduces a diffusion model-based data augmentation method that preserves labels and enhances robustness, outperforming existing techniques across multiple datasets and tasks.
Contribution
The paper presents a novel diffusion model-based augmentation technique that maintains label integrity and improves model robustness, addressing limitations of prior image-mixing methods.
Findings
Outperforms state-of-the-art methods on seven datasets.
Enhances adversarial robustness and generalization.
Effective across classification, fine-grained tasks, and data scarcity scenarios.
Abstract
Recently, a number of image-mixing-based augmentation techniques have been introduced to improve the generalization of deep neural networks. In these techniques, two or more randomly selected natural images are mixed together to generate an augmented image. Such methods may not only omit important portions of the input images but also introduce label ambiguities by mixing images across labels resulting in misleading supervisory signals. To address these limitations, we propose DiffuseMix, a novel data augmentation technique that leverages a diffusion model to reshape training images, supervised by our bespoke conditional prompts. First, concatenation of a partial natural image and its generated counterpart is obtained which helps in avoiding the generation of unrealistic images or label ambiguities. Then, to enhance resilience against adversarial attacks and improves safety measures, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Natural Language Processing Techniques · Recommender Systems and Techniques
MethodsSparse Evolutionary Training · Diffusion
