SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation

Yixuan Dong; Fang-Yi Su; Jung-Hsien Chiang

arXiv:2505.11813·cs.CV·May 20, 2025

Open Access

TL;DR

This paper introduces SGD-Mix, a novel data augmentation framework for domain-specific image classification that combines saliency-guided mixing and diffusion models to improve diversity, faithfulness, and label clarity, leading to better downstream performance.

Contribution

The paper presents a new augmentation method integrating saliency-guided mixing with diffusion models to address key challenges in domain-specific image classification.

Findings

01. Outperforms state-of-the-art augmentation methods across various tasks.

02. Enhances foreground semantics and background diversity effectively.

03. Improves robustness in fine-grained, long-tail, and few-shot scenarios.

Abstract

Data augmentation for domain-specific image classification tasks often struggles to simultaneously address diversity, faithfulness, and label clarity of generated data, leading to suboptimal performance in downstream tasks. While existing generative diffusion model-based methods aim to enhance augmentation, they fail to cohesively tackle these three critical aspects and often overlook intrinsic challenges of diffusion models, such as sensitivity to model characteristics and stochasticity under strong transformations. In this paper, we propose a novel framework that explicitly integrates diversity, faithfulness, and label clarity into the augmentation process. Our approach employs saliency-guided mixing and a fine-tuned diffusion model to preserve foreground semantics, enrich background diversity, and ensure label consistency, while mitigating diffusion model limitations. Extensive…
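
The abstract does not spell out how the saliency-guided mixing step works, so the following is only a generic sketch of the idea under stated assumptions, not the authors' implementation. It assumes a crude gradient-magnitude saliency proxy and a rectangular paste; the helper names `toy_saliency`, `salient_box`, and `saliency_mix` and the `frac` threshold are hypothetical, and the fine-tuned diffusion stage is not modeled at all.

```python
import numpy as np

def toy_saliency(img):
    """Stand-in saliency map (assumption): gradient magnitude of the
    grayscale image, scaled to [0, 1]. Not the paper's saliency model."""
    gray = img.mean(axis=2)
    gy, gx = np.gradient(gray)
    mag = np.hypot(gx, gy)
    peak = mag.max()
    return mag / peak if peak > 0 else mag

def salient_box(sal, frac=0.5):
    """Bounding box (top, bottom, left, right) around all pixels whose
    saliency is at least frac * max. `frac` is a hypothetical knob."""
    ys, xs = np.nonzero(sal >= frac * sal.max())
    return ys.min(), ys.max() + 1, xs.min(), xs.max() + 1

def saliency_mix(fg_img, bg_img, frac=0.5):
    """Paste the salient box of fg_img onto a copy of bg_img.
    The mixed image keeps fg_img's label, since its foreground is preserved
    while the background comes from another image."""
    top, bottom, left, right = salient_box(toy_saliency(fg_img), frac)
    mixed = bg_img.copy()
    mixed[top:bottom, left:right] = fg_img[top:bottom, left:right]
    return mixed, (top, bottom, left, right)

if __name__ == "__main__":
    # Synthetic example: a bright square (the "object") on a dark image,
    # mixed into a flat gray background of the same size.
    fg = np.zeros((32, 32, 3))
    fg[8:16, 8:16] = 1.0
    bg = np.full((32, 32, 3), 0.5)
    mixed, box = saliency_mix(fg, bg)
    print("pasted box:", box)
```

In a pipeline of the kind the abstract describes, a mixed image like this would presumably be handed to the diffusion model for refinement; that step is left out here because the listing gives no detail about it.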

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis

Methods: Diffusion