GenMix: Effective Data Augmentation with Generative Diffusion Model Image Editing
Khawar Islam, Muhammad Zaigham Zaheer, Arif Mahmood, Karthik Nandakumar, Naveed Akhtar

TL;DR
GenMix introduces a prompt-guided generative data augmentation method using image editing and fractal patterns to improve visual classification performance across various domains and tasks.
Contribution
This paper presents GenMix, a novel data augmentation approach that enhances in-domain and cross-domain classification through prompt-guided image editing and blending techniques.
Findings
Improves classification accuracy on eight public datasets.
Enhances adversarial robustness and performance in data-scarce scenarios.
Outperforms existing state-of-the-art augmentation methods.
Abstract
Data augmentation is widely used to enhance generalization in visual classification tasks. However, traditional methods struggle when source and target domains differ, as in domain adaptation, due to their inability to address domain gaps. This paper introduces GenMix, a generalizable prompt-guided generative data augmentation approach that enhances both in-domain and cross-domain image classification. Our technique leverages image editing to generate augmented images based on custom conditional prompts, designed specifically for each problem type. By blending portions of the input image with its edited generative counterpart and incorporating fractal patterns, our approach mitigates unrealistic images and label ambiguity, improving the performance and adversarial robustness of the resulting models. Efficacy of our method is established with extensive experiments on eight public…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Medical Image Segmentation Techniques · Computer Graphics and Visualization Techniques
