Data Augmentation for Image Classification using Generative AI

Fazle Rahat; M Shifat Hossain; Md Rubel Ahmed; Sumit Kumar Jha; and; Rickard Ewetz

arXiv:2409.00547·cs.CV·September 4, 2024

Data Augmentation for Image Classification using Generative AI

Fazle Rahat, M Shifat Hossain, Md Rubel Ahmed, Sumit Kumar Jha, and, Rickard Ewetz

PDF

Open Access

TL;DR

This paper introduces AGA, a novel data augmentation framework leveraging generative AI to enhance image classification datasets by preserving foregrounds and diversifying backgrounds, leading to significant accuracy improvements.

Contribution

The paper presents a new augmentation method combining LLMs, diffusion, and segmentation models to improve dataset diversity while maintaining subject authenticity.

Findings

01

15.6% accuracy increase on in-distribution data

02

23.5% accuracy increase on out-of-distribution data

03

64.3% SIC score improvement over baselines

Abstract

Scaling laws dictate that the performance of AI models is proportional to the amount of available data. Data augmentation is a promising solution to expanding the dataset size. Traditional approaches focused on augmentation using rotation, translation, and resizing. Recent approaches use generative AI models to improve dataset diversity. However, the generative methods struggle with issues such as subject corruption and the introduction of irrelevant artifacts. In this paper, we propose the Automated Generative Data Augmentation (AGA). The framework combines the utility of large language models (LLMs), diffusion models, and segmentation models to augment data. AGA preserves foreground authenticity while ensuring background diversity. Specific contributions include: i) segment and superclass based object extraction, ii) prompt diversity with combinatorial complexity using prompt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsDiffusion