Rebalancing the Scales: A Systematic Mapping Study of Generative Adversarial Networks (GANs) in Addressing Data Imbalance
Pankaj Yadav, Gulshan Sihag, Vivek Vijay

TL;DR
This systematic mapping study analyzes 100 key papers on GAN-based techniques for addressing data imbalance across various domains, highlighting advancements, popular architectures, and future research directions.
Contribution
It provides a comprehensive categorization of GAN applications, techniques, and variants used for imbalanced data, revealing trends and gaps in current research.
Findings
GAN-based over-sampling is effective for data imbalance.
Advanced GAN architectures improve synthetic data quality.
Interest in GANs for imbalanced data has surged recently.
Abstract
Machine learning algorithms are used in diverse domains, many of which face significant challenges due to data imbalance. Studies have explored various approaches to address the issue, like data preprocessing, cost-sensitive learning, and ensemble methods. Generative Adversarial Networks (GANs) showed immense potential as a data preprocessing technique that generates good quality synthetic data. This study employs a systematic mapping methodology to analyze 3041 papers on GAN-based sampling techniques for imbalanced data sourced from four digital libraries. A filtering process identified 100 key studies spanning domains such as healthcare, finance, and cybersecurity. Through comprehensive quantitative analysis, this research introduces three categorization mappings as application domains, GAN techniques, and GAN variants used to handle the imbalanced nature of the data. GAN-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Anomaly Detection Techniques and Applications · Generative Adversarial Networks and Image Synthesis
MethodsDiffusion
