BSGAN: A Novel Oversampling Technique for Imbalanced Pattern Recognitions
Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

TL;DR
BSGAN introduces a hybrid oversampling method combining borderline SMOTE and GANs to generate diverse, Gaussian-distributed data, effectively addressing class imbalance in pattern recognition tasks.
Contribution
This work presents BSGAN, a novel oversampling technique that improves diversity and distribution of synthetic samples beyond existing borderline SMOTE methods.
Findings
BSGAN outperforms existing oversampling techniques on multiple datasets.
Generated data from BSGAN follow Gaussian distributions.
BSGAN enhances data diversity after oversampling.
Abstract
Class imbalanced problems (CIP) are one of the potential challenges in developing unbiased Machine Learning (ML) models for predictions. CIP occurs when data samples are not equally distributed between the two or multiple classes. Borderline-Synthetic Minority Oversampling Techniques (SMOTE) is one of the approaches that has been used to balance the imbalance data by oversampling the minor (limited) samples. One of the potential drawbacks of existing Borderline-SMOTE is that it focuses on the data samples that lay at the border point and gives more attention to the extreme observations, ultimately limiting the creation of more diverse data after oversampling, and that is the almost scenario for the most of the borderline-SMOTE based oversampling strategies. As an effect, marginalization occurs after oversampling. To address these issues, in this work, we propose a hybrid oversampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Spectroscopy and Chemometric Analyses
MethodsSynthetic Minority Over-sampling Technique.
