Imbalanced Classification via a Tabular Translation GAN

Jonathan Gradstein; Moshe Salhov; Yoav Tulpan; Ofir Lindenbaum; Amir; Averbuch

arXiv:2204.08683·cs.LG·April 20, 2022·1 cites

Imbalanced Classification via a Tabular Translation GAN

Jonathan Gradstein, Moshe Salhov, Yoav Tulpan, Ofir Lindenbaum, Amir, Averbuch

PDF

Open Access

TL;DR

This paper introduces a GAN-based method for imbalanced binary classification on tabular data, generating synthetic minority samples to improve classifier performance, especially in severe class imbalance scenarios.

Contribution

It proposes a novel translation GAN with regularization and sample selection to enhance minority class modeling in imbalanced tabular datasets.

Findings

01

Improves average precision over re-weighting and oversampling methods.

02

Effective in severe class imbalance scenarios.

03

Enhances minority class representation with synthetic samples.

Abstract

When presented with a binary classification problem where the data exhibits severe class imbalance, most standard predictive methods may fail to accurately model the minority class. We present a model based on Generative Adversarial Networks which uses additional regularization losses to map majority samples to corresponding synthetic minority samples. This translation mechanism encourages the synthesized samples to be close to the class boundary. Furthermore, we explore a selection criterion to retain the most useful of the synthesized samples. Experimental results using several downstream classifiers on a variety of tabular class-imbalanced datasets show that the proposed method improves average precision when compared to alternative re-weighting and oversampling techniques.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Anomaly Detection Techniques and Applications