Potential Anchoring for imbalanced data classification

Micha{\l} Koziarski

arXiv:2104.08548·cs.LG·April 20, 2021

Potential Anchoring for imbalanced data classification

Micha{\l} Koziarski

PDF

1 Repo

TL;DR

This paper introduces Potential Anchoring, a novel resampling framework using radial basis functions to improve classification on imbalanced datasets by preserving class distribution shapes and optimizing synthetic data placement.

Contribution

It presents a unified over- and undersampling method that outperforms existing algorithms, especially on naturally complex, noise-free datasets.

Findings

01

Outperforms state-of-the-art resampling algorithms on 60 datasets

02

Effectively preserves class distribution shapes during resampling

03

Excels on naturally complex, noise-free datasets

Abstract

Data imbalance remains one of the factors negatively affecting the performance of contemporary machine learning algorithms. One of the most common approaches to reducing the negative impact of data imbalance is preprocessing the original dataset with data-level strategies. In this paper we propose a unified framework for imbalanced data over- and undersampling. The proposed approach utilizes radial basis functions to preserve the original shape of the underlying class distributions during the resampling process. This is done by optimizing the positions of generated synthetic observations with respect to the potential resemblance loss. The final Potential Anchoring algorithm combines over- and undersampling within the proposed framework. The results of the experiments conducted on 60 imbalanced datasets show outperformance of Potential Anchoring over state-of-the-art resampling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

michalkoziarski/PotentialAnchoring
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.