Multi-granularity Relabeled Under-sampling Algorithm for Imbalanced Data
Qi Dai, Jian-wei Liu, Yang Liu

TL;DR
This paper introduces a multi-granularity relabeled under-sampling algorithm (MGRU) that improves the handling of imbalanced data by considering local data structures, leading to better classification accuracy and generalization.
Contribution
The paper proposes a novel MGRU algorithm that enhances Tomek-Link based under-sampling by incorporating local data information for improved imbalanced classification.
Findings
MGRU outperforms baseline algorithms in classification accuracy.
Optimal global relabeled index enhances under-sampling effectiveness.
Local information consideration improves detection of overlapping instances.
Abstract
The imbalanced classification problem turns out to be one of the important and challenging problems in data mining and machine learning. The performances of traditional classifiers will be severely affected by many data problems, such as class imbalanced problem, class overlap and noise. The Tomek-Link algorithm was only used to clean data when it was proposed. In recent years, there have been reports of combining Tomek-Link algorithm with sampling technique. The Tomek-Link sampling algorithm can effectively reduce the class overlap on data, remove the majority instances that are difficult to distinguish, and improve the algorithm classification accuracy. However, the Tomek-Links under-sampling algorithm only considers the boundary instances that are the nearest neighbors to each other globally and ignores the potential local overlapping instances. When the number of minority instances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Text and Document Classification Technologies · Artificial Intelligence in Healthcare
