Automated Imbalanced Classification via Layered Learning
Vitor Cerqueira, Luis Torgo, Paula Branco, Colin Bellinger

TL;DR
This paper introduces ICLL, a layered learning approach for imbalanced binary classification that avoids resampling, automatically defines layers via hierarchical clustering, and outperforms existing methods on benchmark datasets.
Contribution
The paper proposes a novel layered learning method called ICLL that models data without resampling and automatically determines layers using hierarchical clustering.
Findings
ICLL outperforms several state-of-the-art methods on benchmark datasets.
The automatic layer definition reduces manual intervention and domain knowledge dependence.
Layered learning effectively handles class imbalance without resampling.
Abstract
In this paper we address imbalanced binary classification (IBC) tasks. Applying resampling strategies to balance the class distribution of training instances is a common approach to tackle these problems. Many state-of-the-art methods find instances of interest close to the decision boundary to drive the resampling process. However, under-sampling the majority class may potentially lead to important information loss. Over-sampling also may increase the chance of overfitting by propagating the information contained in instances from the minority class. The main contribution of our work is a new method called ICLL for tackling IBC tasks which is not based on resampling training observations. Instead, ICLL follows a layered learning paradigm to model the data in two stages. In the first layer, ICLL learns to distinguish cases close to the decision boundary from cases which are clearly from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques · Text and Document Classification Technologies · Artificial Intelligence in Healthcare
