Learning Confidence Bounds for Classification with Imbalanced Data

Matt Clifford; Jonathan Erskine; Alexander Hepburn; Ra\'ul Santos-Rodr\'iguez; Dario Garcia-Garcia

arXiv:2407.11878·cs.LG·October 22, 2025

Learning Confidence Bounds for Classification with Imbalanced Data

Matt Clifford, Jonathan Erskine, Alexander Hepburn, Ra\'ul Santos-Rodr\'iguez, Dario Garcia-Garcia

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new framework that uses learning theory and confidence bounds to improve classification accuracy and reliability on imbalanced datasets, addressing limitations of traditional sampling methods.

Contribution

It presents a novel approach that incorporates class-dependent confidence bounds into the learning process to better handle class imbalance.

Findings

01

Framework effectively adapts to class imbalance

02

Improves robustness and reliability of classification

03

Offers a promising direction for future research

Abstract

Class imbalance poses a significant challenge in classification tasks, where traditional approaches often lead to biased models and unreliable predictions. Undersampling and oversampling techniques have been commonly employed to address this issue, yet they suffer from inherent limitations stemming from their simplistic approach such as loss of information and additional biases respectively. In this paper, we propose a novel framework that leverages learning theory and concentration inequalities to overcome the shortcomings of traditional solutions. We focus on understanding the uncertainty in a class-dependent manner, as captured by confidence bounds that we directly embed into the learning process. By incorporating class-dependent estimates, our method can effectively adapt to the varying degrees of imbalance across different classes, resulting in more robust and reliable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mattclifford1/confidence-bounds-ecai24
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Artificial Intelligence in Healthcare

MethodsFocus