An Experimental Study on the Rashomon Effect of Balancing Methods in Imbalanced Classification

Mustafa Cavus; Przemys{\l}aw Biecek

arXiv:2405.01557·cs.LG·May 18, 2026

An Experimental Study on the Rashomon Effect of Balancing Methods in Imbalanced Classification

Mustafa Cavus, Przemys{\l}aw Biecek

PDF

1 Repo

TL;DR

This study investigates how data balancing methods influence the Rashomon effect in imbalanced classification, revealing that such methods increase predictive multiplicity and affect model selection reliability.

Contribution

It introduces a new metric, obscurity, to measure predictive multiplicity and proposes an extended performance-gain plot for responsible model evaluation.

Findings

01

Balancing methods increase predictive multiplicity in models.

02

Different balancing techniques yield varying model predictions.

03

The extended performance-gain plot helps monitor the trade-off between performance and multiplicity.

Abstract

Predictive models may generate biased predictions when classifying imbalanced datasets. This happens when the model favors the majority class, leading to low performance in accurately predicting the minority class. To address this issue, balancing or resampling methods are critical data-centric AI approaches in the modeling process to improve prediction performance. However, there have been debates and questions about the functionality of these methods in recent years. In particular, many candidate models may exhibit very similar predictive performance, called the Rashomon effect, in model selection, and they may even produce different predictions for the same observations. Selecting one of these models without considering the predictive multiplicity -- which is the case of yielding conflicting models' predictions for any sample -- can result in blind selection. In this paper, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mcavs/ECML2024_Imbalanced_Rashomon_Paper
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques

MethodsSparse Evolutionary Training