TL;DR
This study investigates how data balancing methods influence the Rashomon effect in imbalanced classification, revealing that such methods increase predictive multiplicity and affect model selection reliability.
Contribution
It introduces a new metric, obscurity, to measure predictive multiplicity and proposes an extended performance-gain plot for responsible model evaluation.
Findings
Balancing methods increase predictive multiplicity in models.
Different balancing techniques yield varying model predictions.
The extended performance-gain plot helps monitor the trade-off between performance and multiplicity.
Abstract
Predictive models may generate biased predictions when classifying imbalanced datasets. This happens when the model favors the majority class, leading to low performance in accurately predicting the minority class. To address this issue, balancing or resampling methods are critical data-centric AI approaches in the modeling process to improve prediction performance. However, there have been debates and questions about the functionality of these methods in recent years. In particular, many candidate models may exhibit very similar predictive performance, called the Rashomon effect, in model selection, and they may even produce different predictions for the same observations. Selecting one of these models without considering the predictive multiplicity -- which is the case of yielding conflicting models' predictions for any sample -- can result in blind selection. In this paper, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques
MethodsSparse Evolutionary Training
