Imbalanced Classification through the Lens of Spurious Correlations

Jakob Hackstein; Sidney Bender

arXiv:2510.27650 · cs.LG · November 3, 2025

TL;DR

This paper uses Explainable AI, specifically counterfactual explanations, to identify and eliminate the spurious correlations (Clever Hans effects) that class imbalance amplifies. The method achieves competitive classification performance on three datasets and shows how these effects emerge under imbalance.

Contribution

It presents a method based on counterfactual explanations to detect and mitigate Clever Hans effects in imbalanced datasets, a perspective largely overlooked by existing techniques.

Findings

01. Achieves competitive classification performance on three datasets.
02. Demonstrates the emergence of Clever Hans effects under class imbalance.
03. Provides a new perspective on imbalance-related spurious correlations.

Abstract

Class imbalance poses a fundamental challenge in machine learning, frequently leading to unreliable classification performance. While prior methods focus on data- or loss-reweighting schemes, we view imbalance as a data condition that amplifies Clever Hans (CH) effects by underspecification of minority classes. In a counterfactual explanations-based approach, we propose to leverage Explainable AI to jointly identify and eliminate CH effects emerging under imbalance. Our method achieves competitive classification performance on three datasets and demonstrates how CH effects emerge under imbalance, a perspective largely overlooked by existing approaches.
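For context on the loss-reweighting schemes the abstract contrasts itself with, here is a minimal sketch of one common variant: inverse-frequency class weights applied to per-sample losses. This illustrates the baseline family only, not the paper's counterfactual method. The function names, the toy label split, and the loss values are assumptions made up for this example.

```python
from collections import Counter


def inverse_frequency_weights(labels):
    """Weight each class by n_samples / (n_classes * class_count).

    Rare classes receive larger weights, so their errors count more.
    (Hypothetical helper for illustration; not taken from the paper.)
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * cnt) for cls, cnt in counts.items()}


def weighted_mean_loss(per_sample_losses, labels, class_weights):
    """Average per-sample losses, weighting each sample by its class weight."""
    weights = [class_weights[y] for y in labels]
    total = sum(w * loss for w, loss in zip(weights, per_sample_losses))
    return total / sum(weights)


# Toy 90/10 imbalanced label set (assumed values, for illustration only).
labels = [0] * 90 + [1] * 10
class_weights = inverse_frequency_weights(labels)

# Assumed losses: the majority class is fit well, the minority class poorly.
losses = [0.1] * 90 + [2.0] * 10
plain_loss = sum(losses) / len(losses)
reweighted_loss = weighted_mean_loss(losses, labels, class_weights)
```

In this toy setting the plain average is dominated by the well-fit majority class, while the reweighted average gives both classes equal total influence. Reweighting changes how much each sample counts; on its own it does not inspect which features the model relies on, which is the gap the abstract's Clever Hans perspective targets.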

Taxonomy

Topics: Imbalanced Data Classification Techniques · Explainable Artificial Intelligence (XAI) · Financial Distress and Bankruptcy Prediction