Uncovering Memorization Effect in the Presence of Spurious Correlations

Chenyu You; Haocheng Dai; Yifei Min; Jasjeet S. Sekhon; Sarang Joshi; James S. Duncan

arXiv:2501.00961·cs.LG·June 6, 2025

Uncovering Memorization Effect in the Presence of Spurious Correlations

Chenyu You, Haocheng Dai, Yifei Min, Jasjeet S. Sekhon, Sarang Joshi, James S. Duncan

PDF

Open Access

TL;DR

This paper investigates how neural networks memorize spurious correlations, especially in minority groups, and demonstrates that removing such memorization can improve model fairness and robustness.

Contribution

It provides the first evidence linking memorization of spurious features in specific neurons to imbalanced group performance and proposes a framework to mitigate this during training.

Findings

01

Spurious features are stored in a small subset of neurons.

02

Memorization of minority group information correlates with imbalanced performance.

03

Removing spurious memorization improves minority group accuracy.

Abstract

Machine learning models often rely on simple spurious features -- patterns in training data that correlate with targets but are not causally related to them, like image backgrounds in foreground classification. This reliance typically leads to imbalanced test performance across minority and majority groups. In this work, we take a closer look at the fundamental cause of such imbalanced performance through the lens of memorization, which refers to the ability to predict accurately on atypical examples (minority groups) in the training set but failing in achieving the same accuracy in the testing set. This paper systematically shows the ubiquitous existence of spurious features in a small set of neurons within the network, providing the first-ever evidence that memorization may contribute to imbalanced group performance. Through three experimental sources of converging empirical evidence,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Imbalanced Data Classification Techniques

MethodsSparse Evolutionary Training