The Pragmatic Frames of Spurious Correlations in Machine Learning: Interpreting How and Why They Matter
Samuel J. Bell, Skyler Wang

TL;DR
This paper explores how machine learning researchers interpret and judge the significance of correlations, especially spurious ones, through pragmatic frames based on practical, ethical, and normative considerations.
Contribution
It introduces the concept of pragmatic frames to understand how ML research assesses the relevance, generalizability, human-likeness, and harmfulness of correlations beyond formal definitions.
Findings
Identifies four pragmatic frames: relevance, generalizability, human-likeness, harmfulness.
Shows that correlation desirability depends on situated judgments, not just statistical properties.
Highlights the social and ethical dimensions in defining spuriousness in ML.
Abstract
Learning correlations from data forms the foundation of today's machine learning (ML) and artificial intelligence research. While contemporary methods enable the automatic discovery of complex patterns, they are prone to failure when unintended correlations are captured. This vulnerability has spurred a growing interest in interrogating spuriousness, which is often seen as a threat to model performance, fairness, and robustness. In this article, we trace departures from the conventional statistical definition of spuriousness-which denotes a non-causal relationship arising from coincidence or confounding-to examine how its meaning is negotiated in ML research. Rather than relying solely on formal definitions, researchers assess spuriousness through what we call pragmatic frames: Judgments based on what a correlation does in practice-how it affects model behavior, supports or impedes task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
