From Invariant Representations to Invariant Data: Provable Robustness to Spurious Correlations via Noisy Counterfactual Matching

Ruqi Bai; Yao Ji; Zeyu Zhou; David I. Inouye

arXiv:2505.24843·cs.LG·November 11, 2025

From Invariant Representations to Invariant Data: Provable Robustness to Spurious Correlations via Noisy Counterfactual Matching

Ruqi Bai, Yao Ji, Zeyu Zhou, David I. Inouye

PDF

Open Access 1 Repo 3 Reviews

TL;DR

This paper introduces Noisy Counterfactual Matching (NCM), a data-centric approach that enhances model robustness to spurious correlations by leveraging noisy counterfactual data pairs, backed by theoretical guarantees and empirical validation.

Contribution

The paper proposes NCM, a novel method that uses noisy counterfactual pairs to improve robustness against spurious correlations, providing theoretical bounds and empirical evidence.

Findings

01

NCM improves robustness with noisy counterfactual pairs.

02

Theoretical bounds relate test error to in-domain error and counterfactual quality.

03

Empirical results validate NCM's effectiveness on real datasets.

Abstract

Models that learn spurious correlations from training data often fail when deployed in new environments. While many methods aim to learn invariant representations to address this, they often underperform standard empirical risk minimization (ERM). We propose a data-centric alternative that shifts the focus from learning invariant representations to leveraging invariant data pairs -- pairs of samples that should have the same prediction. We prove that certain counterfactuals naturally satisfy this invariance property. Based on this, we introduce Noisy Counterfactual Matching (NCM), a simple constraint-based method that improves robustness by leveraging even a small number of \emph{noisy} counterfactual pairs -- improving upon prior works that do not explicitly consider noise. For linear causal models, we prove that NCM's test-domain error is bounded by its in-domain error plus a term…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 8Confidence 3

Strengths

- The proposed method, Noisy Counterfactual Matching (NCM), is well-motivated and intuitive—it estimates a “spurious” subspace from pairwise differences and removes it via projection. - NCM is sample efficient. This is often easier to obtain in practice. - The implementation of NCM is simple, and has potential to be applied to different applications. - The method addresses the biggest practical challenge—noisy counterfactual pairs—and remains robust through a simple truncated-SVD design with a p

Weaknesses

- The projection removes only observed linear directions of domain shift, leaving potential nonlinear or unseen correlations unaddressed.

Reviewer 02Rating 4Confidence 4

Strengths

1. Addressing spurious correlations and domain generalization remains an important and open challenge. 2. The paper is well-written and conceptually easy to follow, with a consistent flow from motivation to theory to experiments. 3. Experimental results show modest but consistent improvements on benchmark datasets.

Weaknesses

1. The core idea of enforcing invariance through matching or orthogonality constraints is not entirely new. Previous works such as MatchDG, IRMv1 have already explored related ideas. NCM’s main distinction, using noisy counterfactual pairs, is interesting but incremental and mostly limited to linear settings. 2. The theoretical guarantees hold only for linear models with idealized “counterfactual pairs.” In realistic nonlinear cases (e.g., deep networks), the method lacks justification. The auth

Reviewer 03Rating 4Confidence 2

Strengths

The paper presents a an approach to domain generalization based on counterfactual matching. Its originality lies in reformulating invariant representation learning without relying on explicit domain labels, supported by a well-developed theoretical framework. The analysis is technically sound, with proofs that connect counterfactual consistency to invariance and empirical results that align with the theory. The experimental setup is appropriate, using standard benchmarks.

Weaknesses

While the paper is well-executed, its novelty is somewhat limited relative to prior work on counterfactual and invariant learning, such as MatchDG (ICLR 2021) . The distinction between NCM and these methods, beyond the use of formal proofs, could be explained more clearly. The empirical evaluation, though thorough on benchmark datasets, depends on synthetic or curated counterfactuals (e.g., Waterbirds-CF), which limits evidence of real-world applicability. Demonstrating how counterfactuals could

Code & Models

Repositories

inouye-lab/ncm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Bayesian Modeling and Causal Inference

MethodsSparse Evolutionary Training