Feature contamination: Neural networks learn uncorrelated features and   fail to generalize

Tianren Zhang; Chujie Zhao; Guanyu Chen; Yizhou Jiang; Feng Chen

arXiv:2406.03345·cs.LG·February 14, 2025·1 cites

Feature contamination: Neural networks learn uncorrelated features and fail to generalize

Tianren Zhang, Chujie Zhao, Guanyu Chen, Yizhou Jiang, Feng Chen

PDF

Open Access 1 Repo

TL;DR

This paper investigates why neural networks fail to generalize under distribution shifts, revealing a phenomenon called feature contamination where uncorrelated features are learned alongside predictive ones, impairing robustness.

Contribution

It provides the first empirical and theoretical analysis of feature contamination, showing its role in out-of-distribution generalization failure in neural networks.

Findings

01

Neural networks can learn uncorrelated features with predictive ones, leading to failure under distribution shifts.

02

Explicitly fitting a teacher network's representations does not guarantee out-of-distribution generalization.

03

Feature contamination differs from spurious correlation explanations in the literature.

Abstract

Learning representations that generalize under distribution shifts is critical for building robust machine learning models. However, despite significant efforts in recent years, algorithmic advances in this direction have been limited. In this work, we seek to understand the fundamental difficulty of out-of-distribution generalization with deep neural networks. We first empirically show that perhaps surprisingly, even allowing a neural network to explicitly fit the representations obtained from a teacher network that can generalize out-of-distribution is insufficient for the generalization of the student network. Then, by a theoretical study of two-layer ReLU networks optimized by stochastic gradient descent (SGD) under a structured feature model, we identify a fundamental yet unexplored feature learning proclivity of neural networks, feature contamination: neural networks can learn…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

trzhang0116/feature-contamination
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications