Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise
Haotian Ye, James Zou, Linjun Zhang

TL;DR
This paper introduces a theoretical framework and a new algorithm, Freeze then Train (FTT), to improve representation learning under spurious correlations and feature noise, demonstrating superior performance over existing methods.
Contribution
The paper provides a theoretical understanding of feature learning under noise and proposes FTT, a novel algorithm that enhances test-time feature retention and robustness.
Findings
FTT outperforms ERM, IRM, JTT, and CVaR-DRO in accuracy by 4.5% on spurious correlation datasets.
Core features are better learned when their noise is smaller than spurious features, under certain conditions.
FTT improves generalization under distribution shifts, especially with high feature noise.
Abstract
The existence of spurious correlations such as image backgrounds in the training environment can make empirical risk minimization (ERM) perform badly in the test environment. To address this problem, Kirichenko et al. (2022) empirically found that the core features that are related to the outcome can still be learned well even with the presence of spurious correlations. This opens a promising strategy to first train a feature learner rather than a classifier, and then perform linear probing (last layer retraining) in the test environment. However, a theoretical understanding of when and why this approach works is lacking. In this paper, we find that core features are only learned well when their associated non-realizable noise is smaller than that of spurious features, which is not necessarily true in practice. We provide both theories and experiments to support this finding and to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Generative Adversarial Networks and Image Synthesis
MethodsTest
