Robust Learning with Progressive Data Expansion Against Spurious Correlation
Yihe Deng, Yu Yang, Baharan Mirzasoleiman, Quanquan Gu

TL;DR
This paper introduces PDE, a training algorithm that progressively expands data groups to improve model robustness against spurious correlations, achieving better worst-group accuracy and faster training on various datasets.
Contribution
The paper provides a theoretical analysis of nonlinear CNNs learning spurious features and proposes PDE, a novel data expansion method that enhances robustness and efficiency.
Findings
PDE improves worst-group accuracy by 2.8% over state-of-the-art.
PDE achieves up to 10x faster training efficiency.
Experimental results on synthetic and real datasets validate PDE's effectiveness.
Abstract
While deep learning models have shown remarkable performance in various tasks, they are susceptible to learning non-generalizable spurious features rather than the core features that are genuinely correlated to the true label. In this paper, beyond existing analyses of linear models, we theoretically examine the learning process of a two-layer nonlinear convolutional neural network in the presence of spurious features. Our analysis suggests that imbalanced data groups and easily learnable spurious features can lead to the dominance of spurious features during the learning process. In light of this, we propose a new training algorithm called PDE that efficiently enhances the model's robustness for a better worst-group performance. PDE begins with a group-balanced subset of training data and progressively expands it to facilitate the learning of the core features. Experiments on synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Text and Document Classification Technologies
