Just Train Twice: Improving Group Robustness without Training Group Information
Evan Zheran Liu, Behzad Haghgoo, Annie S. Chen, Aditi Raghunathan,, Pang Wei Koh, Shiori Sagawa, Percy Liang, Chelsea Finn

TL;DR
The paper introduces JTT, a simple two-stage training method that improves worst-group accuracy in models with spurious correlations without needing group annotations during training.
Contribution
JTT is a novel two-stage approach that enhances group robustness by upweighting misclassified examples, reducing reliance on expensive group annotations.
Findings
JTT closes 75% of the worst-group accuracy gap in experiments.
JTT performs well across image and language tasks with spurious correlations.
JTT requires only small validation set annotations for hyperparameter tuning.
Abstract
Standard training via empirical risk minimization (ERM) can produce models that achieve high accuracy on average but low accuracy on certain groups, especially in the presence of spurious correlations between the input and label. Prior approaches that achieve high worst-group accuracy, like group distributionally robust optimization (group DRO) require expensive group annotations for each training point, whereas approaches that do not use such group annotations typically achieve unsatisfactory worst-group accuracy. In this paper, we propose a simple two-stage approach, JTT, that first trains a standard ERM model for several epochs, and then trains a second model that upweights the training examples that the first model misclassified. Intuitively, this upweights examples from groups on which standard ERM models perform poorly, leading to improved worst-group performance. Averaged over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Anomaly Detection Techniques and Applications
