Robustness to Spurious Correlations via Human Annotations
Megha Srivastava, Tatsunori Hashimoto, Percy Liang

TL;DR
This paper proposes a human-annotated, causality-based framework to improve model robustness against spurious correlations, using a new distributionally robust optimization method to handle unmeasured confounders.
Contribution
It introduces a novel approach combining human annotations and UV-DRO to mitigate spurious correlations in machine learning models.
Findings
Achieved 5-10% improvement on a rotation-confounded digit recognition task.
Achieved 1.5-5% improvement on NYPD Police Stops analysis.
Demonstrated robustness gains against confounders in empirical tests.
Abstract
The reliability of machine learning systems critically assumes that the associations between features and labels remain similar between training and test distributions. However, unmeasured variables, such as confounders, break this assumption---useful correlations between features and labels at training time can become useless or even harmful at test time. For example, high obesity is generally predictive for heart disease, but this relation may not hold for smokers who generally have lower rates of obesity and higher rates of heart disease. We present a framework for making models robust to spurious correlations by leveraging humans' common sense knowledge of causality. Specifically, we use human annotation to augment each training example with a potential unmeasured variable (i.e. an underweight patient with heart disease may be a smoker), reducing the problem to a covariate shift…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
