Robustness to Spurious Correlations via Human Annotations

Megha Srivastava; Tatsunori Hashimoto; Percy Liang

arXiv:2007.06661·cs.LG·August 17, 2020·26 cites

Robustness to Spurious Correlations via Human Annotations

Megha Srivastava, Tatsunori Hashimoto, Percy Liang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper proposes a human-annotated, causality-based framework to improve model robustness against spurious correlations, using a new distributionally robust optimization method to handle unmeasured confounders.

Contribution

It introduces a novel approach combining human annotations and UV-DRO to mitigate spurious correlations in machine learning models.

Findings

01

Achieved 5-10% improvement on a rotation-confounded digit recognition task.

02

Achieved 1.5-5% improvement on NYPD Police Stops analysis.

03

Demonstrated robustness gains against confounders in empirical tests.

Abstract

The reliability of machine learning systems critically assumes that the associations between features and labels remain similar between training and test distributions. However, unmeasured variables, such as confounders, break this assumption---useful correlations between features and labels at training time can become useless or even harmful at test time. For example, high obesity is generally predictive for heart disease, but this relation may not hold for smokers who generally have lower rates of obesity and higher rates of heart disease. We present a framework for making models robust to spurious correlations by leveraging humans' common sense knowledge of causality. Specifically, we use human annotation to augment each training example with a potential unmeasured variable (i.e. an underweight patient with heart disease may be a smoker), reducing the problem to a covariate shift…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://worksheets.codalab.org/worksheets/0xc8aa93946ffc48a69e59041144491fe1
noneOfficial

Videos

Robustness to Spurious Correlations via Human Annotations· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI)