Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation
Aahlad Puli, Nitish Joshi, Yoav Wald, He He, Rajesh Ranganath

TL;DR
This paper introduces a data augmentation method that corrupts semantic features to identify and adjust for nuisance correlations, improving model robustness across various out-of-distribution tasks.
Contribution
It proposes a novel approach to use semantic corruptions in data to detect and mitigate nuisance-label correlations, enhancing robustness without relying on nuisance annotations.
Findings
Effective in reducing spurious correlations in OOD tasks
Improves model robustness across vision and language tasks
Outperforms baseline methods in experiments
Abstract
In prediction tasks, there exist features that are related to the label in the same way across different settings for that task; these are semantic features or semantics. Features with varying relationships to the label are nuisances. For example, in detecting cows from natural images, the shape of the head is semantic but because images of cows often have grass backgrounds but not always, the background is a nuisance. Models that exploit nuisance-label relationships face performance degradation when these relationships change. Building models robust to such changes requires additional knowledge beyond samples of the features and labels. For example, existing work uses annotations of nuisances or assumes ERM-trained models depend on nuisances. Approaches to integrate new kinds of additional knowledge enlarge the settings where robust models can be built. We develop an approach to use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
