ASPIRE: Language-Guided Data Augmentation for Improving Robustness   Against Spurious Correlations

Sreyan Ghosh; Chandra Kiran Reddy Evuru; Sonal Kumar; Utkarsh Tyagi,; Sakshi Singh; Sanjoy Chowdhury; Dinesh Manocha

arXiv:2308.10103·cs.CV·June 10, 2024

ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations

Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi,, Sakshi Singh, Sanjoy Chowdhury, Dinesh Manocha

PDF

Open Access 1 Repo 1 Video

TL;DR

ASPIRE uses language-guided image editing and data augmentation to improve neural classifiers' robustness against spurious correlations, enhancing performance in real-world scenarios.

Contribution

It introduces a novel language-guided data augmentation method that generates non-spurious images without requiring group labels or existing non-spurious images.

Findings

01

Improves worst-group accuracy by up to 38%.

02

Effective across multiple datasets and baseline methods.

03

Contributes a new challenging test set for Hard ImageNet.

Abstract

Neural image classifiers can often learn to make predictions by overly relying on non-predictive features that are spuriously correlated with the class labels in the training data. This leads to poor performance in real-world atypical scenarios where such features are absent. This paper presents ASPIRE (Language-guided Data Augmentation for SPurIous correlation REmoval), a simple yet effective solution for supplementing the training dataset with images without spurious features, for robust learning against spurious correlations via better generalization. ASPIRE, guided by language at various steps, can generate non-spurious images without requiring any group labeling or existing non-spurious images in the training set. Precisely, we employ LLMs to first extract foreground and background features from textual descriptions of an image, followed by advanced language-guided image editing to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sreyan88/aspire
pytorchOfficial

Videos

ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling