ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi,, Sakshi Singh, Sanjoy Chowdhury, Dinesh Manocha

TL;DR
ASPIRE uses language-guided image editing and data augmentation to improve neural classifiers' robustness against spurious correlations, enhancing performance in real-world scenarios.
Contribution
It introduces a novel language-guided data augmentation method that generates non-spurious images without requiring group labels or existing non-spurious images.
Findings
Improves worst-group accuracy by up to 38%.
Effective across multiple datasets and baseline methods.
Contributes a new challenging test set for Hard ImageNet.
Abstract
Neural image classifiers can often learn to make predictions by overly relying on non-predictive features that are spuriously correlated with the class labels in the training data. This leads to poor performance in real-world atypical scenarios where such features are absent. This paper presents ASPIRE (Language-guided Data Augmentation for SPurIous correlation REmoval), a simple yet effective solution for supplementing the training dataset with images without spurious features, for robust learning against spurious correlations via better generalization. ASPIRE, guided by language at various steps, can generate non-spurious images without requiring any group labeling or existing non-spurious images in the training set. Precisely, we employ LLMs to first extract foreground and background features from textual descriptions of an image, followed by advanced language-guided image editing to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling
