MaskTune: Mitigating Spurious Correlations by Forcing to Explore
Saeid Asgari Taghanaki, Aliasghar Khani, Fereshte Khani, Ali Gholami,, Linh Tran, Ali Mahdavi-Amiri, Ghassan Hamarneh

TL;DR
MaskTune is a masking strategy that enhances model robustness by encouraging exploration of new features during finetuning, effectively mitigating reliance on spurious correlations without requiring supervision.
Contribution
This paper introduces MaskTune, a novel, unsupervised masking method that prevents overfitting to spurious features during model finetuning.
Findings
Effective on biased datasets like MNIST, CelebA, Waterbirds, and ImagenNet-9L.
Outperforms or matches existing methods in selective classification tasks.
Does not require supervision or annotations for spurious features.
Abstract
A fundamental challenge of over-parameterized deep learning models is learning meaningful data representations that yield good performance on a downstream task without over-fitting spurious input features. This work proposes MaskTune, a masking strategy that prevents over-reliance on spurious (or a limited number of) features. MaskTune forces the trained model to explore new features during a single epoch finetuning by masking previously discovered features. MaskTune, unlike earlier approaches for mitigating shortcut learning, does not require any supervision, such as annotating spurious features or labels for subgroup samples in a dataset. Our empirical results on biased MNIST, CelebA, Waterbirds, and ImagenNet-9L datasets show that MaskTune is effective on tasks that often suffer from the existence of spurious correlations. Finally, we show that MaskTune outperforms or achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis
