TL;DR
This paper introduces a novel entropic adversarial data augmentation method to improve neural network robustness against spurious biases, especially in challenging synthetic benchmarks, without relying on minority counterexamples.
Contribution
It proposes a new architecture that uses entropic adversarial training to remove shortcut biases, outperforming existing methods on synthetic benchmarks and real datasets.
Findings
State-of-the-art strategies fail on proposed benchmarks.
The proposed method effectively removes shortcuts.
Competitive results on the BAR dataset.
Abstract
Deep neural networks do not discriminate between spurious and causal patterns, and will only learn the most predictive ones while ignoring the others. This shortcut learning behaviour is detrimental to a network's ability to generalize to an unknown test-time distribution in which the spurious correlations do not hold anymore. Debiasing methods were developed to make networks robust to such spurious biases but require to know in advance if a dataset is biased and make heavy use of minority counterexamples that do not display the majority bias of their class. In this paper, we argue that such samples should not be necessarily needed because the ''hidden'' causal information is often also contained in biased images. To study this idea, we propose 3 publicly released synthetic classification benchmarks, exhibiting predictive classification shortcuts, each of a different and challenging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
