Localising Shortcut Learning in Pixel Space via Ordinal Scoring Correlations for Attribution Representations (OSCAR)
Akshit Achara, Peter Triantafillou, Esther Puyol-Ant\'on, Alexander Hammers, Andrew P. King

TL;DR
OSCAR is a model-agnostic framework that quantifies and localizes shortcut learning in neural networks by converting attribution maps into dataset-level rank profiles and analyzing correlations, applicable even in non-human-visible domains.
Contribution
Introduces OSCAR, a novel quantitative method for localizing shortcut features in pixel space, overcoming limitations of qualitative, image-level inspection methods.
Findings
Correlations are stable across seeds and data splits.
Method detects the level of shortcut association in training data.
Can distinguish localized from diffuse shortcut features.
Abstract
Deep neural networks often exploit shortcuts. These are spurious cues which are associated with output labels in the training data but are unrelated to task semantics. When the shortcut features are associated with sensitive attributes, shortcut learning can lead to biased model performance. Existing methods for localising and understanding shortcut learning are mostly based upon qualitative, image-level inspection and assume cues are human-visible, limiting their use in domains such as medical imaging. We introduce OSCAR (Ordinal Scoring Correlations for Attribution Representations), a model-agnostic framework for quantifying shortcut learning and localising shortcut features. OSCAR converts image-level task attribution maps into dataset-level rank profiles of image regions and compares them across three models: a balanced baseline model (BA), a test model (TS), and a sensitive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications
