ShortcutProbe: Probing Prediction Shortcuts for Learning Robust Models
Guangtao Zheng, Wenqian Ye, Aidong Zhang

TL;DR
ShortcutProbe is a post hoc framework that detects and mitigates spurious prediction shortcuts in deep learning models without needing group labels, enhancing robustness against biases.
Contribution
It introduces a novel method to identify and remove prediction shortcuts, improving model robustness without costly annotations or subtle bias detection.
Findings
Effectively identifies prediction shortcuts in models.
Improves robustness to spurious biases across datasets.
Does not require group labels for bias mitigation.
Abstract
Deep learning models often achieve high performance by inadvertently learning spurious correlations between targets and non-essential features. For example, an image classifier may identify an object via its background that spuriously correlates with it. This prediction behavior, known as spurious bias, severely degrades model performance on data that lacks the learned spurious correlations. Existing methods on spurious bias mitigation typically require a variety of data groups with spurious correlation annotations called group labels. However, group labels require costly human annotations and often fail to capture subtle spurious biases such as relying on specific pixels for predictions. In this paper, we propose a novel post hoc spurious bias mitigation framework without requiring group labels. Our framework, termed ShortcutProbe, identifies prediction shortcuts that reflect potential…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Anomaly Detection Techniques and Applications
MethodsHigh-Order Consensuses
