Localized Shortcut Removal

Nicolas M. Müller; Jochen Jacobs; Jennifer Williams; Konstantin Böttinger

arXiv:2211.15510·cs.CV·May 24, 2023

TL;DR

This paper introduces an adversarial method that detects and removes localized shortcuts in image datasets, with the aim of improving model generalization without degrading performance on clean data.

Contribution

The paper proposes an adversarially trained lens that identifies and neutralizes localized shortcuts in images, targeting datasets where the shortcuts are smaller and more localized than the true features.

Findings

01 Successfully detects and removes localized shortcuts in synthetic and real data

02 Maintains model performance on clean data after shortcut removal

03 Improves model generalization and robustness

Abstract

Machine learning is a data-driven field, and the quality of the underlying datasets plays a crucial role in learning success. However, high performance on held-out test data does not necessarily indicate that a model generalizes or learns anything meaningful. This is often due to the existence of machine learning shortcuts - features in the data that are predictive but unrelated to the problem at hand. To address this issue for datasets where the shortcuts are smaller and more localized than true features, we propose a novel approach to detect and remove them. We use an adversarially trained lens to detect and eliminate highly predictive but semantically unconnected clues in images. In our experiments on both synthetic and real-world data, we show that our proposed approach reliably identifies and neutralizes such shortcuts without causing degradation of model performance on clean data.…
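To make the notion of a localized shortcut concrete, the following is a minimal toy sketch, not the adversarially trained lens described in the abstract. It assumes 8x8 grayscale "images" in which the true feature is overall brightness and the shortcut is a single saturated corner pixel that appears only in class 1. All names (`make_image`, `corner_classifier`, `neutralize_corner`, `run_demo`) and all numeric values are hypothetical and chosen purely for illustration. In particular, the shortcut location is hard-coded here, whereas the paper's approach is meant to find such regions automatically.

```python
import random

SIZE = 8  # toy images are SIZE x SIZE grids of floats in [0, 1]


def make_image(label, rng, with_shortcut):
    """Build one toy image. All names and values here are illustrative."""
    # "True" feature (global): class-1 images are brighter everywhere.
    base = 0.6 if label == 1 else 0.4
    img = [[rng.uniform(base - 0.08, base + 0.08) for _ in range(SIZE)]
           for _ in range(SIZE)]
    if with_shortcut and label == 1:
        img[0][0] = 1.0  # localized shortcut: a single saturated corner pixel
    return img


def corner_classifier(img):
    """A 'shortcut learner' that looks only at the corner pixel."""
    return 1 if img[0][0] > 0.9 else 0


def brightness_classifier(img):
    """A classifier that uses the global (true) feature: mean brightness."""
    mean = sum(sum(row) for row in img) / (SIZE * SIZE)
    return 1 if mean > 0.5 else 0


def neutralize_corner(img):
    """Hand-coded stand-in for shortcut removal (assumption, not the lens):
    overwrite the corner pixel with the mean of its three neighbours."""
    out = [row[:] for row in img]
    out[0][0] = (img[0][1] + img[1][0] + img[1][1]) / 3.0
    return out


def accuracy(classifier, dataset):
    """Fraction of (image, label) pairs the classifier gets right."""
    hits = sum(1 for img, label in dataset if classifier(img) == label)
    return hits / len(dataset)


def run_demo(n=200, seed=0):
    rng = random.Random(seed)
    labels = [i % 2 for i in range(n)]  # balanced classes
    tainted = [(make_image(y, rng, True), y) for y in labels]
    cleaned = [(neutralize_corner(img), y) for img, y in tainted]
    return {
        "corner_on_tainted": accuracy(corner_classifier, tainted),
        "corner_on_cleaned": accuracy(corner_classifier, cleaned),
        "brightness_on_tainted": accuracy(brightness_classifier, tainted),
        "brightness_on_cleaned": accuracy(brightness_classifier, cleaned),
    }


if __name__ == "__main__":
    for name, value in run_demo().items():
        print(f"{name}: {value:.2f}")
```

In this toy setup the corner-only classifier scores 1.00 on the tainted data and falls to chance (0.50) once the corner pixel is overwritten, while the brightness-based classifier stays at 1.00 in both cases. This mirrors, in a deliberately simplified form, the abstract's goal of neutralizing a shortcut without degrading performance on the true feature.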

Taxonomy

Topics: Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · COVID-19 diagnosis using AI

Methods: Test