The Invisible Gorilla Effect in Out-of-distribution Detection

Harry Anthony; Ziyun Liang; Hermione Warr; Konstantinos Kamnitsas

arXiv:2602.20068·cs.CV·February 24, 2026

The Invisible Gorilla Effect in Out-of-distribution Detection

Harry Anthony, Ziyun Liang, Hermione Warr, Konstantinos Kamnitsas

PDF

Open Access

TL;DR

This paper uncovers a bias in out-of-distribution detection where detection performance improves when artefacts resemble the model's ROI, revealing a failure mode called the Invisible Gorilla Effect that impacts robustness.

Contribution

It identifies the Invisible Gorilla Effect in OOD detection, demonstrating how artefact similarity influences detection performance and providing a comprehensive evaluation across multiple methods and datasets.

Findings

01

Detection improves with artefact similarity to ROI

02

Most OOD methods show performance drops on dissimilar artefacts

03

Annotated artefacts and generated counterfactuals reveal bias in detection methods

Abstract

Deep Neural Networks achieve high performance in vision tasks by learning features from regions of interest (ROI) within images, but their performance degrades when deployed on out-of-distribution (OOD) data that differs from training data. This challenge has led to OOD detection methods that aim to identify and reject unreliable predictions. Although prior work shows that OOD detection performance varies by artefact type, the underlying causes remain underexplored. To this end, we identify a previously unreported bias in OOD detection: for hard-to-detect artefacts (near-OOD), detection performance typically improves when the artefact shares visual similarity (e.g. colour) with the model's ROI and drops when it does not - a phenomenon we term the Invisible Gorilla Effect. For example, in a skin lesion classifier with red lesion ROI, we show the method Mahalanobis Score achieves a 31.5%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCutaneous Melanoma Detection and Management · Advanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis