Robustness of Object Recognition under Extreme Occlusion in Humans and   Computational Models

Hongru Zhu; Peng Tang; Jeongho Park; Soojin Park; Alan Yuille

arXiv:1905.04598·cs.CV·June 5, 2019·33 cites

Robustness of Object Recognition under Extreme Occlusion in Humans and Computational Models

Hongru Zhu, Peng Tang, Jeongho Park, Soojin Park, Alan Yuille

PDF

Open Access 1 Repo

TL;DR

This study compares human and computational object recognition under extreme occlusion, revealing humans' robustness, CNNs' limitations, and the effectiveness of compositional models in handling real-world occlusion.

Contribution

It demonstrates that current CNNs lack robustness to real-world occlusion and introduces a compositional model that performs better, aligning more closely with human recognition patterns.

Findings

01

Humans are highly robust to extreme occlusion.

02

Standard CNNs perform poorly under extreme occlusion.

03

Compositional models outperform CNNs and resemble human error patterns.

Abstract

Most objects in the visual world are partially occluded, but humans can recognize them without difficulty. However, it remains unknown whether object recognition models like convolutional neural networks (CNNs) can handle real-world occlusion. It is also a question whether efforts to make these models robust to constant mask occlusion are effective for real-world occlusion. We test both humans and the above-mentioned computational models in a challenging task of object recognition under extreme occlusion, where target objects are heavily occluded by irrelevant real objects in real backgrounds. Our results show that human vision is very robust to extreme occlusion while CNNs are not, even with modifications to handle constant mask occlusion. This implies that the ability to handle constant mask occlusion does not entail robustness to real-world occlusion. As a comparison, we propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mattfeng/recurrent-extreme-occlusion
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning