PEEKABOO: Hiding parts of an image for unsupervised object localization

Hasib Zunair; A. Ben Hamza

arXiv:2407.17628·cs.CV·July 26, 2024

PEEKABOO: Hiding parts of an image for unsupervised object localization

Hasib Zunair, A. Ben Hamza

PDF

Open Access 1 Repo

TL;DR

PEEKABOO introduces a simple, effective single-stage framework for unsupervised object localization by hiding parts of images and using remaining context to infer object locations, outperforming existing methods.

Contribution

It proposes a novel context-based learning approach with image masking for unsupervised object localization, reducing computational complexity and improving accuracy.

Findings

01

Competitive performance on benchmark datasets

02

Effective in both object discovery and salient object detection

03

Simplifies the unsupervised localization process

Abstract

Localizing objects in an unsupervised manner poses significant challenges due to the absence of key visual information such as the appearance, type and number of objects, as well as the lack of labeled object classes typically available in supervised settings. While recent approaches to unsupervised object localization have demonstrated significant progress by leveraging self-supervised visual representations, they often require computationally intensive training processes, resulting in high resource demands in terms of computation, learnable parameters, and data. They also lack explicit modeling of visual context, potentially limiting their accuracy in object localization. To tackle these challenges, we propose a single-stage learning framework, dubbed PEEKABOO, for unsupervised object localization by learning context-based representations at both the pixel- and shape-level of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hasibzunair/peekaboo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications