DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised   Transformers for Weakly Supervised Object Localization

Shakeeb Murtaza; Soufiane Belharbi; Marco Pedersoli; Aydin Sarraf,; Eric Granger

arXiv:2310.06196·cs.CV·October 20, 2023

DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised Transformers for Weakly Supervised Object Localization

Shakeeb Murtaza, Soufiane Belharbi, Marco Pedersoli, Aydin Sarraf,, Eric Granger

PDF

Open Access 1 Repo

TL;DR

This paper introduces DiPS, a novel method leveraging self-supervised transformers and discriminative pseudo-label sampling to improve weakly-supervised object localization using only image-class labels.

Contribution

DiPS uses class-agnostic attention maps and a pre-trained classifier to select discriminative regions, training a transformer-based model for improved localization and classification.

Findings

01

Outperforms state-of-the-art WSOL methods on multiple datasets

02

Effective in distinguishing objects from background noise

03

Enhances object coverage with diverse proposals

Abstract

Self-supervised vision transformers (SSTs) have shown great potential to yield rich localization maps that highlight different objects in an image. However, these maps remain class-agnostic since the model is unsupervised. They often tend to decompose the image into multiple maps containing different objects while being unable to distinguish the object of interest from background noise objects. In this paper, Discriminative Pseudo-label Sampling (DiPS) is introduced to leverage these class-agnostic maps for weakly-supervised object localization (WSOL), where only image-class labels are available. Given multiple attention maps, DiPS relies on a pre-trained classifier to identify the most discriminative regions of each attention map. This ensures that the selected ROIs cover the correct image object while discarding the background ones, and, as such, provides a rich pool of diverse and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shakeebmurtaza/dips
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning