Discriminative Sampling of Proposals in Self-Supervised Transformers for   Weakly Supervised Object Localization

Shakeeb Murtaza; Soufiane Belharbi; Marco Pedersoli; Aydin Sarraf,; Eric Granger

arXiv:2209.09209·cs.CV·November 22, 2022·1 cites

Discriminative Sampling of Proposals in Self-Supervised Transformers for Weakly Supervised Object Localization

Shakeeb Murtaza, Soufiane Belharbi, Marco Pedersoli, Aydin Sarraf,, Eric Granger

PDF

Open Access

TL;DR

This paper introduces a novel discriminative proposal sampling method leveraging self-supervised transformers and CNN classifiers to improve weakly supervised object localization, demonstrating superior results on drone and bird datasets.

Contribution

The paper proposes DiPS, a new approach that uses multiple transformer heads and CNN-based discriminative sampling to enhance WSOL performance.

Findings

01

Outperforms state-of-the-art on TelDrone dataset

02

Effective on CUB dataset for different tasks

03

Improves object localization accuracy

Abstract

Drones are employed in a growing number of visual recognition applications. A recent development in cell tower inspection is drone-based asset surveillance, where the autonomous flight of a drone is guided by localizing objects of interest in successive aerial images. In this paper, we propose a method to train deep weakly-supervised object localization (WSOL) models based only on image-class labels to locate object with high confidence. To train our localizer, pseudo labels are efficiently harvested from a self-supervised vision transformers (SSTs). However, since SSTs decompose the scene into multiple maps containing various object parts, and do not rely on any explicit supervisory signal, they cannot distinguish between the object of interest and other objects, as required WSOL. To address this issue, we propose leveraging the multiple maps generated by the different transformer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Domain Adaptation and Few-Shot Learning