Box-based Refinement for Weakly Supervised and Unsupervised Localization   Tasks

Eyal Gomel; Tal Shaharabany; Lior Wolf

arXiv:2309.03874·cs.CV·September 8, 2023

Box-based Refinement for Weakly Supervised and Unsupervised Localization Tasks

Eyal Gomel, Tal Shaharabany, Lior Wolf

PDF

Open Access 1 Repo

TL;DR

This paper introduces a box-based refinement method that enhances weakly supervised and unsupervised localization tasks by training detectors on network outputs, leading to significant performance improvements.

Contribution

The paper proposes a novel box-based refinement approach that trains detectors on network outputs rather than raw images, improving localization performance in weakly supervised and unsupervised tasks.

Findings

01

Improved phrase grounding performance.

02

Enhanced unsupervised object discovery.

03

Detectors trained on network outputs yield better localization.

Abstract

It has been established that training a box-based detector network can enhance the localization performance of weakly supervised and unsupervised methods. Moreover, we extend this understanding by demonstrating that these detectors can be utilized to improve the original network, paving the way for further advancements. To accomplish this, we train the detectors on top of the network output instead of the image data and apply suitable loss backpropagation. Our findings reveal a significant improvement in phrase grounding for the ``what is where by looking'' task, as well as various methods of unsupervised object discovery. Our code is available at https://github.com/eyalgomel/box-based-refinement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eyalgomel/box-based-refinement
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning