Two-Phase Learning for Weakly Supervised Object Localization

Dahun Kim; Donghyeon Cho; Donggeun Yoo; and In So Kweon

arXiv:1708.02108·cs.CV·August 17, 2017·40 cites

Two-Phase Learning for Weakly Supervised Object Localization

Dahun Kim, Donghyeon Cho, Donggeun Yoo, and In So Kweon

PDF

Open Access

TL;DR

This paper introduces a two-phase learning approach for weakly supervised object localization that progressively captures the entire object by suppressing salient parts in the second phase.

Contribution

The paper proposes a novel two-phase training scheme that enhances object localization by iteratively focusing on less salient regions, improving upon existing weakly supervised methods.

Findings

01

Effective in capturing entire objects rather than just salient parts

02

Improves performance in weakly supervised segmentation and localization tasks

03

Demonstrates significant gains in experimental evaluations

Abstract

Weakly supervised semantic segmentation and localiza- tion have a problem of focusing only on the most important parts of an image since they use only image-level annota- tions. In this paper, we solve this problem fundamentally via two-phase learning. Our networks are trained in two steps. In the first step, a conventional fully convolutional network (FCN) is trained to find the most discriminative parts of an image. In the second step, the activations on the most salient parts are suppressed by inference conditional feedback, and then the second learning is performed to find the area of the next most important parts. By combining the activations of both phases, the entire portion of the tar- get object can be captured. Our proposed training scheme is novel and can be utilized in well-designed techniques for weakly supervised semantic segmentation, salient region detection, and object…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques