WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection

Shijie Fang; Yuhang Cao; Xinjiang Wang; Kai Chen; Dahua Lin; Wayne; Zhang

arXiv:2105.11293·cs.CV·May 25, 2021·5 cites

WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection

Shijie Fang, Yuhang Cao, Xinjiang Wang, Kai Chen, Dahua Lin, Wayne, Zhang

PDF

Open Access

TL;DR

This paper introduces WSSOD, a novel weakly- and semi-supervised object detection framework that effectively balances detection accuracy and annotation cost by leveraging both fully and weakly labeled data through a two-stage learning process and innovative strategies.

Contribution

The paper proposes a new WSSOD framework with a two-stage learning process, incorporating weakly-supervised loss, label attention, and pseudo-label sampling to improve detection with limited annotations.

Findings

01

Achieves high detection performance with only one third of annotations.

02

Demonstrates effectiveness on PASCAL-VOC and MSCOCO datasets.

03

Comparable to fully-supervised methods in accuracy.

Abstract

The performance of object detection, to a great extent, depends on the availability of large annotated datasets. To alleviate the annotation cost, the research community has explored a number of ways to exploit unlabeled or weakly labeled data. However, such efforts have met with limited success so far. In this work, we revisit the problem with a pragmatic standpoint, trying to explore a new balance between detection performance and annotation cost by jointly exploiting fully and weakly annotated data. Specifically, we propose a weakly- and semi-supervised object detection framework (WSSOD), which involves a two-stage learning procedure. An agent detector is first trained on a joint dataset and then used to predict pseudo bounding boxes on weakly-annotated images. The underlying assumptions in the current as well as common semi-supervised pipelines are also carefully examined under a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning