WSSOD: A New Pipeline for Weakly- and Semi-Supervised Object Detection
Shijie Fang, Yuhang Cao, Xinjiang Wang, Kai Chen, Dahua Lin, Wayne, Zhang

TL;DR
This paper introduces WSSOD, a novel weakly- and semi-supervised object detection framework that effectively balances detection accuracy and annotation cost by leveraging both fully and weakly labeled data through a two-stage learning process and innovative strategies.
Contribution
The paper proposes a new WSSOD framework with a two-stage learning process, incorporating weakly-supervised loss, label attention, and pseudo-label sampling to improve detection with limited annotations.
Findings
Achieves high detection performance with only one third of annotations.
Demonstrates effectiveness on PASCAL-VOC and MSCOCO datasets.
Comparable to fully-supervised methods in accuracy.
Abstract
The performance of object detection, to a great extent, depends on the availability of large annotated datasets. To alleviate the annotation cost, the research community has explored a number of ways to exploit unlabeled or weakly labeled data. However, such efforts have met with limited success so far. In this work, we revisit the problem with a pragmatic standpoint, trying to explore a new balance between detection performance and annotation cost by jointly exploiting fully and weakly annotated data. Specifically, we propose a weakly- and semi-supervised object detection framework (WSSOD), which involves a two-stage learning procedure. An agent detector is first trained on a joint dataset and then used to predict pseudo bounding boxes on weakly-annotated images. The underlying assumptions in the current as well as common semi-supervised pipelines are also carefully examined under a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
