Semi-Weakly Supervised Object Detection by Sampling Pseudo Ground-Truth Boxes
Akhil Meethal, Marco Pedersoli, Zhongwen Zhu, Francisco Perdigon, Romero, and Eric Granger

TL;DR
This paper introduces a sampling-based semi-weakly supervised object detection method that generates pseudo-ground-truth boxes online, reducing training complexity and achieving state-of-the-art results with minimal fully-labeled data.
Contribution
A novel sampling strategy for pseudo-ground-truth generation that simplifies training and enhances detection performance with limited fully-labeled images.
Findings
Improves detection performance by 5% on Pascal VOC datasets.
Achieves over 10% mAP increase with only 5-10% fully annotated images.
Eliminates multi-stage training and student-teacher configurations.
Abstract
Semi- and weakly-supervised learning have recently attracted considerable attention in the object detection literature since they can alleviate the cost of annotation needed to successfully train deep learning models. State-of-art approaches for semi-supervised learning rely on student-teacher models trained using a multi-stage process, and considerable data augmentation. Custom networks have been developed for the weakly-supervised setting, making it difficult to adapt to different detectors. In this paper, a weakly semi-supervised training method is introduced that reduces these training challenges, yet achieves state-of-the-art performance by leveraging only a small fraction of fully-labeled images with information in weakly-labeled images. In particular, our generic sampling-based learning strategy produces pseudo-ground-truth (GT) bounding box annotations in an online fashion,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
