HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection
Liujuan Cao, Jianghang Lin, Zebo Hong, Yunhang Shen, Shaohui Lin, Chao, Chen, Rongrong Ji

TL;DR
HUWSOD introduces a unified self-training framework for weakly supervised object detection that replaces traditional proposals with self-supervised modules, achieving competitive results without external supervision.
Contribution
The paper presents HUWSOD, a novel end-to-end WSOD network utilizing self-supervised proposal generation and holistic self-training, eliminating the need for external modules or additional supervision.
Findings
HUWSOD achieves performance close to fully-supervised Faster R-CNN.
Self-supervised proposal generators outperform traditional object proposals.
Randomly initialized boxes are effective for WSOD training.
Abstract
Most WSOD methods rely on traditional object proposals to generate candidate regions and are confronted with unstable training, which easily gets stuck in a poor local optimum. In this paper, we introduce a unified, high-capacity weakly supervised object detection (WSOD) network called HUWSOD, which utilizes a comprehensive self-training framework without needing external modules or additional supervision. HUWSOD innovatively incorporates a self-supervised proposal generator and an autoencoder proposal generator with a multi-rate resampling pyramid to replace traditional object proposals, enabling end-to-end WSOD training and inference. Additionally, we implement a holistic self-training scheme that refines detection scores and coordinates through step-wise entropy minimization and consistency-constraint regularization, ensuring consistent predictions across stochastic augmentations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsRegion Proposal Network · Softmax · RoIPool · Convolution · Faster R-CNN
