W2N:Switching From Weak Supervision to Noisy Supervision for Object Detection
Zitong Huang, Yiping Bao, Bowen Dong, Erjin Zhou, Wangmeng Zuo

TL;DR
This paper introduces W2N, a novel framework that transitions from weak supervision to noisy supervision in object detection, using iterative refinement of pseudo labels to improve detection accuracy beyond existing methods.
Contribution
The paper proposes a new paradigm and a two-module iterative training algorithm for better pseudo label refinement in weakly-supervised object detection.
Findings
W2N outperforms existing WSOD methods on benchmarks.
The localization adaptation module improves pseudo ground-truth quality.
The semi-supervised module enhances detector training with high-quality labels.
Abstract
Weakly-supervised object detection (WSOD) aims to train an object detector only requiring the image-level annotations. Recently, some works have managed to select the accurate boxes generated from a well-trained WSOD network to supervise a semi-supervised detection framework for better performance. However, these approaches simply divide the training set into labeled and unlabeled sets according to the image-level criteria, such that sufficient mislabeled or wrongly localized box predictions are chosen as pseudo ground-truths, resulting in a sub-optimal solution of detection performance. To overcome this issue, we propose a novel WSOD framework with a new paradigm that switches from weak supervision to noisy supervision (W2N). Generally, with given pseudo ground-truths generated from the well-trained WSOD network, we propose a two-module iterative training algorithm to refine pseudo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
