Multiple Instance Detection Network with Online Instance Classifier Refinement
Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu

TL;DR
This paper introduces an end-to-end deep learning framework for weakly supervised object detection that refines instance classifiers online using multiple streams, achieving state-of-the-art results on PASCAL VOC benchmarks.
Contribution
It proposes a novel online instance classifier refinement method integrated into a deep network for weakly supervised detection, improving accuracy without requiring object location annotations.
Findings
Achieved 47% mAP on PASCAL VOC 2007
Outperformed previous state-of-the-art methods
Demonstrated effective online classifier refinement
Abstract
Of late, weakly supervised object detection is with great importance in object recognition. Based on deep learning, weakly supervised detectors have achieved many promising results. However, compared with fully supervised detection, it is more challenging to train deep network based detectors in a weakly supervised manner. Here we formulate weakly supervised detection as a Multiple Instance Learning (MIL) problem, where instance classifiers (object detectors) are put into the network as hidden nodes. We propose a novel online instance classifier refinement algorithm to integrate MIL and the instance classifier refinement procedure into a single deep network, and train the network end-to-end with only image-level supervision, i.e., without object location information. More precisely, instance labels inferred from weak supervision are propagated to their spatially overlapped instances to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
