TL;DR
This paper introduces an online annotation module that enhances low-shot object detection by generating reliable annotations from weakly labeled data, significantly improving performance on standard benchmarks.
Contribution
The paper presents the first online annotation module that learns to generate many-shot annotations from weakly labeled images, integrated with existing detectors for end-to-end training.
Findings
Improves Fast R-CNN performance by 17% mAP on PASCAL VOC 2007.
Achieves 9% AP50 improvement on MS-COCO.
Outperforms existing mixed supervision methods.
Abstract
Object detection has witnessed significant progress by relying on large, manually annotated datasets. Annotating such datasets is highly time consuming and expensive, which motivates the development of weakly supervised and few-shot object detection methods. However, these methods largely underperform with respect to their strongly supervised counterpart, as weak training signals \emph{often} result in partial or oversized detections. Towards solving this problem we introduce, for the first time, an online annotation module (OAM) that learns to generate a many-shot set of \emph{reliable} annotations from a larger volume of weakly labelled images. Our OAM can be jointly trained with any fully supervised two-stage object detection method, providing additional training annotations on the fly. This results in a fully end-to-end strategy that only requires a low-shot set of fully annotated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
