Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection
Tiancai Wang, Tong Yang, Jiale Cao, Xiangyu Zhang

TL;DR
This paper introduces Co-mining, a self-supervised learning mechanism using Siamese networks to improve object detection performance with sparse annotations, demonstrating consistent gains across different detectors and settings.
Contribution
Proposes a novel Co-mining mechanism that enhances sparsely annotated object detection by mutual pseudo-label prediction between Siamese network branches.
Findings
Achieves 1.4% to 2.1% improvements on MS COCO with RetinaNet.
Surpasses existing methods under the same sparse annotation settings.
Effective for both anchor-based and anchor-free detectors.
Abstract
Object detectors usually achieve promising results with the supervision of complete instance annotations. However, their performance is far from satisfactory with sparse instance annotations. Most existing methods for sparsely annotated object detection either re-weight the loss of hard negative samples or convert the unlabeled instances into ignored regions to reduce the interference of false negatives. We argue that these strategies are insufficient since they can at most alleviate the negative effect caused by missing annotations. In this paper, we propose a simple but effective mechanism, called Co-mining, for sparsely annotated object detection. In our Co-mining, two branches of a Siamese network predict the pseudo-label sets for each other. To enhance multi-view learning and better mine unlabeled instances, the original image and corresponding augmented image are used as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Remote-Sensing Image Classification
Methods1x1 Convolution · Convolution · Feature Pyramid Network · Non Maximum Suppression · Siamese Network · Focal Loss · RetinaNet · FCOS
