Gram-SLD: Automatic Self-labeling and Detection for Instance Objects
Rui Wang, Chengtun Wu, Jiawen Xin, and Liang Zhang

TL;DR
This paper introduces Gram-SLD, a co-training framework that automatically labels data for instance object detection, significantly reducing manual annotation effort while maintaining high detection accuracy in complex environments.
Contribution
The paper proposes a novel Gram-SLD framework that uses gram loss, view construction, and sample selection to generate high-quality pseudo-labels with minimal manual annotation.
Findings
Achieves less than 2% mAP loss with only 5% labeled data.
Demonstrates competitive performance on multiple datasets.
Satisfies real-time and accuracy requirements in complex environments.
Abstract
Instance object detection plays an important role in intelligent monitoring, visual navigation, human-computer interaction, intelligent services and other fields. Inspired by the great success of Deep Convolutional Neural Network (DCNN), DCNN-based instance object detection has become a promising research topic. To address the problem that DCNN always requires a large-scale annotated dataset to supervise its training while manual annotation is exhausting and time-consuming, we propose a new framework based on co-training called Gram Self-Labeling and Detection (Gram-SLD). The proposed Gram-SLD can automatically annotate a large amount of data with very limited manually labeled key data and achieve competitive performance. In our framework, gram loss is defined and used to construct two fully redundant and independent views and a key sample selection strategy along with an automatic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
MethodsDiffusion-Convolutional Neural Networks
