Gram-SLD: Automatic Self-labeling and Detection for Instance Objects

Rui Wang, Chengtun Wu, Jiawen Xin, and Liang Zhang

arXiv:2112.03641·cs.CV·December 8, 2021

TL;DR

This paper introduces Gram-SLD, a co-training framework that automatically labels data for instance object detection, significantly reducing manual annotation effort while maintaining high detection accuracy in complex environments.

Contribution

The paper proposes a novel Gram-SLD framework that uses gram loss, view construction, and sample selection to generate high-quality pseudo-labels with minimal manual annotation.

Findings

01. Achieves less than 2% mAP loss with only 5% labeled data.

02. Demonstrates competitive performance on multiple datasets.

03. Satisfies real-time and accuracy requirements in complex environments.

Abstract

Instance object detection plays an important role in intelligent monitoring, visual navigation, human-computer interaction, intelligent services, and other fields. Inspired by the great success of deep convolutional neural networks (DCNNs), DCNN-based instance object detection has become a promising research topic. DCNNs require large-scale annotated datasets to supervise their training, yet manual annotation is exhausting and time-consuming. To address this problem, we propose a new co-training-based framework called Gram Self-Labeling and Detection (Gram-SLD). Gram-SLD can automatically annotate a large amount of data from a very limited set of manually labeled key data and achieve competitive performance. In our framework, gram loss is defined and used to construct two fully redundant and independent views and a key sample selection strategy along with an automatic…
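The abstract is cut off before it describes how the two views and the sample selection interact, so the following is only a minimal sketch of the generic co-training idea, not the Gram-SLD procedure itself. It assumes each view produces one class label and one confidence score per unlabeled image, and it keeps a pseudo-label only when both views agree and both are confident. The names `Prediction` and `select_pseudo_labels` and the threshold value are hypothetical, chosen for illustration.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Prediction:
    label: str    # predicted class for the sample
    score: float  # confidence in [0, 1]


def select_pseudo_labels(
    view_a: Dict[str, Prediction],
    view_b: Dict[str, Prediction],
    threshold: float = 0.9,  # assumed cutoff, not taken from the paper
) -> Dict[str, str]:
    """Keep samples on which both views agree with high confidence."""
    selected: Dict[str, str] = {}
    for sample_id, pred_a in view_a.items():
        pred_b = view_b.get(sample_id)
        if pred_b is None:
            continue  # sample was not scored by the second view
        agree = pred_a.label == pred_b.label
        confident = min(pred_a.score, pred_b.score) >= threshold
        if agree and confident:
            selected[sample_id] = pred_a.label
    return selected


if __name__ == "__main__":
    a = {"img1": Prediction("cup", 0.95), "img2": Prediction("cup", 0.97)}
    b = {"img1": Prediction("cup", 0.93), "img2": Prediction("box", 0.99)}
    print(select_pseudo_labels(a, b))
```

In a co-training loop, the selected pseudo-labels would typically be added to the training set before each view is retrained. How Gram-SLD actually selects key samples is not recoverable from the truncated abstract.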

Taxonomy

Topics: Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications

Methods: Diffusion-Convolutional Neural Networks