Weakly-supervised DCNN for RGB-D Object Recognition in Real-World Applications Which Lack Large-scale Annotated Training Data
Li Sun, Cheng Zhao, Rustam Stolkin

TL;DR
This paper introduces a weakly-supervised deep learning framework for RGB-D object recognition that effectively utilizes limited labeled data and unlabeled videos, achieving real-time detection without bounding-box annotations.
Contribution
The novel DCNN-GPC architecture combines parametric and non-parametric models for effective weakly-supervised learning in RGB-D recognition tasks.
Findings
Achieved performance comparable to state-of-the-art on public datasets.
Effectively labeled large-scale unlabeled data using small annotated samples.
Enabled real-time detection without bounding-box annotations.
Abstract
This paper addresses the problem of RGBD object recognition in real-world applications, where large amounts of annotated training data are typically unavailable. To overcome this problem, we propose a novel, weakly-supervised learning architecture (DCNN-GPC) which combines parametric models (a pair of Deep Convolutional Neural Networks (DCNN) for RGB and D modalities) with non-parametric models (Gaussian Process Classification). Our system is initially trained using a small amount of labeled data, and then automatically prop- agates labels to large-scale unlabeled data. We first run 3D- based objectness detection on RGBD videos to acquire many unlabeled object proposals, and then employ DCNN-GPC to label them. As a result, our multi-modal DCNN can be trained end-to-end using only a small amount of human annotation. Finally, our 3D-based objectness detection and multi-modal DCNN are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Video Surveillance and Tracking Methods
MethodsDiffusion-Convolutional Neural Networks
