Monocular SLAM Supported Object Recognition

Sudeep Pillai; John Leonard

arXiv:1506.01732·cs.RO·June 8, 2015

Monocular SLAM Supported Object Recognition

Sudeep Pillai, John Leonard

PDF

TL;DR

This paper presents a monocular SLAM-aware object recognition system that leverages multi-view proposals and efficient encoding to improve recognition accuracy and speed using a single RGB camera.

Contribution

It introduces a novel SLAM-aware recognition approach that integrates multi-view object proposals with efficient feature encoding for real-time performance.

Findings

01

Achieves stronger recognition performance than classical frame-by-frame systems

02

Operates in near-constant time with scalable runtime performance

03

Demonstrates effectiveness on the UW RGB-D Dataset

Abstract

In this work, we develop a monocular SLAM-aware object recognition system that is able to achieve considerably stronger recognition performance, as compared to classical object recognition systems that function on a frame-by-frame basis. By incorporating several key ideas including multi-view object proposals and efficient feature encoding methods, our proposed system is able to detect and robustly recognize objects in its environment using a single RGB camera in near-constant time. Through experiments, we illustrate the utility of using such a system to effectively detect and recognize objects, incorporating multiple object viewpoint detections into a unified prediction hypothesis. The performance of the proposed recognition system is evaluated on the UW RGB-D Dataset, showing strong recognition performance and scalable run-time performance compared to current state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.