MORE: Simultaneous Multi-View 3D Object Recognition and Pose Estimation

Tommaso Parisotto; Subhaditya Mukherjee; Hamidreza Kasaei

arXiv:2103.09863·cs.RO·April 10, 2023

MORE: Simultaneous Multi-View 3D Object Recognition and Pose Estimation

Tommaso Parisotto, Subhaditya Mukherjee, Hamidreza Kasaei

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep learning approach that simultaneously recognizes 3D objects and estimates their pose from multiple views, improving robotic interaction capabilities in real-world scenarios.

Contribution

It presents a novel method combining view selection and multi-task learning for concurrent object recognition and pose estimation.

Findings

01

Achieved high accuracy in object recognition and pose estimation.

02

Demonstrated effectiveness in a real-life robotic scenario.

03

Developed a view prediction model for optimal multi-view inputs.

Abstract

Simultaneous object recognition and pose estimation are two key functionalities for robots to safely interact with humans as well as environments. Although both object recognition and pose estimation use visual input, most state-of-the-art tackles them as two separate problems since the former needs a view-invariant representation while object pose estimation necessitates a view-dependent description. Nowadays, multi-view Convolutional Neural Network (MVCNN) approaches show state-of-the-art classification performance. Although MVCNN object recognition has been widely explored, there has been very little research on multi-view object pose estimation methods, and even less on addressing these two problems simultaneously. The pose of virtual cameras in MVCNN methods is often predefined in advance, leading to bound the application of such approaches. In this paper, we propose an approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

subhadityamukherjee/more_mvcnn
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Human Pose and Action Recognition