Pose from Shape: Deep Pose Estimation for Arbitrary 3D Objects

Yang Xiao; Xuchong Qiu; Pierre-Alain Langlois; Mathieu Aubry; Renaud; Marlet

arXiv:1906.05105·cs.CV·August 6, 2019·36 cites

Pose from Shape: Deep Pose Estimation for Arbitrary 3D Objects

Yang Xiao, Xuchong Qiu, Pierre-Alain Langlois, Mathieu Aubry, Renaud, Marlet

PDF

Open Access 2 Repos

TL;DR

This paper introduces a generic deep pose estimation method that predicts object pose from shape without prior training on specific categories, enabling robust interaction with new objects in real-world scenarios.

Contribution

It presents a shape-conditioned neural network for pose estimation that generalizes across categories without needing category-specific training.

Findings

01

Outperforms state-of-the-art on Pascal3D+, ObjectNet3D, Pix3D

02

Generalizes to unseen object types like animals from ImageNet

03

Effective on natural and man-made objects

Abstract

Most deep pose estimation methods need to be trained for specific object instances or categories. In this work we propose a completely generic deep pose estimation approach, which does not require the network to have been trained on relevant categories, nor objects in a category to have a canonical pose. We believe this is a crucial step to design robotic systems that can interact with new objects in the wild not belonging to a predefined category. Our main insight is to dynamically condition pose estimation with a representation of the 3D shape of the target object. More precisely, we train a Convolutional Neural Network that takes as input both a test image and a 3D model, and outputs the relative 3D pose of the object in the input image with respect to the 3D model. We demonstrate that our method boosts performances for supervised category pose estimation on standard benchmarks,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Human Pose and Action Recognition