Pose from Shape: Deep Pose Estimation for Arbitrary 3D Objects
Yang Xiao, Xuchong Qiu, Pierre-Alain Langlois, Mathieu Aubry, Renaud, Marlet

TL;DR
This paper introduces a generic deep pose estimation method that predicts object pose from shape without prior training on specific categories, enabling robust interaction with new objects in real-world scenarios.
Contribution
It presents a shape-conditioned neural network for pose estimation that generalizes across categories without needing category-specific training.
Findings
Outperforms state-of-the-art on Pascal3D+, ObjectNet3D, Pix3D
Generalizes to unseen object types like animals from ImageNet
Effective on natural and man-made objects
Abstract
Most deep pose estimation methods need to be trained for specific object instances or categories. In this work we propose a completely generic deep pose estimation approach, which does not require the network to have been trained on relevant categories, nor objects in a category to have a canonical pose. We believe this is a crucial step to design robotic systems that can interact with new objects in the wild not belonging to a predefined category. Our main insight is to dynamically condition pose estimation with a representation of the 3D shape of the target object. More precisely, we train a Convolutional Neural Network that takes as input both a test image and a 3D model, and outputs the relative 3D pose of the object in the input image with respect to the 3D model. We demonstrate that our method boosts performances for supervised category pose estimation on standard benchmarks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Human Pose and Action Recognition
