When Regression Meets Manifold Learning for Object Recognition and Pose Estimation

Mai Bui; Sergey Zakharov; Shadi Albarqouni; Slobodan Ilic; Nassir Navab

arXiv:1805.06400·cs.CV·April 19, 2019

TL;DR

This paper introduces a multi-task learning framework that combines manifold descriptor learning with pose regression for object recognition and pose estimation from depth images, reporting an almost 30% increase in relative pose accuracy.

Contribution

It presents a combined approach that integrates manifold learning with pose regression, yielding more discriminative view descriptors and more accurate pose estimates.

Findings

1. Almost 30% increase in relative pose accuracy

2. More discriminative view descriptors

3. Improved object recognition and pose retrieval

Abstract

In this work, we propose a method for object recognition and pose estimation from depth images using convolutional neural networks. Previous methods addressing this problem rely on manifold learning to learn low-dimensional viewpoint descriptors and employ them in a nearest neighbor (NN) search on an estimated descriptor space. In contrast, we create an efficient multi-task learning framework combining manifold descriptor learning and pose regression. By combining the strengths of manifold learning using triplet loss and pose regression, we can either estimate the pose directly, reducing the complexity compared to NN search, or use the learned descriptor for NN descriptor matching. Through in-depth experimental evaluation of the novel loss function, we observed that the view descriptors learned by the network are much more discriminative, resulting in an almost 30% increase regarding relative pose…
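
To make the idea in the abstract concrete, the following is a minimal NumPy sketch of a loss that sums a triplet term on descriptors with a pose regression term, plus the NN lookup that direct regression would replace. It is not the authors' implementation: the function names, the fixed `margin`, the `weight` balancing the two terms, the squared Euclidean descriptor distance, and the unit-quaternion pose representation are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np


def triplet_loss(anchor, positive, negative, margin=0.01):
    """Hinge triplet loss on squared Euclidean descriptor distances.

    The fixed margin is an assumption; the abstract does not state one.
    """
    d_pos = np.sum((anchor - positive) ** 2, axis=1)
    d_neg = np.sum((anchor - negative) ** 2, axis=1)
    return float(np.mean(np.maximum(0.0, d_pos - d_neg + margin)))


def pose_regression_loss(q_pred, q_true):
    """Mean Euclidean distance between normalized predictions and targets.

    Assumes poses are unit quaternions (hypothetical choice). This simple
    form ignores that q and -q describe the same rotation.
    """
    q_pred = q_pred / np.linalg.norm(q_pred, axis=1, keepdims=True)
    return float(np.mean(np.linalg.norm(q_pred - q_true, axis=1)))


def multi_task_loss(anchor, positive, negative, q_pred, q_true,
                    weight=1.0, margin=0.01):
    """Descriptor (manifold) term plus a weighted pose regression term."""
    return (triplet_loss(anchor, positive, negative, margin)
            + weight * pose_regression_loss(q_pred, q_true))


def nearest_neighbor_pose(query, db_descriptors, db_poses):
    """Return the pose stored with the database descriptor closest to query."""
    dists = np.sum((db_descriptors - query) ** 2, axis=1)
    return db_poses[int(np.argmin(dists))]


if __name__ == "__main__":
    anchor = np.array([[0.0, 0.0]])
    positive = np.array([[0.0, 0.0]])
    negative = np.array([[1.0, 0.0]])
    q_pred = np.array([[2.0, 0.0, 0.0, 0.0]])  # unnormalized network output
    q_true = np.array([[1.0, 0.0, 0.0, 0.0]])
    print(multi_task_loss(anchor, positive, negative, q_pred, q_true))

    db_desc = np.array([[0.0, 0.0], [1.0, 1.0]])
    db_pose = np.array([[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]])
    print(nearest_neighbor_pose(np.array([0.9, 1.0]), db_desc, db_pose))
```

In this sketch, a network trained with `multi_task_loss` would expose both outputs at test time: the regressed quaternion can be used directly, or the descriptor can be passed to `nearest_neighbor_pose`, whose cost grows with the size of the descriptor database.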
