When Regression Meets Manifold Learning for Object Recognition and Pose Estimation
Mai Bui, Sergey Zakharov, Shadi Albarqouni, Slobodan Ilic, Nassir, Navab

TL;DR
This paper introduces a multi-task learning framework combining manifold descriptor learning and pose regression for object recognition and pose estimation from depth images, achieving significant accuracy improvements.
Contribution
It presents a novel combined approach that integrates manifold learning with pose regression, enhancing discriminative view descriptors and pose estimation accuracy.
Findings
30% increase in relative pose accuracy
Improved discriminative view descriptors
Enhanced object recognition and pose retrieval
Abstract
In this work, we propose a method for object recognition and pose estimation from depth images using convolutional neural networks. Previous methods addressing this problem rely on manifold learning to learn low dimensional viewpoint descriptors and employ them in a nearest neighbor search on an estimated descriptor space. In comparison we create an efficient multi-task learning framework combining manifold descriptor learning and pose regression. By combining the strengths of manifold learning using triplet loss and pose regression, we could either estimate the pose directly reducing the complexity compared to NN search, or use learned descriptor for the NN descriptor matching. By in depth experimental evaluation of the novel loss function we observed that the view descriptors learned by the network are much more discriminative resulting in almost 30% increase regarding relative pose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
