Siamese Regression Networks with Efficient mid-level Feature Extraction for 3D Object Pose Estimation
Andreas Doumanoglou, Vassileios Balntas, Rigas Kouskouridas, Tae-Kyun, Kim

TL;DR
This paper introduces a Siamese Regression Network that directly estimates 3D object poses from images, leveraging pose-guided feature learning to improve accuracy, especially under occlusions, outperforming existing methods.
Contribution
It presents an end-to-end Siamese network framework for direct 3D pose regression, enhancing feature discriminability and robustness against occlusions.
Findings
Outperforms state-of-the-art methods in 3D pose estimation.
Effective under severe occlusions on a novel hand-object dataset.
Enables direct pose regression with improved feature learning.
Abstract
In this paper we tackle the problem of estimating the 3D pose of object instances, using convolutional neural networks. State of the art methods usually solve the challenging problem of regression in angle space indirectly, focusing on learning discriminative features that are later fed into a separate architecture for 3D pose estimation. In contrast, we propose an end-to-end learning framework for directly regressing object poses by exploiting Siamese Networks. For a given image pair, we enforce a similarity measure between the representation of the sample images in the feature and pose space respectively, that is shown to boost regression performance. Furthermore, we argue that our pose-guided feature learning using our Siamese Regression Network generates more discriminative features that outperform the state of the art. Last, our feature learning formulation provides the ability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · 3D Shape Modeling and Analysis · Robot Manipulation and Learning
