Deep Projective Rotation Estimation through Relative Supervision
Brian Okorn, Chuer Pan, Martial Hebert, David Held

TL;DR
This paper introduces a novel self-supervised learning algorithm for estimating object orientations from images, using Modified Rodrigues Parameters to improve convergence in the non-convex rotational space.
Contribution
It proposes a new algorithm that projects $SO(3)$ onto $ ^3$ for better optimization, enabling effective self-supervised orientation estimation.
Findings
Faster convergence to a consistent orientation frame.
Effective in both direct rotation optimization and CNN-based prediction.
Validated on rotational averaging tasks.
Abstract
Orientation estimation is the core to a variety of vision and robotics tasks such as camera and object pose estimation. Deep learning has offered a way to develop image-based orientation estimators; however, such estimators often require training on a large labeled dataset, which can be time-intensive to collect. In this work, we explore whether self-supervised learning from unlabeled data can be used to alleviate this issue. Specifically, we assume access to estimates of the relative orientation between neighboring poses, such that can be obtained via a local alignment method. While self-supervised learning has been used successfully for translational object keypoints, in this work, we show that naively applying relative supervision to the rotational group will often fail to converge due to the non-convexity of the rotational space. To tackle this challenge, we propose a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Advanced Vision and Imaging
Methodsfail
