Deep Projective Rotation Estimation through Relative Supervision

Brian Okorn; Chuer Pan; Martial Hebert; David Held

arXiv:2211.11182·cs.CV·November 22, 2022

Deep Projective Rotation Estimation through Relative Supervision

Brian Okorn, Chuer Pan, Martial Hebert, David Held

PDF

Open Access

TL;DR

This paper introduces a novel self-supervised learning algorithm for estimating object orientations from images, using Modified Rodrigues Parameters to improve convergence in the non-convex rotational space.

Contribution

It proposes a new algorithm that projects $SO(3)$ onto $ ^3$ for better optimization, enabling effective self-supervised orientation estimation.

Findings

01

Faster convergence to a consistent orientation frame.

02

Effective in both direct rotation optimization and CNN-based prediction.

03

Validated on rotational averaging tasks.

Abstract

Orientation estimation is the core to a variety of vision and robotics tasks such as camera and object pose estimation. Deep learning has offered a way to develop image-based orientation estimators; however, such estimators often require training on a large labeled dataset, which can be time-intensive to collect. In this work, we explore whether self-supervised learning from unlabeled data can be used to alleviate this issue. Specifically, we assume access to estimates of the relative orientation between neighboring poses, such that can be obtained via a local alignment method. While self-supervised learning has been used successfully for translational object keypoints, in this work, we show that naively applying relative supervision to the rotational group $S O (3)$ will often fail to converge due to the non-convexity of the rotational space. To tackle this challenge, we propose a new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage · Advanced Vision and Imaging

Methodsfail