3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction

Jongmin Lee; Minsu Cho

arXiv:2411.00543·cs.CV·November 5, 2024

3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction

Jongmin Lee, Minsu Cho

PDF

Open Access 1 Video

TL;DR

This paper introduces a frequency-domain method for 3D pose estimation that directly predicts Wigner-D coefficients, enabling more accurate and data-efficient orientation predictions aligned with spherical CNNs.

Contribution

The paper proposes a novel SO(3)-equivariant approach that predicts Wigner-D coefficients directly in the frequency domain, overcoming limitations of spatial parametrizations.

Findings

01

Achieves state-of-the-art accuracy on ModelNet10-SO(3) and PASCAL3D+ benchmarks.

02

Demonstrates improved robustness and data efficiency over existing methods.

03

Provides a frequency-domain regression loss for better pose estimation.

Abstract

Determining the 3D orientations of an object in an image, known as single-image pose estimation, is a crucial task in 3D vision applications. Existing methods typically learn 3D rotations parametrized in the spatial domain using Euler angles or quaternions, but these representations often introduce discontinuities and singularities. SO(3)-equivariant networks enable the structured capture of pose patterns with data-efficient learning, but the parametrizations in spatial domain are incompatible with their architecture, particularly spherical CNNs, which operate in the frequency domain to enhance computational efficiency. To overcome these issues, we propose a frequency-domain approach that directly predicts Wigner-D coefficients for 3D rotation regression, aligning with the operations of spherical CNNs. Our SO(3)-equivariant pose harmonics predictor overcomes the limitations of spatial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction· slideslive

Taxonomy

TopicsImage and Object Detection Techniques · Hand Gesture Recognition Systems · Robot Manipulation and Learning