RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point   Cloud Representation Learning

Kunming Su; Qiuxia Wu; Panpan Cai; Xiaogang Zhu; Xuequan Lu; Zhiyong; Wang; Kun Hu

arXiv:2409.00353·cs.CV·December 30, 2024

RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning

Kunming Su, Qiuxia Wu, Panpan Cai, Xiaogang Zhu, Xuequan Lu, Zhiyong, Wang, Kun Hu

PDF

Open Access 1 Repo 1 Video

TL;DR

RI-MAE introduces a rotation-invariant self-supervised learning framework for point clouds, utilizing a novel RI-Transformer and a dual-branch architecture to achieve robust, rotation-invariant representations and superior downstream task performance.

Contribution

The paper proposes RI-MAE, a novel rotation-invariant masked autoencoder with a dual-branch architecture and RI-Transformer for self-supervised point cloud learning, addressing rotation sensitivity.

Findings

01

Achieves state-of-the-art performance on downstream tasks.

02

Demonstrates robustness to rotational variations.

03

Introduces a novel rotation-invariant transformer architecture.

Abstract

Masked point modeling methods have recently achieved great success in self-supervised learning for point cloud data. However, these methods are sensitive to rotations and often exhibit sharp performance drops when encountering rotational variations. In this paper, we propose a novel Rotation-Invariant Masked AutoEncoders (RI-MAE) to address two major challenges: 1) achieving rotation-invariant latent representations, and 2) facilitating self-supervised reconstruction in a rotation-invariant manner. For the first challenge, we introduce RI-Transformer, which features disentangled geometry content, rotation-invariant relative orientation and position embedding mechanisms for constructing rotation-invariant point cloud latent space. For the second challenge, a novel dual-branch student-teacher architecture is devised. It enables the self-supervised learning via the reconstruction of masked…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kunmingsu07/ri-mae
pytorchOfficial

Videos

RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning· underline

Taxonomy

Topics3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis