RI-MAE: Rotation-Invariant Masked AutoEncoders for Self-Supervised Point Cloud Representation Learning
Kunming Su, Qiuxia Wu, Panpan Cai, Xiaogang Zhu, Xuequan Lu, Zhiyong, Wang, Kun Hu

TL;DR
RI-MAE introduces a rotation-invariant self-supervised learning framework for point clouds, utilizing a novel RI-Transformer and a dual-branch architecture to achieve robust, rotation-invariant representations and superior downstream task performance.
Contribution
The paper proposes RI-MAE, a novel rotation-invariant masked autoencoder with a dual-branch architecture and RI-Transformer for self-supervised point cloud learning, addressing rotation sensitivity.
Findings
Achieves state-of-the-art performance on downstream tasks.
Demonstrates robustness to rotational variations.
Introduces a novel rotation-invariant transformer architecture.
Abstract
Masked point modeling methods have recently achieved great success in self-supervised learning for point cloud data. However, these methods are sensitive to rotations and often exhibit sharp performance drops when encountering rotational variations. In this paper, we propose a novel Rotation-Invariant Masked AutoEncoders (RI-MAE) to address two major challenges: 1) achieving rotation-invariant latent representations, and 2) facilitating self-supervised reconstruction in a rotation-invariant manner. For the first challenge, we introduce RI-Transformer, which features disentangled geometry content, rotation-invariant relative orientation and position embedding mechanisms for constructing rotation-invariant point cloud latent space. For the second challenge, a novel dual-branch student-teacher architecture is devised. It enables the self-supervised learning via the reconstruction of masked…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
Topics3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis
