TL;DR
This paper introduces Point Regress AutoEncoder (Point-RAE), a novel self-supervised learning method for 3D point clouds that improves representation learning by decoupling encoder and decoder functions and leveraging a new finetuning mode.
Contribution
The paper proposes a regressive autoencoder with a mask regressor and alignment constraint, enhancing 3D point cloud self-supervised learning and downstream task performance.
Findings
Achieves 90.28% accuracy on ScanObjectNN hardest split
Achieves 94.1% accuracy on ModelNet40
Outperforms existing self-supervised methods
Abstract
Masked Autoencoders (MAE) have demonstrated promising performance in self-supervised learning for both 2D and 3D computer vision. Nevertheless, existing MAE-based methods still have certain drawbacks. Firstly, the functional decoupling between the encoder and decoder is incomplete, which limits the encoder's representation learning ability. Secondly, downstream tasks solely utilize the encoder, failing to fully leverage the knowledge acquired through the encoder-decoder architecture in the pre-text task. In this paper, we propose Point Regress AutoEncoder (Point-RAE), a new scheme for regressive autoencoders for point cloud self-supervised learning. The proposed method decouples functions between the decoder and the encoder by introducing a mask regressor, which predicts the masked patch representation from the visible patch representation encoded by the encoder and the decoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
