Attention-Guided Multi-Scale Local Reconstruction for Point Clouds via Masked Autoencoder Self-Supervised Learning
Xin Cao, Haoyu Wang, Yuzhu Mao, Xinda Liu, Linzhi Su, Kang Li

TL;DR
PointAMaLR introduces a novel self-supervised learning framework for point clouds that leverages attention-guided multi-scale local reconstruction and a local attention module to improve feature representation and reconstruction accuracy across multiple datasets.
Contribution
It proposes a hierarchical multi-scale local reconstruction method with attention guidance and a local attention module, enhancing feature learning in self-supervised point cloud processing.
Findings
Superior accuracy in classification and reconstruction on ModelNet and ShapeNet.
Highly competitive performance on real-world datasets like ScanObjectNN and S3DIS.
Effective multi-scale semantic understanding demonstrated through experiments.
Abstract
Self-supervised learning has emerged as a prominent research direction in point cloud processing. While existing models predominantly concentrate on reconstruction tasks at higher encoder layers, they often neglect the effective utilization of low-level local features, which are typically employed solely for activation computations rather than directly contributing to reconstruction tasks. To overcome this limitation, we introduce PointAMaLR, a novel self-supervised learning framework that enhances feature representation and processing accuracy through attention-guided multi-scale local reconstruction. PointAMaLR implements hierarchical reconstruction across multiple local regions, with lower layers focusing on fine-scale feature restoration while upper layers address coarse-scale feature reconstruction, thereby enabling complex inter-patch interactions. Furthermore, to augment feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
