Attention-Guided Multi-Scale Local Reconstruction for Point Clouds via Masked Autoencoder Self-Supervised Learning

Xin Cao; Haoyu Wang; Yuzhu Mao; Xinda Liu; Linzhi Su; Kang Li

arXiv:2507.04084·cs.GR·July 8, 2025

Attention-Guided Multi-Scale Local Reconstruction for Point Clouds via Masked Autoencoder Self-Supervised Learning

Xin Cao, Haoyu Wang, Yuzhu Mao, Xinda Liu, Linzhi Su, Kang Li

PDF

TL;DR

PointAMaLR introduces a novel self-supervised learning framework for point clouds that leverages attention-guided multi-scale local reconstruction and a local attention module to improve feature representation and reconstruction accuracy across multiple datasets.

Contribution

It proposes a hierarchical multi-scale local reconstruction method with attention guidance and a local attention module, enhancing feature learning in self-supervised point cloud processing.

Findings

01

Superior accuracy in classification and reconstruction on ModelNet and ShapeNet.

02

Highly competitive performance on real-world datasets like ScanObjectNN and S3DIS.

03

Effective multi-scale semantic understanding demonstrated through experiments.

Abstract

Self-supervised learning has emerged as a prominent research direction in point cloud processing. While existing models predominantly concentrate on reconstruction tasks at higher encoder layers, they often neglect the effective utilization of low-level local features, which are typically employed solely for activation computations rather than directly contributing to reconstruction tasks. To overcome this limitation, we introduce PointAMaLR, a novel self-supervised learning framework that enhances feature representation and processing accuracy through attention-guided multi-scale local reconstruction. PointAMaLR implements hierarchical reconstruction across multiple local regions, with lower layers focusing on fine-scale feature restoration while upper layers address coarse-scale feature reconstruction, thereby enabling complex inter-patch interactions. Furthermore, to augment feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.