ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding
Linshuang Diao, Sensen Song, Yurong Qian, Dayong Ren

TL;DR
ZigzagPointMamba introduces a spatially continuous token sequencing and semantic-aware masking strategy to improve point cloud understanding in self-supervised learning, leading to better downstream task performance.
Contribution
It proposes ZigzagPointMamba with a zigzag scan path and Semantic-Siamese Masking Strategy to enhance spatial and semantic modeling in point cloud self-supervised learning.
Findings
Achieves 1.59% mIoU improvement on ShapeNetPart.
Improves classification accuracy on ModelNet40 by 0.4%.
Enhances ScanObjectNN subset accuracies by up to 1.22%.
Abstract
State Space models (SSMs) such as PointMamba enable efficient feature extraction for point cloud self-supervised learning with linear complexity, outperforming Transformers in computational efficiency. However, existing PointMamba-based methods depend on complex token ordering and random masking, which disrupt spatial continuity and local semantic correlations. We propose ZigzagPointMamba to tackle these challenges. The core of our approach is a simple zigzag scan path that globally sequences point cloud tokens, enhancing spatial continuity by preserving the proximity of spatially adjacent point tokens. Nevertheless, random masking undermines local semantic modeling in self-supervised learning. To address this, we introduce a Semantic-Siamese Masking Strategy (SMS), which masks semantically similar tokens to facilitate reconstruction by integrating local features of original and similar…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications · Image Processing and 3D Reconstruction
