ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

Linshuang Diao; Sensen Song; Yurong Qian; Dayong Ren

arXiv:2505.21381·cs.CV·October 28, 2025

ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

Linshuang Diao, Sensen Song, Yurong Qian, Dayong Ren

PDF

Open Access

TL;DR

ZigzagPointMamba introduces a spatially continuous token sequencing and semantic-aware masking strategy to improve point cloud understanding in self-supervised learning, leading to better downstream task performance.

Contribution

It proposes ZigzagPointMamba with a zigzag scan path and Semantic-Siamese Masking Strategy to enhance spatial and semantic modeling in point cloud self-supervised learning.

Findings

01

Achieves 1.59% mIoU improvement on ShapeNetPart.

02

Improves classification accuracy on ModelNet40 by 0.4%.

03

Enhances ScanObjectNN subset accuracies by up to 1.22%.

Abstract

State Space models (SSMs) such as PointMamba enable efficient feature extraction for point cloud self-supervised learning with linear complexity, outperforming Transformers in computational efficiency. However, existing PointMamba-based methods depend on complex token ordering and random masking, which disrupt spatial continuity and local semantic correlations. We propose ZigzagPointMamba to tackle these challenges. The core of our approach is a simple zigzag scan path that globally sequences point cloud tokens, enhancing spatial continuity by preserving the proximity of spatially adjacent point tokens. Nevertheless, random masking undermines local semantic modeling in self-supervised learning. To address this, we introduce a Semantic-Siamese Masking Strategy (SMS), which masks semantically similar tokens to facilitate reconstruction by integrating local features of original and similar…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications · Image Processing and 3D Reconstruction