StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning
Chuxin Wang, Yixin Zha, Wenfei Yang, Tianzhu Zhang

TL;DR
StruMamba3D introduces a novel self-supervised learning approach for point clouds that preserves spatial dependencies and improves robustness to input length variations, achieving state-of-the-art results on standard benchmarks.
Contribution
It proposes a new paradigm combining spatial states and a sequence length-adaptive strategy to enhance SSM-based point cloud learning.
Findings
Achieves 95.1% accuracy on ModelNet40
Attains 92.75% accuracy on ScanObjectNN
Outperforms existing methods on four downstream tasks
Abstract
Recently, Mamba-based methods have demonstrated impressive performance in point cloud representation learning by leveraging State Space Model (SSM) with the efficient context modeling ability and linear complexity. However, these methods still face two key issues that limit the potential of SSM: Destroying the adjacency of 3D points during SSM processing and failing to retain long-sequence memory as the input length increases in downstream tasks. To address these issues, we propose StruMamba3D, a novel paradigm for self-supervised point cloud representation learning. It enjoys several merits. First, we design spatial states and use them as proxies to preserve spatial dependencies among points. Second, we enhance the SSM with a state-wise update strategy and incorporate a lightweight convolution to facilitate interactions between spatial states for efficient structure modeling. Third,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis
MethodsConvolution
