PointLAMA: Latent Attention meets Mamba for Efficient Point Cloud Pretraining

Xuanyu Lin; Xiaona Zeng; Xianwei Zheng; Xutao Li

arXiv:2507.17296·cs.CV·July 24, 2025

PointLAMA: Latent Attention meets Mamba for Efficient Point Cloud Pretraining

Xuanyu Lin, Xiaona Zeng, Xianwei Zheng, Xutao Li

PDF

Open Access

TL;DR

PointLAMA introduces a novel pretraining framework for point clouds that combines task-aware serialization, a hybrid encoder with Latent Attention and Mamba blocks, and a diffusion mechanism, improving efficiency and local feature capture.

Contribution

It presents a new point cloud pretraining method integrating Latent Attention with Mamba, enhancing local geometric modeling while maintaining efficiency.

Findings

01

Achieves competitive results on benchmark datasets.

02

Uses minimal parameters and FLOPs.

03

Enhances local context modeling in point cloud tasks.

Abstract

Mamba has recently gained widespread attention as a backbone model for point cloud modeling, leveraging a state-space architecture that enables efficient global sequence modeling with linear complexity. However, its lack of local inductive bias limits its capacity to capture fine-grained geometric structures in 3D data. To address this limitation, we propose \textbf{PointLAMA}, a point cloud pretraining framework that combines task-aware point cloud serialization, a hybrid encoder with integrated Latent Attention and Mamba blocks, and a conditional diffusion mechanism built upon the Mamba backbone. Specifically, the task-aware point cloud serialization employs Hilbert/Trans-Hilbert space-filling curves and axis-wise sorting to structurally align point tokens for classification and segmentation tasks, respectively. Our lightweight Latent Attention block features a Point-wise Multi-head…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction · 3D Shape Modeling and Analysis