Exploring contextual modeling with linear complexity for point cloud   segmentation

Yong Xien Chng; Xuchong Qiu; Yizeng Han; Yifan Pu; Jiewei Cao; Gao; Huang

arXiv:2410.21211·cs.CV·October 29, 2024

Exploring contextual modeling with linear complexity for point cloud segmentation

Yong Xien Chng, Xuchong Qiu, Yizeng Han, Yifan Pu, Jiewei Cao, Gao, Huang

PDF

Open Access

TL;DR

This paper introduces MEEPO, a novel point cloud segmentation architecture that combines CNN and Mamba, achieving state-of-the-art accuracy with improved efficiency by addressing Mamba's limitations.

Contribution

The paper identifies key components for effective point cloud segmentation and enhances Mamba with bidirectional context capture, leading to superior performance and efficiency.

Findings

01

MEEPO surpasses PTv3 by up to +0.8 mIoU on benchmarks.

02

MEEPO is 42.1% faster than previous methods.

03

MEEPO is 5.53x more memory efficient.

Abstract

Point cloud segmentation is an important topic in 3D understanding that has traditionally has been tackled using either the CNN or Transformer. Recently, Mamba has emerged as a promising alternative, offering efficient long-range contextual modeling capabilities without the quadratic complexity associated with Transformer's attention mechanisms. However, despite Mamba's potential, early efforts have all failed to achieve better performance than the best CNN-based and Transformer-based methods. In this work, we address this challenge by identifying the key components of an effective and efficient point cloud segmentation architecture. Specifically, we show that: 1) Spatial locality and robust contextual understanding are critical for strong performance, and 2) Mamba features linear computational complexity, offering superior data and inference efficiency compared to Transformers, while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Byte Pair Encoding · Layer Normalization · Residual Connection · Multi-Head Attention · Softmax · Adam