PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model
Zhang Zhang, Chao Sun, Chao Yue, Da Wen, Tianze Wang, Jianghao Leng

TL;DR
This paper introduces PillarMamba, a novel roadside point cloud detection framework that combines local and global context modeling using a hybrid state space model, significantly improving detection performance.
Contribution
It proposes a hybrid state-space block for roadside point cloud perception, enhancing local-global context integration and outperforming existing methods on a large-scale benchmark.
Findings
Outperforms state-of-the-art on DAIR-V2X-I benchmark
Enhances local-global context understanding in roadside point clouds
Efficiently fuses features across stages for better detection
Abstract
Serving the Intelligent Transport System (ITS) and Vehicle-to-Everything (V2X) tasks, roadside perception has received increasing attention in recent years, as it can extend the perception range of connected vehicles and improve traffic safety. However, roadside point cloud oriented 3D object detection has not been effectively explored. To some extent, the key to the performance of a point cloud detector lies in the receptive field of the network and the ability to effectively utilize the scene context. The recent emergence of Mamba, based on State Space Model (SSM), has shaken up the traditional convolution and transformers that have long been the foundational building blocks, due to its efficient global receptive field. In this work, we introduce Mamba to pillar-based roadside point cloud perception and propose a framework based on Cross-stage State-space Group (CSG), called…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Traffic Prediction and Management Techniques
MethodsSoftmax · Attention Is All You Need · Mamba: Linear-Time Sequence Modeling with Selective State Spaces · Convolution
