LION: Linear Group RNN for 3D Object Detection in Point Clouds
Zhe Liu, Jinghua Hou, Xinyu Wang, Xiaoqing Ye, Jingdong Wang,, Hengshuang Zhao, Xiang Bai

TL;DR
LION introduces a window-based linear group RNN framework for 3D object detection in point clouds, achieving state-of-the-art results by enhancing spatial modeling and densifying features in sparse data.
Contribution
The paper proposes a novel linear group RNN framework with spatial descriptors and densification strategies for improved 3D object detection in sparse point clouds.
Findings
LION achieves state-of-the-art performance on Waymo, nuScenes, Argoverse V2, and ONCE datasets.
The framework effectively integrates various linear RNN operators like Mamba, RWKV, and RetNet.
The method supports multiple linear RNN variants on the KITTI dataset for quick deployment.
Abstract
The benefit of transformers in large-scale 3D point cloud perception tasks, such as 3D object detection, is limited by their quadratic computation cost when modeling long-range relationships. In contrast, linear RNNs have low computational complexity and are suitable for long-range modeling. Toward this goal, we propose a simple and effective window-based framework built on LInear grOup RNN (i.e., perform linear RNN for grouped features) for accurate 3D object detection, called LION. The key property is to allow sufficient feature interaction in a much larger group than transformer-based methods. However, effectively applying linear group RNN to 3D object detection in highly sparse point clouds is not trivial due to its limitation in handling spatial modeling. To tackle this problem, we simply introduce a 3D spatial feature descriptor and integrate it into the linear group RNN operators…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
Topics3D Surveying and Cultural Heritage · Image Processing and 3D Reconstruction · Remote Sensing and LiDAR Applications
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces · Evolved Sign Momentum
