RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection
Cheng Lu, Mingqian Ji, Shanshan Zhang, Zhihao Li, Jian Yang

TL;DR
RayMamba introduces a geometry-aware serialization method for voxel-based 3D detectors, enhancing long-range detection by preserving contextual neighborhoods in sparse LiDAR data, with consistent improvements shown in experiments.
Contribution
It proposes a novel ray-aligned serialization strategy that improves long-range 3D detection performance in sparse scenes, compatible with various detectors.
Findings
Achieves up to 2.49 mAP and 1.59 NDS gain on nuScenes at 40-50m range.
Improves VoxelNeXt performance on Argoverse 2 from 30.3 to 31.2 mAP.
Demonstrates consistent improvements across multiple benchmarks.
Abstract
Long-range 3D object detection remains challenging because LiDAR observations become highly sparse and fragmented in the far field, making reliable context modeling difficult for existing detectors. To address this issue, recent state space model (SSM)-based methods have improved long-range modeling efficiency. However, their effectiveness is still limited by generic serialization strategies that fail to preserve meaningful contextual neighborhoods in sparse scenes. To address this issue, we propose RayMamba, a geometry-aware plug-and-play enhancement for voxel-based 3D detectors. RayMamba organizes sparse voxels into sector-wise ordered sequences through a ray-aligned serialization strategy, which preserves directional continuity and occlusion-related context for subsequent Mamba-based modeling. It is compatible with both LiDAR-only and multimodal detectors, while introducing only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
