RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection

Cheng Lu; Mingqian Ji; Shanshan Zhang; Zhihao Li; Jian Yang

arXiv:2604.02903·cs.CV·April 6, 2026

RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection

Cheng Lu, Mingqian Ji, Shanshan Zhang, Zhihao Li, Jian Yang

PDF

TL;DR

RayMamba introduces a geometry-aware serialization method for voxel-based 3D detectors, enhancing long-range detection by preserving contextual neighborhoods in sparse LiDAR data, with consistent improvements shown in experiments.

Contribution

It proposes a novel ray-aligned serialization strategy that improves long-range 3D detection performance in sparse scenes, compatible with various detectors.

Findings

01

Achieves up to 2.49 mAP and 1.59 NDS gain on nuScenes at 40-50m range.

02

Improves VoxelNeXt performance on Argoverse 2 from 30.3 to 31.2 mAP.

03

Demonstrates consistent improvements across multiple benchmarks.

Abstract

Long-range 3D object detection remains challenging because LiDAR observations become highly sparse and fragmented in the far field, making reliable context modeling difficult for existing detectors. To address this issue, recent state space model (SSM)-based methods have improved long-range modeling efficiency. However, their effectiveness is still limited by generic serialization strategies that fail to preserve meaningful contextual neighborhoods in sparse scenes. To address this issue, we propose RayMamba, a geometry-aware plug-and-play enhancement for voxel-based 3D detectors. RayMamba organizes sparse voxels into sector-wise ordered sequences through a ray-aligned serialization strategy, which preserves directional continuity and occlusion-related context for subsequent Mamba-based modeling. It is compatible with both LiDAR-only and multimodal detectors, while introducing only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.