MambaFusion: Adaptive State-Space Fusion for Multimodal 3D Object Detection
Venkatraman Narayanan, Bala Sai, Rahul Ahuja, Pratik Likhar, Varun Ravi Kumar, Senthil Yogamani

TL;DR
MambaFusion is a novel adaptive multi-modal 3D object detection framework that combines efficient state-space models, dynamic feature fusion, and structure-conditioned diffusion to achieve state-of-the-art performance in autonomous driving.
Contribution
It introduces a unified framework integrating selective state-space models, multi-modal token alignment, and diffusion-based reasoning for improved 3D perception.
Findings
Achieves new state-of-the-art on nuScenes benchmarks.
Operates with linear-time complexity.
Provides robust and interpretable 3D detection results.
Abstract
Reliable 3D object detection is fundamental to autonomous driving, and multimodal fusion algorithms using cameras and LiDAR remain a persistent challenge. Cameras provide dense visual cues but ill posed depth; LiDAR provides a precise 3D structure but sparse coverage. Existing BEV-based fusion frameworks have made good progress, but they have difficulties including inefficient context modeling, spatially invariant fusion, and reasoning under uncertainty. We introduce MambaFusion, a unified multi-modal detection framework that achieves efficient, adaptive, and physically grounded 3D perception. MambaFusion interleaves selective state-space models (SSMs) with windowed transformers to propagate the global context in linear time while preserving local geometric fidelity. A multi-modal token alignment (MTA) module and reliability-aware fusion gates dynamically re-weight camera-LiDAR features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Adversarial Robustness in Machine Learning
