M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction
Luoxi Zhang, Pragyan Shrestha, Yu Zhou, Chun Xie, and Itaru Kitahara

TL;DR
M3D introduces a dual-stream, depth-integrated framework for single-view 3D reconstruction, significantly improving accuracy and detail preservation in complex scenes by balancing global and local feature extraction.
Contribution
The paper presents a novel dual-stream feature extraction method with depth integration, enhancing reconstruction quality over existing neural implicit approaches.
Findings
Achieves state-of-the-art reconstruction accuracy.
Enhances geometric consistency and detail preservation.
Effectively balances global and local feature extraction.
Abstract
The precise reconstruction of 3D objects from a single RGB image in complex scenes presents a critical challenge in virtual reality, autonomous driving, and robotics. Existing neural implicit 3D representation methods face significant difficulties in balancing the extraction of global and local features, particularly in diverse and complex environments, leading to insufficient reconstruction precision and quality. We propose M3D, a novel single-view 3D reconstruction framework, to tackle these challenges. This framework adopts a dual-stream feature extraction strategy based on Selective State Spaces to effectively balance the extraction of global and local features, thereby improving scene comprehension and representation precision. Additionally, a parallel branch extracts depth information, effectively integrating visual and geometric features to enhance reconstruction quality and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Vision and Imaging · Advanced Memory and Neural Computing
