MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction
Nathaniel Simon, Anirudha Majumdar

TL;DR
MonoNav enables small MAVs to perform accurate 3D mapping and navigation using only monocular cameras and offboard computation, significantly improving safety in cluttered environments.
Contribution
This work introduces MonoNav, a novel system that combines monocular depth estimation and fusion for real-time 3D mapping and navigation on tiny MAVs.
Findings
MonoNav reduces collision rate by a factor of 4 compared to end-to-end methods.
MonoNav achieves fast indoor navigation at 0.5 m/s in cluttered environments.
The system demonstrates effective 3D scene reconstruction using off-the-shelf neural networks.
Abstract
A major challenge in deploying the smallest of Micro Aerial Vehicle (MAV) platforms (< 100 g) is their inability to carry sensors that provide high-resolution metric depth information (e.g., LiDAR or stereo cameras). Current systems rely on end-to-end learning or heuristic approaches that directly map images to control inputs, and struggle to fly fast in unknown environments. In this work, we ask the following question: using only a monocular camera, optical odometry, and offboard computation, can we create metrically accurate maps to leverage the powerful path planning and navigation approaches employed by larger state-of-the-art robotic systems to achieve robust autonomy in unknown environments? We present MonoNav: a fast 3D reconstruction and navigation stack for MAVs that leverages recent advances in depth prediction neural networks to enable metrically accurate 3D scene…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robotic Path Planning Algorithms · Advanced Vision and Imaging
