MonoMPC: Monocular Vision Based Navigation with Learned Collision Model and Risk-Aware Model Predictive Control
Basant Sharma, Prajyot Jadhav, Pranjal Paul, K.Madhava Krishna, Arun Kumar Singh

TL;DR
MonoMPC enables effective monocular vision navigation in cluttered environments by combining learned collision prediction with risk-aware model predictive control, reducing collisions and improving navigation success.
Contribution
It introduces a novel joint learning pipeline that calibrates uncertainty in a collision model using noisy depth estimates as context for risk-aware planning.
Findings
Reduced collision rate in real-world tests
Improved goal reaching success
Faster navigation in cluttered environments
Abstract
Navigating unknown environments with a single RGB camera is challenging, as the lack of depth information prevents reliable collision-checking. While some methods use estimated depth to build collision maps, we found that depth estimates from vision foundation models are too noisy for zero-shot navigation in cluttered environments. We propose an alternative approach: instead of using noisy estimated depth for direct collision-checking, we use it as a rich context input to a learned collision model. This model predicts the distribution of minimum obstacle clearance that the robot can expect for a given control sequence. At inference, these predictions inform a risk-aware MPC planner that minimizes estimated collision risk. We proposed a joint learning pipeline that co-trains the collision model and risk metric using both safe and unsafe trajectories. Crucially, our joint-training ensures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Robotics and Sensor-Based Localization · Robot Manipulation and Learning
