TL;DR
This paper introduces a temporal consistency framework for monocular depth estimation in mobile robots, utilizing wheel odometry and optical flow to improve stability and accuracy over time.
Contribution
It presents a novel method combining pose estimation, sparse depth triangulation, and Bayesian updates to enhance temporal consistency in monocular depth estimation.
Findings
Achieves stable depth predictions across frames in multiple datasets.
Demonstrates robustness to abrupt depth range changes.
Outperforms existing methods in accuracy and consistency.
Abstract
Monocular depth estimation (MDE) has been widely adopted in the perception systems of autonomous vehicles and mobile robots. However, existing approaches often struggle to maintain temporal consistency in depth estimation across consecutive frames. This inconsistency not only causes jitter but can also lead to estimation failures when the depth range changes abruptly. To address these challenges, this paper proposes a consistency-aware monocular depth estimation framework that leverages wheel odometry from a mobile robot to achieve stable and coherent depth predictions over time. Specifically, we estimate camera pose and sparse depth from triangulation using optical flow between consecutive frames. The sparse depth estimates are used to update a recursive Bayesian estimate of the metric scale, which is then applied to rescale the relative depth predicted by a pre-trained depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
