MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications
Gasser Elazab, Torben Gr\"aber, Michael Unterreiner, Olaf Hellwich

TL;DR
MonoPP introduces a self-supervised monocular depth estimation approach that uses planar-parallax geometry and vehicle mounting data to produce accurate metric-scaled depth maps, outperforming prior methods on driving datasets.
Contribution
The paper presents a novel self-supervised MDE model that leverages planar-parallax geometry and vehicle mounting info to achieve metric-scale depth without extra supervision.
Findings
State-of-the-art results on KITTI benchmark
First self-supervised metric-scaled depth on Cityscapes
Effective scene reconstruction using planar-parallax geometry
Abstract
Self-supervised monocular depth estimation (MDE) has gained popularity for obtaining depth predictions directly from videos. However, these methods often produce scale invariant results, unless additional training signals are provided. Addressing this challenge, we introduce a novel self-supervised metric-scaled MDE model that requires only monocular video data and the camera's mounting position, both of which are readily available in modern vehicles. Our approach leverages planar-parallax geometry to reconstruct scene structure. The full pipeline consists of three main networks, a multi-frame network, a singleframe network, and a pose network. The multi-frame network processes sequential frames to estimate the structure of the static scene using planar-parallax geometry and the camera mounting position. Based on this reconstruction, it acts as a teacher, distilling knowledge such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Vision Systems and Defect Detection · Optical measurement and interference techniques · Image Processing Techniques and Applications
