A benchmark with decomposed distribution shifts for 360 monocular depth estimation
Georgios Albanis, Nikolaos Zioulis, Petros Drakoulis, Federico, Alvarez, Dimitrios Zarpalas, Petros Daras

TL;DR
This paper introduces a benchmark for monocular depth estimation that decomposes distribution shifts into covariate, prior, and concept shifts, highlighting their unique challenges and the compounded difficulty when combined.
Contribution
It presents a novel benchmark that systematically analyzes different types of distribution shifts in monocular depth estimation, emphasizing their distinct and combined effects.
Findings
Each distribution shift type presents unique challenges.
Combining shifts leads to greater performance drops.
Standard approaches struggle with combined shifts.
Abstract
In this work we contribute a distribution shift benchmark for a computer vision task; monocular depth estimation. Our differentiation is the decomposition of the wider distribution shift of uncontrolled testing on in-the-wild data, to three distinct distribution shifts. Specifically, we generate data via synthesis and analyze them to produce covariate (color input), prior (depth output) and concept (their relationship) distribution shifts. We also synthesize combinations and show how each one is indeed a different challenge to address, as stacking them produces increased performance drops and cannot be addressed horizontally using standard approaches.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Robotics and Sensor-Based Localization
