CroMo: Cross-Modal Learning for Monocular Depth Estimation
Yannick Verdi\'e, Jifei Song, Barnab\'e Mas, Benjamin Busam, Ale\v{s}, Leonardis, Steven McDonagh

TL;DR
This paper introduces CroMo, a novel cross-modal learning approach for monocular depth estimation that integrates polarisation, ToF, and structured-light signals, utilizing a new dataset and self-supervised training to improve accuracy and robustness.
Contribution
We propose a new pipeline connecting scene geometry with multiple sensor signals and introduce CroMo, the first dataset with synchronized multimodal depth data for training and evaluation.
Findings
Outperforms existing monocular depth estimation methods
Effectively combines polarisation, ToF, and structured-light signals
Demonstrates robustness on challenging video scenes
Abstract
Learning-based depth estimation has witnessed recent progress in multiple directions; from self-supervision using monocular video to supervised methods offering highest accuracy. Complementary to supervision, further boosts to performance and robustness are gained by combining information from multiple signals. In this paper we systematically investigate key trade-offs associated with sensor and modality design choices as well as related model training strategies. Our study leads us to a new method, capable of connecting modality-specific advantages from polarisation, Time-of-Flight and structured-light inputs. We propose a novel pipeline capable of estimating depth from monocular polarisation for which we evaluate various training signals. The inversion of differentiable analytic models thereby connects scene geometry with polarisation and ToF signals and enables self-supervised and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications
