On Robust Cross-View Consistency in Self-Supervised Monocular Depth Estimation
Haimei Zhao, Jing Zhang, Zhuo Chen, Bo Yuan, Dacheng Tao

TL;DR
This paper introduces robust cross-view consistency methods for self-supervised monocular depth estimation, improving robustness against scene challenges like illumination changes and occlusions by aligning features and voxel densities.
Contribution
It proposes Depth Feature Alignment and Voxel Density Alignment losses, shifting from point-to-point to region-to-region alignment for enhanced robustness in depth estimation.
Findings
Outperforms state-of-the-art methods on outdoor benchmarks
Demonstrates robustness in challenging scenes with occlusions and lighting variations
Validates effectiveness through extensive ablation studies
Abstract
Remarkable progress has been made in self-supervised monocular depth estimation (SS-MDE) by exploring cross-view consistency, e.g., photometric consistency and 3D point cloud consistency. However, they are very vulnerable to illumination variance, occlusions, texture-less regions, as well as moving objects, making them not robust enough to deal with various scenes. To address this challenge, we study two kinds of robust cross-view consistency in this paper. Firstly, the spatial offset field between adjacent frames is obtained by reconstructing the reference frame from its neighbors via deformable alignment, which is used to align the temporal depth features via a Depth Feature Alignment (DFA) loss. Secondly, the 3D point clouds of each reference frame and its nearby frames are calculated and transformed into voxel space, where the point density in each voxel is calculated and aligned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image Processing Techniques and Applications
MethodsALIGN · Direct Feedback Alignment
