ADAADepth: Adapting Data Augmentation and Attention for Self-Supervised Monocular Depth Estimation
Vinay Kaushik, Kartik Jindgar, Brejesh Lall

TL;DR
This paper introduces ADAADepth, a novel self-supervised monocular depth estimation method that combines data augmentation, relational self-attention, and progressive training to improve accuracy and robustness, achieving state-of-the-art results.
Contribution
The paper proposes a new framework integrating depth augmentation, a relational self-attention module, and progressive training for enhanced self-supervised monocular depth estimation.
Findings
Achieves state-of-the-art results on KITTI dataset.
Demonstrates better generalization on Make3D dataset.
Uses fewer trainable parameters than existing methods.
Abstract
Self-supervised learning of depth has been a highly studied topic of research as it alleviates the requirement of having ground truth annotations for predicting depth. Depth is learnt as an intermediate solution to the task of view synthesis, utilising warped photometric consistency. Although it gives good results when trained using stereo data, the predicted depth is still sensitive to noise, illumination changes and specular reflections. Also, occlusion can be tackled better by learning depth from a single camera. We propose ADAA, utilising depth augmentation as depth supervision for learning accurate and robust depth. We propose a relational self-attention module that learns rich contextual features and further enhances depth results. We also optimize the auto-masking strategy across all losses by enforcing L1 regularisation over mask. Our novel progressive training strategy first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
