Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using   Visual Foundation Models

Madhu Vankadari; Samuel Hodgson; Sangyun Shin; Kaichen Zhou Andrew; Markham; and Niki Trigoni

arXiv:2405.11158·cs.CV·May 21, 2024

Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models

Madhu Vankadari, Samuel Hodgson, Sangyun Shin, Kaichen Zhou Andrew, Markham, and Niki Trigoni

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-supervised stereo depth estimation method tailored for nighttime conditions, leveraging pretrained visual foundation models and novel masking and evaluation techniques to improve accuracy in challenging low-light scenarios.

Contribution

We propose a new approach that uses visual foundation models for feature extraction, a masking strategy to handle photometric violations, and novel evaluation metrics for nighttime stereo depth estimation.

Findings

01

Significant improvement on Oxford RobotCar and Multi-Spectral Stereo datasets.

02

Robust depth estimation in low-visibility and nighttime conditions.

03

Enhanced evaluation metrics for depth accuracy.

Abstract

Self-supervised depth estimation algorithms rely heavily on frame-warping relationships, exhibiting substantial performance degradation when applied in challenging circumstances, such as low-visibility and nighttime scenarios with varying illumination conditions. Addressing this challenge, we introduce an algorithm designed to achieve accurate self-supervised stereo depth estimation focusing on nighttime conditions. Specifically, we use pretrained visual foundation models to extract generalised features across challenging scenes and present an efficient method for matching and integrating these features from stereo frames. Moreover, to prevent pixels violating photometric consistency assumption from negatively affecting the depth predictions, we propose a novel masking approach designed to filter out such pixels. Lastly, addressing weaknesses in the evaluation of current depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

madhubabuv/dtd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques