Self-supervised monocular depth estimation from oblique UAV videos

Logambal Madhuanand; Francesco Nex; Michael Ying Yang

arXiv:2012.10704·cs.CV·December 22, 2020

Self-supervised monocular depth estimation from oblique UAV videos

Logambal Madhuanand, Francesco Nex, Michael Ying Yang

PDF

1 Repo

TL;DR

This paper presents a self-supervised deep learning method for monocular depth estimation from UAV videos, using a novel architecture and contrastive loss to outperform existing approaches.

Contribution

It introduces a new architecture with 2D encoders and a 3D decoder, and employs a contrastive loss for improved depth estimation from UAV videos.

Findings

01

Outperforms state-of-the-art methods on UAVid dataset

02

Uses a novel 2D-3D CNN architecture for temporal feature extraction

03

Employs contrastive loss to enhance image reconstruction quality

Abstract

UAVs have become an essential photogrammetric measurement as they are affordable, easily accessible and versatile. Aerial images captured from UAVs have applications in small and large scale texture mapping, 3D modelling, object detection tasks, DTM and DSM generation etc. Photogrammetric techniques are routinely used for 3D reconstruction from UAV images where multiple images of the same scene are acquired. Developments in computer vision and deep learning techniques have made Single Image Depth Estimation (SIDE) a field of intense research. Using SIDE techniques on UAV images can overcome the need for multiple images for 3D reconstruction. This paper aims to estimate depth from a single UAV aerial image using deep learning. We follow a self-supervised learning approach, Self-Supervised Monocular Depth Estimation (SMDE), which does not need ground truth depth or any extra information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

logsanand/Monocular-depth-UAVimages
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Methods3 Dimensional Convolutional Neural Network