A Systematic Literature Review on Deep Learning-based Depth Estimation   in Computer Vision

Ali Rohan; Md Junayed Hasan; Andrei Petrovski

arXiv:2501.05147·cs.CV·January 10, 2025

A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision

Ali Rohan, Md Junayed Hasan, Andrei Petrovski

PDF

Open Access

TL;DR

This systematic review analyzes deep learning-based depth estimation methods in computer vision, highlighting datasets, models, evaluation metrics, and challenges like ground truth data scarcity, to synthesize current state-of-the-art research.

Contribution

It provides a comprehensive synthesis of DL-based depth estimation techniques, datasets, evaluation metrics, and challenges, filling a gap in existing reviews by focusing specifically on DE.

Findings

01

DL methods mainly focus on monocular, stereo, and multi-view depth estimation.

02

Top datasets include KITTI, NYU Depth V2, and Make 3D.

03

Ground truth data scarcity is a major challenge.

Abstract

Depth estimation (DE) provides spatial information about a scene and enables tasks such as 3D reconstruction, object detection, and scene understanding. Recently, there has been an increasing interest in using deep learning (DL)-based methods for DE. Traditional techniques rely on handcrafted features that often struggle to generalise to diverse scenes and require extensive manual tuning. However, DL models for DE can automatically extract relevant features from input data, adapt to various scene conditions, and generalise well to unseen environments. Numerous DL-based methods have been developed, making it necessary to survey and synthesize the state-of-the-art (SOTA). Previous reviews on DE have mainly focused on either monocular or stereo-based techniques, rather than comprehensively reviewing DE. Furthermore, to the best of our knowledge, there is no systematic literature review…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIndustrial Vision Systems and Defect Detection · Image Processing Techniques and Applications · Image and Object Detection Techniques

MethodsMax Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · U-Net · VGG-16 · Surrogate Lagrangian Relaxation · Balanced Selection