GCNDepth: Self-supervised Monocular Depth Estimation based on Graph Convolutional Network

Armin Masoumian, Hatem A. Rashwan, Saddam Abdulwahab, Julian Cristiano, and Domenec Puig

arXiv:2112.06782 · cs.CV · December 14, 2021 · 6 cites

TL;DR

This paper introduces GCNDepth, a self-supervised monocular depth estimation method that uses graph convolutional networks to better preserve object geometry, achieving high accuracy with fewer trainable parameters than existing solutions.

Contribution

It proposes a novel GCN-based architecture for depth estimation that captures topological structures, improving accuracy and efficiency over traditional CNN methods.

Findings

01. Achieved 89% accuracy on the KITTI and Make3D datasets.

02. Reduced trainable parameters by 40% compared to the state of the art.

03. Provided depth estimation results comparable to or better than existing methods.

Abstract

Depth estimation is a challenging task in 3D reconstruction that improves the accuracy of environment sensing. This work presents a new solution with a set of improvements that increase the quantitative and qualitative understanding of depth maps compared to existing methods. Recently, convolutional neural networks (CNNs) have demonstrated an extraordinary ability to estimate depth maps from monocular videos. However, traditional CNNs do not support topological structure, and they can work only on regular image regions with fixed size and weights. Graph convolutional networks (GCNs), on the other hand, can handle convolution on non-Euclidean data and can be applied to irregular image regions within a topological structure. Therefore, to preserve the geometric appearance and distribution of objects, we aim to exploit GCNs for a self-supervised depth…
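To make the contrast with regular-grid convolution concrete, here is a minimal sketch of one graph convolution step in plain Python. It assumes the widely used symmetric-normalized propagation rule, ReLU(D^-1/2 (A + I) D^-1/2 H W); the abstract does not say which GCN variant GCNDepth uses, so this illustrates the general idea and is not the paper's architecture. The function names and the three-node toy graph are hypothetical.

```python
import math

def normalized_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2 for a square adjacency matrix."""
    n = len(adj)
    # Add self-loops so each node keeps its own feature during aggregation.
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Plain nested-list matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def graph_conv(adj, features, weights):
    """One graph convolution step: ReLU(A_norm @ H @ W)."""
    out = matmul(matmul(normalized_adjacency(adj), features), weights)
    return [[max(0.0, v) for v in row] for row in out]

# Hypothetical toy graph: three nodes connected in a chain 0 - 1 - 2.
# Unlike a fixed image kernel, the neighborhood of each node is given
# by the adjacency matrix and can differ from node to node.
adj = [[0, 1, 0],
       [1, 0, 1],
       [0, 1, 0]]
features = [[1.0], [2.0], [3.0]]   # one scalar feature per node
weights = [[1.0]]                  # assumed 1x1 weight matrix
result = graph_conv(adj, features, weights)
```

Each output row mixes a node's own feature with those of its neighbors, weighted by node degree. This is how the operation extends to irregular regions where no fixed-size kernel applies.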

Code & Models

Repositories

arminmasoumian/gcndepth
PyTorch · Official

Taxonomy

Topics: Advanced Vision and Imaging · Human Pose and Action Recognition · 3D Surveying and Cultural Heritage

Methods: Convolution · Graph Convolutional Network