MobileXNet: An Efficient Convolutional Neural Network for Monocular   Depth Estimation

Xingshuai Dong; Matthew A. Garratt; Sreenatha G. Anavatti; and Hussein; A. Abbass

arXiv:2111.12334·cs.RO·November 25, 2021

MobileXNet: An Efficient Convolutional Neural Network for Monocular Depth Estimation

Xingshuai Dong, Matthew A. Garratt, Sreenatha G. Anavatti, and Hussein, A. Abbass

PDF

Open Access

TL;DR

MobileXNet is a lightweight CNN designed for monocular depth estimation that balances accuracy and speed, enabling real-time performance on less powerful hardware while maintaining competitive results.

Contribution

The paper introduces a novel shallow encoder-decoder CNN architecture that improves speed without sacrificing accuracy in monocular depth estimation.

Findings

01

Achieves comparable accuracy to state-of-the-art deep models

02

Runs significantly faster on a single GPU

03

Performs well across multiple datasets

Abstract

Depth is a vital piece of information for autonomous vehicles to perceive obstacles. Due to the relatively low price and small size of monocular cameras, depth estimation from a single RGB image has attracted great interest in the research community. In recent years, the application of Deep Neural Networks (DNNs) has significantly boosted the accuracy of monocular depth estimation (MDE). State-of-the-art methods are usually designed on top of complex and extremely deep network architectures, which require more computational resources and cannot run in real-time without using high-end GPUs. Although some researchers tried to accelerate the running speed, the accuracy of depth estimation is degraded because the compressed model does not represent images well. In addition, the inherent characteristic of the feature extractor used by the existing approaches results in severe spatial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques