MobileXNet: An Efficient Convolutional Neural Network for Monocular Depth Estimation
Xingshuai Dong, Matthew A. Garratt, Sreenatha G. Anavatti, and Hussein, A. Abbass

TL;DR
MobileXNet is a lightweight CNN designed for monocular depth estimation that balances accuracy and speed, enabling real-time performance on less powerful hardware while maintaining competitive results.
Contribution
The paper introduces a novel shallow encoder-decoder CNN architecture that improves speed without sacrificing accuracy in monocular depth estimation.
Findings
Achieves comparable accuracy to state-of-the-art deep models
Runs significantly faster on a single GPU
Performs well across multiple datasets
Abstract
Depth is a vital piece of information for autonomous vehicles to perceive obstacles. Due to the relatively low price and small size of monocular cameras, depth estimation from a single RGB image has attracted great interest in the research community. In recent years, the application of Deep Neural Networks (DNNs) has significantly boosted the accuracy of monocular depth estimation (MDE). State-of-the-art methods are usually designed on top of complex and extremely deep network architectures, which require more computational resources and cannot run in real-time without using high-end GPUs. Although some researchers tried to accelerate the running speed, the accuracy of depth estimation is degraded because the compressed model does not represent images well. In addition, the inherent characteristic of the feature extractor used by the existing approaches results in severe spatial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques
