TL;DR
This paper introduces a novel, lightweight deep learning architecture for unsupervised monocular depth estimation that operates efficiently on CPUs and embedded systems, enabling real-time applications without high-power GPUs.
Contribution
The authors propose a new pyramid-based network architecture trained unsupervised, achieving comparable accuracy to state-of-the-art methods but with significantly reduced complexity and real-time performance on CPUs.
Findings
Achieves similar accuracy to top methods with only 6% of parameters.
Infers depth maps in about 1.7 seconds on Raspberry Pi 3.
Operates at over 8 Hz on standard CPU, enabling real-time depth estimation.
Abstract
Unsupervised depth estimation from a single image is a very attractive technique with several implications in robotic, autonomous navigation, augmented reality and so on. This topic represents a very challenging task and the advent of deep learning enabled to tackle this problem with excellent results. However, these architectures are extremely deep and complex. Thus, real-time performance can be achieved only by leveraging power-hungry GPUs that do not allow to infer depth maps in application fields characterized by low-power constraints. To tackle this issue, in this paper we propose a novel architecture capable to quickly infer an accurate depth map on a CPU, even of an embedded system, using a pyramid of features extracted from a single input image. Similarly to state-of-the-art, we train our network in an unsupervised manner casting depth estimation as an image reconstruction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
