Cascade Network for Self-Supervised Monocular Depth Estimation
Chunlai Chai, Yukuan Lou, Shijin Zhang

TL;DR
This paper introduces a cascade network-based self-supervised method for monocular depth estimation, dividing scenes into parts to improve accuracy and reliability, achieving state-of-the-art results on the KITTI benchmark.
Contribution
It proposes a novel cascade network approach that segments scenes by depth and trains separate models for each segment, enhancing depth estimation performance.
Findings
Improved accuracy over previous self-supervised methods
Achieved state-of-the-art results on KITTI benchmark
Demonstrated effectiveness of scene segmentation in depth estimation
Abstract
It is a classical compute vision problem to obtain real scene depth maps by using a monocular camera, which has been widely concerned in recent years. However, training this model usually requires a large number of artificially labeled samples. To solve this problem, some researchers use a self-supervised learning model to overcome this problem and reduce the dependence on manually labeled data. Nevertheless, the accuracy and reliability of these methods have not reached the expected standard. In this paper, we propose a new self-supervised learning method based on cascade networks. Compared with the previous self-supervised methods, our method has improved accuracy and reliability, and we have proved this by experiments. We show a cascaded neural network that divides the target scene into parts of different sight distances and trains them separately to generate a better depth map. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Optical measurement and interference techniques
