TL;DR
Plane2Depth introduces a hierarchical framework that adaptively incorporates plane information to significantly enhance monocular depth estimation, especially in low-texture indoor scenes, outperforming existing methods.
Contribution
The paper presents a novel plane-guided depth generator and adaptive feature aggregation modules that effectively utilize scene planes for improved depth prediction.
Findings
Outperforms state-of-the-art on NYU-Depth-v2
Achieves competitive results on KITTI
Generalizes well to unseen scenes
Abstract
Monocular depth estimation aims to infer a dense depth map from a single image, which is a fundamental and prevalent task in computer vision. Many previous works have shown impressive depth estimation results through carefully designed network structures, but they usually ignore the planar information and therefore perform poorly in low-texture areas of indoor scenes. In this paper, we propose Plane2Depth, which adaptively utilizes plane information to improve depth prediction within a hierarchical framework. Specifically, in the proposed plane guided depth generator (PGDG), we design a set of plane queries as prototypes to softly model planes in the scene and predict per-pixel plane coefficients. Then the predicted plane coefficients can be converted into metric depth values with the pinhole camera model. In the proposed adaptive plane query aggregation (APGA) module, we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
