Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth   Estimation

Li Liu; Ruijie Zhu; Jiacheng Deng; Ziyang Song; Wenfei Yang; Tianzhu; Zhang

arXiv:2409.02494·cs.CV·October 10, 2024

Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth Estimation

Li Liu, Ruijie Zhu, Jiacheng Deng, Ziyang Song, Wenfei Yang, Tianzhu, Zhang

PDF

2 Repos

TL;DR

Plane2Depth introduces a hierarchical framework that adaptively incorporates plane information to significantly enhance monocular depth estimation, especially in low-texture indoor scenes, outperforming existing methods.

Contribution

The paper presents a novel plane-guided depth generator and adaptive feature aggregation modules that effectively utilize scene planes for improved depth prediction.

Findings

01

Outperforms state-of-the-art on NYU-Depth-v2

02

Achieves competitive results on KITTI

03

Generalizes well to unseen scenes

Abstract

Monocular depth estimation aims to infer a dense depth map from a single image, which is a fundamental and prevalent task in computer vision. Many previous works have shown impressive depth estimation results through carefully designed network structures, but they usually ignore the planar information and therefore perform poorly in low-texture areas of indoor scenes. In this paper, we propose Plane2Depth, which adaptively utilizes plane information to improve depth prediction within a hierarchical framework. Specifically, in the proposed plane guided depth generator (PGDG), we design a set of plane queries as prototypes to softly model planes in the scene and predict per-pixel plane coefficients. Then the predicted plane coefficients can be converted into metric depth values with the pinhole camera model. In the proposed adaptive plane query aggregation (APGA) module, we introduce a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training