TL;DR
P3Depth introduces a novel piecewise planarity prior for supervised monocular depth estimation, leveraging coplanar pixel relationships to improve depth accuracy and boundary sharpness, achieving state-of-the-art results on NYU Depth-v2 and KITTI datasets.
Contribution
It proposes a dual-head neural network that predicts plane coefficients and seed pixel locations, enabling adaptive depth refinement based on local planarity.
Findings
Achieves new state-of-the-art accuracy on NYU Depth-v2.
Outperforms previous methods on KITTI dataset.
Produces depth maps with sharp occlusion boundaries.
Abstract
Monocular depth estimation is vital for scene understanding and downstream tasks. We focus on the supervised setup, in which ground-truth depth is available only at training time. Based on knowledge about the high regularity of real 3D scenes, we propose a method that learns to selectively leverage information from coplanar pixels to improve the predicted depth. In particular, we introduce a piecewise planarity prior which states that for each pixel, there is a seed pixel which shares the same planar 3D surface with the former. Motivated by this prior, we design a network with two heads. The first head outputs pixel-level plane coefficients, while the second one outputs a dense offset vector field that identifies the positions of seed pixels. The plane coefficients of seed pixels are then used to predict depth at each position. The resulting prediction is adaptively fused with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
