Enforcing geometric constraints of virtual normal for depth prediction
Wei Yin, Yifan Liu, Chunhua Shen, Youliang Yan

TL;DR
This paper introduces a novel loss function enforcing 3D geometric constraints via virtual normal directions, significantly improving monocular depth prediction accuracy and enabling direct 3D structure recovery.
Contribution
It proposes a new geometric constraint-based loss for depth prediction that enhances accuracy and allows direct 3D structure reconstruction without additional models.
Findings
Achieves state-of-the-art results on NYU Depth-V2 and KITTI datasets.
Improves depth prediction accuracy by incorporating geometric constraints.
Enables direct recovery of 3D structures like point clouds and surface normals.
Abstract
Monocular depth prediction plays a crucial role in understanding 3D scene geometry. Although recent methods have achieved impressive progress in evaluation metrics such as the pixel-wise relative error, most methods neglect the geometric constraints in the 3D space. In this work, we show the importance of the high-order 3D geometric constraints for depth prediction. By designing a loss term that enforces one simple type of geometric constraints, namely, virtual normal directions determined by randomly sampled three points in the reconstructed 3D space, we can considerably improve the depth prediction accuracy. Significantly, the byproduct of this predicted depth being sufficiently accurate is that we are now able to recover good 3D structures of the scene such as the point cloud and surface normal directly from the depth, eliminating the necessity of training new sub-models as was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization
