MonoPlane: Exploiting Monocular Geometric Cues for Generalizable 3D Plane Reconstruction
Wang Zhao, Jiachen Liu, Sheng Zhang, Yishu Li, Sili Chen, Sharon X, Huang, Yong-Jin Liu, Hengkai Guo

TL;DR
MonoPlane introduces a monocular geometric cue-based framework for generalizable 3D plane detection and reconstruction, combining neural network-derived depth and normals with RANSAC and multi-plane optimization to achieve state-of-the-art results in diverse settings.
Contribution
The paper proposes a novel monocular geometric cue-based pipeline that generalizes well across datasets for 3D plane reconstruction, surpassing prior methods in zero-shot scenarios.
Findings
Achieves state-of-the-art zero-shot plane reconstruction performance.
Demonstrates robustness and scalability in diverse real-world datasets.
Extends single-view to sparse-view 3D plane reconstruction.
Abstract
This paper presents a generalizable 3D plane detection and reconstruction framework named MonoPlane. Unlike previous robust estimator-based works (which require multiple images or RGB-D input) and learning-based works (which suffer from domain shift), MonoPlane combines the best of two worlds and establishes a plane reconstruction pipeline based on monocular geometric cues, resulting in accurate, robust and scalable 3D plane detection and reconstruction in the wild. Specifically, we first leverage large-scale pre-trained neural networks to obtain the depth and surface normals from a single image. These monocular geometric cues are then incorporated into a proximity-guided RANSAC framework to sequentially fit each plane instance. We exploit effective 3D point proximity and model such proximity via a graph within RANSAC to guide the plane fitting from noisy monocular depths, followed by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Optical measurement and interference techniques
