AlphaTablets: A Generic Plane Representation for 3D Planar   Reconstruction from Monocular Videos

Yuze He; Wang Zhao; Shaohui Liu; Yubin Hu; Yushi Bai; Yu-Hui Wen,; Yong-Jin Liu

arXiv:2411.19950·cs.CV·December 2, 2024

AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos

Yuze He, Wang Zhao, Shaohui Liu, Yubin Hu, Yushi Bai, Yu-Hui Wen,, Yong-Jin Liu

PDF

Open Access 1 Video

TL;DR

AlphaTablets introduces a new 3D plane representation using rectangles with alpha channels, enabling accurate and flexible 3D planar reconstruction from monocular videos with state-of-the-art results.

Contribution

The paper presents AlphaTablets, a novel, differentiable, rectangle-based 3D plane representation that improves reconstruction accuracy and boundary delineation from monocular videos.

Findings

01

Achieves state-of-the-art 3D planar reconstruction on ScanNet

02

Enables accurate boundary and surface modeling of planes

03

Demonstrates effective merging and refinement of AlphaTablets

Abstract

We introduce AlphaTablets, a novel and generic representation of 3D planes that features continuous 3D surface and precise boundary delineation. By representing 3D planes as rectangles with alpha channels, AlphaTablets combine the advantages of current 2D and 3D plane representations, enabling accurate, consistent and flexible modeling of 3D planes. We derive differentiable rasterization on top of AlphaTablets to efficiently render 3D planes into images, and propose a novel bottom-up pipeline for 3D planar reconstruction from monocular videos. Starting with 2D superpixels and geometric cues from pre-trained models, we initialize 3D planes as AlphaTablets and optimize them via differentiable rendering. An effective merging scheme is introduced to facilitate the growth and refinement of AlphaTablets. Through iterative optimization and merging, we reconstruct complete and accurate 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos· slideslive

Taxonomy

Topics3D Surveying and Cultural Heritage · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques