PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
Bin Tan, Nan Xue, Song Bai, Tianfu Wu, Gui-Song Xia

TL;DR
PlaneTR is a Transformer-based neural network that detects and reconstructs 3D planes from a single image by jointly leveraging contextual and structural information in a sequence-to-sequence framework, achieving state-of-the-art results.
Contribution
The paper introduces PlaneTR, a novel Transformer architecture that integrates geometric structures and context for holistic 3D plane detection and reconstruction from images.
Findings
Achieves state-of-the-art performance on ScanNet and NYUv2 datasets.
Effectively combines structural cues and context in a unified Transformer framework.
Outperforms previous methods in accuracy and efficiency.
Abstract
This paper presents a neural network built upon Transformers, namely PlaneTR, to simultaneously detect and reconstruct planes from a single image. Different from previous methods, PlaneTR jointly leverages the context information and the geometric structures in a sequence-to-sequence way to holistically detect plane instances in one forward pass. Specifically, we represent the geometric structures as line segments and conduct the network with three main components: (i) context and line segments encoders, (ii) a structure-guided plane decoder, (iii) a pixel-wise plane embedding decoder. Given an image and its detected line segments, PlaneTR generates the context and line segment sequences via two specially designed encoders and then feeds them into a Transformers-based decoder to directly predict a sequence of plane instances by simultaneously considering the context and global structure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Surveying and Cultural Heritage · Optical measurement and interference techniques
