GPR-Net: Multi-view Layout Estimation via a Geometry-aware Panorama Registration Network
Jheng-Wei Su, Chi-Han Peng, Peter Wonka, Hung-Kuo Chu

TL;DR
This paper introduces GPR-Net, a novel geometry-aware network that jointly learns panorama registration and 3D layout estimation from multiple 360-degree images without prior pose information, achieving state-of-the-art results.
Contribution
The paper proposes GPR-Net, a new framework that effectively addresses wide baseline panorama registration and layout estimation jointly, without relying on pose priors.
Findings
Achieves state-of-the-art performance on ZInD dataset.
Effectively handles wide baseline registration.
Jointly estimates layouts and registration without pose prior.
Abstract
Reconstructing 3D layouts from multiple panoramas has received increasing attention recently as estimating a complete layout of a large-scale and complex room from a single panorama is very difficult. The state-of-the-art method, called PSMNet, introduces the first learning-based framework that jointly estimates the room layout and registration given a pair of panoramas. However, PSMNet relies on an approximate (i.e., "noisy") registration as input. Obtaining this input requires a solution for wide baseline registration which is a challenging problem. In this work, we present a complete multi-view panoramic layout estimation framework that jointly learns panorama registration and layout estimation given a pair of panoramas without relying on a pose prior. The major improvement over PSMNet comes from a novel Geometry-aware Panorama Registration Network or GPR-Net that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Dense Connections · Softmax · Vision Transformer
