GPR-Net: Multi-view Layout Estimation via a Geometry-aware Panorama   Registration Network

Jheng-Wei Su; Chi-Han Peng; Peter Wonka; Hung-Kuo Chu

arXiv:2210.11419·cs.CV·October 24, 2022

GPR-Net: Multi-view Layout Estimation via a Geometry-aware Panorama Registration Network

Jheng-Wei Su, Chi-Han Peng, Peter Wonka, Hung-Kuo Chu

PDF

Open Access

TL;DR

This paper introduces GPR-Net, a novel geometry-aware network that jointly learns panorama registration and 3D layout estimation from multiple 360-degree images without prior pose information, achieving state-of-the-art results.

Contribution

The paper proposes GPR-Net, a new framework that effectively addresses wide baseline panorama registration and layout estimation jointly, without relying on pose priors.

Findings

01

Achieves state-of-the-art performance on ZInD dataset.

02

Effectively handles wide baseline registration.

03

Jointly estimates layouts and registration without pose prior.

Abstract

Reconstructing 3D layouts from multiple $36 0^{\circ}$ panoramas has received increasing attention recently as estimating a complete layout of a large-scale and complex room from a single panorama is very difficult. The state-of-the-art method, called PSMNet, introduces the first learning-based framework that jointly estimates the room layout and registration given a pair of panoramas. However, PSMNet relies on an approximate (i.e., "noisy") registration as input. Obtaining this input requires a solution for wide baseline registration which is a challenging problem. In this work, we present a complete multi-view panoramic layout estimation framework that jointly learns panorama registration and layout estimation given a pair of panoramas without relying on a pose prior. The major improvement over PSMNet comes from a novel Geometry-aware Panorama Registration Network or GPR-Net that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Dense Connections · Softmax · Vision Transformer