Single-Shot Cuboids: Geodesics-based End-to-end Manhattan Aligned Layout   Estimation from Spherical Panoramas

Nikolaos Zioulis; Federico Alvarez; Dimitrios Zarpalas; Petros Daras

arXiv:2102.03939·cs.CV·February 10, 2021

Single-Shot Cuboids: Geodesics-based End-to-end Manhattan Aligned Layout Estimation from Spherical Panoramas

Nikolaos Zioulis, Federico Alvarez, Dimitrios Zarpalas, Petros Daras

PDF

1 Repo

TL;DR

This paper introduces a novel end-to-end method for estimating Manhattan-aligned room layouts from spherical panoramas without intermediate steps, using direct coordinate regression and geodesic-aware loss functions.

Contribution

It is the first to directly infer Manhattan-aligned layouts in a single shot, removing the need for postprocessing and intermediate representations.

Findings

01

Achieves accurate Manhattan-aligned layout estimation from spherical panoramas.

02

Introduces geodesic heatmaps and loss for better keypoint detection.

03

Provides publicly available models and code for the community.

Abstract

It has been shown that global scene understanding tasks like layout estimation can benefit from wider field of views, and specifically spherical panoramas. While much progress has been made recently, all previous approaches rely on intermediate representations and postprocessing to produce Manhattan-aligned estimates. In this work we show how to estimate full room layouts in a single-shot, eliminating the need for postprocessing. Our work is the first to directly infer Manhattan-aligned outputs. To achieve this, our data-driven model exploits direct coordinate regression and is supervised end-to-end. As a result, we can explicitly add quasi-Manhattan constraints, which set the necessary conditions for a homography-based Manhattan alignment module. Finally, we introduce the geodesic heatmaps and loss and a boundary-aware center of mass calculation that facilitate higher quality keypoint…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

VCL3D/SingleShotCuboids
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.