LayoutNet: Reconstructing the 3D Room Layout from a Single RGB Image
Chuhang Zou, Alex Colburn, Qi Shan, Derek Hoiem

TL;DR
This paper introduces LayoutNet, an algorithm that reconstructs 3D room layouts from a single RGB image, effectively handling panoramas, perspective images, cuboid, and complex layouts with improved accuracy and speed.
Contribution
LayoutNet is the first method to operate directly on panoramic images for 3D room layout reconstruction, improving accuracy and generalization over existing approaches.
Findings
Performs well on panoramic images in speed and accuracy.
Achieves high accuracy on perspective images.
Handles both cuboid and complex Manhattan layouts.
Abstract
We propose an algorithm to predict room layout from a single image that generalizes across panoramas and perspective images, cuboid layouts and more general layouts (e.g. L-shape room). Our method operates directly on the panoramic image, rather than decomposing into perspective images as do recent works. Our network architecture is similar to that of RoomNet, but we show improvements due to aligning the image based on vanishing points, predicting multiple layout elements (corners, boundaries, size and translation), and fitting a constrained Manhattan layout to the resulting predictions. Our method compares well in speed and accuracy to other existing work on panoramas, achieves among the best accuracy for perspective images, and can handle both cuboid-shaped and more general Manhattan layouts.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · 3D Surveying and Cultural Heritage
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
