Weakly supervised learning of indoor geometry by dual warping
Pulak Purkait, Ujwal Bonde, Christopher Zach

TL;DR
This paper introduces a weakly supervised method for indoor 3D scene prediction that operates in image space, using depth completion and novel data generation, achieving superior results in view synthesis tasks.
Contribution
The paper presents a novel image-space depth completion approach for indoor scenes using weak supervision and realistic occlusion data generation.
Findings
Outperforms existing methods on benchmark datasets
Effective for new view synthesis of RGB-D images
Operates efficiently without voxel-based 3D representations
Abstract
A major element of depth perception and 3D understanding is the ability to predict the 3D layout of a scene and its contained objects for a novel pose. Indoor environments are particularly suitable for novel view prediction, since the set of objects in such environments is relatively restricted. In this work we address the task of 3D prediction especially for indoor scenes by leveraging only weak supervision. In the literature 3D scene prediction is usually solved via a 3D voxel grid. However, such methods are limited to estimating rather coarse 3D voxel grids, since predicting entire voxel spaces has large computational costs. Hence, our method operates in image-space rather than in voxel space, and the task of 3D estimation essentially becomes a depth image completion problem. We propose a novel approach to easily generate training data containing depth maps with realistic occlusions,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Remote Sensing and LiDAR Applications · Computer Graphics and Visualization Techniques
