Outdoor inverse rendering from a single image using multiview self-supervision
Ye Yu, William A. P. Smith

TL;DR
This paper presents a neural network approach for scene-level inverse rendering from a single uncontrolled image, leveraging multiview self-supervision and MVS to recover shape, reflectance, and lighting.
Contribution
It introduces a novel method combining multiview stereo supervision with self-supervised learning for inverse rendering from a single image.
Findings
Effective recovery of shape, reflectance, and lighting from uncontrolled images.
First use of MVS supervision in inverse rendering.
Improved performance on inverse rendering and normal map benchmarks.
Abstract
In this paper we show how to perform scene-level inverse rendering to recover shape, reflectance and lighting from a single, uncontrolled image using a fully convolutional neural network. The network takes an RGB image as input, regresses albedo, shadow and normal maps from which we infer least squares optimal spherical harmonic lighting coefficients. Our network is trained using large uncontrolled multiview and timelapse image collections without ground truth. By incorporating a differentiable renderer, our network can learn from self-supervision. Since the problem is ill-posed we introduce additional supervision. Our key insight is to perform offline multiview stereo (MVS) on images containing rich illumination variation. From the MVS pose and depth maps, we can cross project between overlapping views such that Siamese training can be used to ensure consistent estimation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Advanced Image Processing Techniques
