De-rendering 3D Objects in the Wild
Felix Wimbauer, Shangzhe Wu, Christian Rupprecht

TL;DR
This paper introduces a weakly supervised method for decomposing 2D images into detailed 3D representations, including shape, material, and lighting, using minimal initial shape supervision, suitable for diverse real-world objects.
Contribution
It presents a novel weakly supervised approach that leverages rough shape estimates for 3D decomposition from single images, enabling generalization to unseen categories.
Findings
Successfully de-decomposes images into 3D shape, material, and lighting.
Generalizes well to unseen object categories.
Provides a synthetic dataset for quantitative evaluation.
Abstract
With increasing focus on augmented and virtual reality applications (XR) comes the demand for algorithms that can lift objects from images and videos into representations that are suitable for a wide variety of related 3D tasks. Large-scale deployment of XR devices and applications means that we cannot solely rely on supervised learning, as collecting and annotating data for the unlimited variety of objects in the real world is infeasible. We present a weakly supervised method that is able to decompose a single image of an object into shape (depth and normals), material (albedo, reflectivity and shininess) and global lighting parameters. For training, the method only relies on a rough initial shape estimate of the training objects to bootstrap the learning process. This shape supervision can come for example from a pretrained depth network or - more generically - from a traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
