Im2Avatar: Colorful 3D Reconstruction from a Single Image
Yongbin Sun, Ziwei Liu, Yue Wang, Sanjay E. Sarma

TL;DR
This paper introduces an end-to-end framework called Colorful Voxel Network for reconstructing textured 3D models from a single image, effectively recovering shape and surface color simultaneously.
Contribution
The work presents a novel deep learning approach that jointly predicts 3D shape and surface color from a single image, including a new loss function for sparse volume representation.
Findings
Significant improvement over baseline methods.
Effective generalization across object categories.
Accurate textured 3D reconstructions from single images.
Abstract
Existing works on single-image 3D reconstruction mainly focus on shape recovery. In this work, we study a new problem, that is, simultaneously recovering 3D shape and surface color from a single image, namely "colorful 3D reconstruction". This problem is both challenging and intriguing because the ability to infer textured 3D model from a single image is at the core of visual understanding. Here, we propose an end-to-end trainable framework, Colorful Voxel Network (CVN), to tackle this problem. Conditioned on a single 2D input, CVN learns to decompose shape and surface color information of a 3D object into a 3D shape branch and a surface color branch, respectively. Specifically, for the shape recovery, we generate a shape volume with the state of its voxels indicating occupancy. For the surface color recovery, we combine the strength of appearance hallucination and geometric projection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
