End-to-end 3D shape inverse rendering of different classes of objects from a single input image
Shima Kamyab, S. Zohreh Azimifar

TL;DR
This paper introduces a semi-supervised deep learning framework for reconstructing detailed 3D shapes from a single 2D image, reducing labeled data requirements and improving accuracy over existing methods.
Contribution
The proposed framework combines unsupervised pre-training with a minimal supervised component, enabling detailed 3D shape recovery without relying on predefined assumptions.
Findings
Achieves dense 3D reconstructions with high detail
Reduces need for labeled data through unsupervised pre-training
Outperforms recent methods on benchmark datasets
Abstract
In this paper a semi-supervised deep framework is proposed for the problem of 3D shape inverse rendering from a single 2D input image. The main structure of proposed framework consists of unsupervised pre-trained components which significantly reduce the need to labeled data for training the whole framework. using labeled data has the advantage of achieving to accurate results without the need to predefined assumptions about image formation process. Three main components are used in the proposed network: an encoder which maps 2D input image to a representation space, a 3D decoder which decodes a representation to a 3D structure and a mapping component in order to map 2D to 3D representation. The only part that needs label for training is the mapping part with not too many parameters. The other components in the network can be pre-trained unsupervised using only 2D images or 3D data in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
