Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision
Paul Henderson, Vittorio Ferrari

TL;DR
This paper introduces a versatile framework for 3D shape reconstruction and generation from 2D images alone, eliminating the need for 3D annotations and supporting weak supervision, while leveraging mesh representations and shading cues.
Contribution
It presents a novel 2D-supervised method for 3D mesh generation and reconstruction that works with single views and no pose annotations, outperforming voxel-based methods.
Findings
Disentangles shape and pose effectively.
Shading information enhances reconstruction quality.
Achieves comparable or better results than voxel-based approaches.
Abstract
We present a unified framework tackling two problems: class-specific 3D reconstruction from a single image, and generation of new 3D shape samples. These tasks have received considerable attention recently; however, existing approaches rely on 3D supervision, annotation of 2D images with keypoints or poses, and/or training with multiple views of each object instance. Our framework is very general: it can be trained in similar settings to these existing approaches, while also supporting weaker supervision scenarios. Importantly, it can be trained purely from 2D images, without ground-truth pose annotations, and with a single view per instance. We employ meshes as an output representation, instead of voxels used in most prior work. This allows us to exploit shading information during training, which previous 2D-supervised methods cannot. Thus, our method can learn to generate and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Advanced Vision and Imaging
