TL;DR
This paper introduces a novel loss function for single-view 3D reconstruction from a 2D image that eliminates the need for differentiable rendering, achieving state-of-the-art results efficiently.
Contribution
It proposes a new silhouette-based loss function and a pipeline that avoids rendering, improving accuracy and training speed in 3D model generation from single images.
Findings
Achieves state-of-the-art performance on multiple datasets.
Outperforms existing supervised and unsupervised methods.
Reduces training time compared to rendering-based approaches.
Abstract
Differentiable rendering is a very successful technique that applies to a Single-View 3D Reconstruction. Current renderers use losses based on pixels between a rendered image of some 3D reconstructed object and ground-truth images from given matched viewpoints to optimise parameters of the 3D shape. These models require a rendering step, along with visibility handling and evaluation of the shading model. The main goal of this paper is to demonstrate that we can avoid these steps and still get reconstruction results as other state-of-the-art models that are equal or even better than existing category-specific reconstruction methods. First, we use the same CNN architecture for the prediction of a point cloud shape and pose prediction like the one used by Insafutdinov & Dosovitskiy. Secondly, we propose the novel effective loss function that evaluates how well the projections of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
