3D-LMNet: Latent Embedding Matching for Accurate and Diverse 3D Point   Cloud Reconstruction from a Single Image

Priyanka Mandikal; K L Navaneet; Mayank Agarwal; R. Venkatesh Babu

arXiv:1807.07796·cs.CV·March 27, 2019·35 cites

3D-LMNet: Latent Embedding Matching for Accurate and Diverse 3D Point Cloud Reconstruction from a Single Image

Priyanka Mandikal, K L Navaneet, Mayank Agarwal, R. Venkatesh Babu

PDF

Open Access 1 Repo

TL;DR

This paper introduces 3D-LMNet, a novel method for single-image 3D point cloud reconstruction that leverages latent embedding matching and probabilistic modeling to produce accurate, diverse, and plausible 3D reconstructions.

Contribution

The paper presents a two-stage approach combining a 3D auto-encoder with a learned mapping from images to latent space, enabling multiple plausible reconstructions with diversity loss.

Findings

01

Outperforms state-of-the-art on real and synthetic datasets

02

Generates multiple diverse reconstructions for a single input image

03

Effectively models uncertainty in 3D reconstruction

Abstract

3D reconstruction from single view images is an ill-posed problem. Inferring the hidden regions from self-occluded images is both challenging and ambiguous. We propose a two-pronged approach to address these issues. To better incorporate the data prior and generate meaningful reconstructions, we propose 3D-LMNet, a latent embedding matching approach for 3D reconstruction. We first train a 3D point cloud auto-encoder and then learn a mapping from the 2D image to the corresponding learnt embedding. To tackle the issue of uncertainty in the reconstruction, we predict multiple reconstructions that are consistent with the input view. This is achieved by learning a probablistic latent space with a novel view-specific diversity loss. Thorough quantitative and qualitative analysis is performed to highlight the significance of the proposed approach. We outperform state-of-the-art approaches on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

val-iisc/3d-lmnet
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques