Fully Understanding Generic Objects: Modeling, Segmentation, and   Reconstruction

Feng Liu; Luan Tran; Xiaoming Liu

arXiv:2104.00858·cs.CV·April 5, 2021

Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction

Feng Liu, Luan Tran, Xiaoming Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces a semi-supervised method for 3D object reconstruction from 2D images, leveraging latent representations and a category-adaptive 3D joint occupancy field to improve accuracy on real and synthetic data.

Contribution

It proposes a novel semi-supervised learning framework that decomposes images into latent representations and uses a 3D joint occupancy field for better shape and albedo modeling.

Findings

01

Superior 3D reconstruction from single images of real and synthetic objects.

02

Effective shape segmentation demonstrated.

03

Leverages real 2D images without 3D ground truth for modeling.

Abstract

Inferring 3D structure of a generic object from a 2D image is a long-standing objective of computer vision. Conventional approaches either learn completely from CAD-generated synthetic data, which have difficulty in inference from real images, or generate 2.5D depth image via intrinsic decomposition, which is limited compared to the full 3D reconstruction. One fundamental challenge lies in how to leverage numerous real 2D images without any 3D ground truth. To address this issue, we take an alternative approach with semi-supervised learning. That is, for a 2D image of a generic object, we decompose it into latent representations of category, shape and albedo, lighting and camera projection matrix, decode the representations to segmented 3D shape and albedo respectively, and fuse these components to render an image well approximating the input image. Using a category-adaptive 3D joint…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liuf1990/Fully_3D_Object
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Robotics and Sensor-Based Localization