Learning to Reconstruct and Segment 3D Objects

Bo Yang

arXiv:2010.09582·cs.CV·October 20, 2020

Learning to Reconstruct and Segment 3D Objects

Bo Yang

PDF

Open Access 1 Repo

TL;DR

This paper explores deep learning methods for reconstructing and segmenting 3D objects from various visual inputs, aiming to improve understanding of 3D environments beyond traditional hand-crafted approaches.

Contribution

It introduces novel deep neural network techniques for 3D shape estimation and semantic segmentation at object and scene levels, trained on large-scale real-world data.

Findings

01

Deep neural networks outperform traditional methods in 3D shape estimation.

02

Robust segmentation achieved even with occlusions and limited views.

03

Scene understanding improved through integrated object-level analysis.

Abstract

To endow machines with the ability to perceive the real-world in a three dimensional representation as we do as humans is a fundamental and long-standing topic in Artificial Intelligence. Given different types of visual inputs such as images or point clouds acquired by 2D/3D sensors, one important goal is to understand the geometric structure and semantics of the 3D environment. Traditional approaches usually leverage hand-crafted features to estimate the shape and semantics of objects or scenes. However, they are difficult to generalize to novel objects and scenarios, and struggle to overcome critical issues caused by visual occlusions. By contrast, we aim to understand scenes and the objects within them by learning general and robust representations using deep neural networks, trained on large-scale real-world 3D data. To achieve these aims, this thesis makes three core contributions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Yang7879/3D-BoNet
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization