Generative and Discriminative Voxel Modeling with Convolutional Neural   Networks

Andrew Brock; Theodore Lim; J.M. Ritchie; Nick Weston

arXiv:1608.04236·cs.CV·August 17, 2016·451 cites

Generative and Discriminative Voxel Modeling with Convolutional Neural Networks

Andrew Brock, Theodore Lim, J.M. Ritchie, Nick Weston

PDF

Open Access 2 Repos

TL;DR

This paper investigates voxel-based 3D data representations, introducing new neural network methods for shape modeling and object classification, achieving significant improvements on benchmark datasets.

Contribution

It presents novel training methods for voxel-based variational autoencoders, a user interface for latent space exploration, and a deep CNN architecture for object classification.

Findings

01

51.5% relative improvement on ModelNet benchmark

02

Effective voxel-based shape modeling and classification

03

Addressed challenges specific to voxel representations

Abstract

When working with three-dimensional data, choice of representation is key. We explore voxel-based models, and present evidence for the viability of voxellated representations in applications including shape modeling and object classification. Our key contributions are methods for training voxel-based variational autoencoders, a user interface for exploring the latent space learned by the autoencoder, and a deep convolutional neural network architecture for object classification. We address challenges unique to voxel-based representations, and empirically evaluate our models on the ModelNet benchmark, where we demonstrate a 51.5% relative improvement in the state of the art for object classification.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Image Processing and 3D Reconstruction · Computer Graphics and Visualization Techniques