Multi-level 3D CNN for Learning Multi-scale Spatial Features

Sambit Ghadai; Xian Lee; Aditya Balu; Soumik Sarkar; Adarsh; Krishnamurthy

arXiv:1805.12254·cs.CV·May 7, 2019

Multi-level 3D CNN for Learning Multi-scale Spatial Features

Sambit Ghadai, Xian Lee, Aditya Balu, Soumik Sarkar, Adarsh, Krishnamurthy

PDF

1 Repo

TL;DR

This paper introduces a multi-level 3D CNN that learns multi-scale spatial features from voxel grids, improving 3D object recognition efficiency by reducing memory use while maintaining accuracy.

Contribution

The paper proposes an end-to-end multi-level voxel grid approach for 3D object recognition, addressing resolution and data uniformity challenges in existing methods.

Findings

01

Achieves comparable accuracy to dense voxel methods.

02

Uses significantly less memory than traditional dense voxel representations.

03

Demonstrates effective multi-scale feature learning for 3D object recognition.

Abstract

3D object recognition accuracy can be improved by learning the multi-scale spatial features from 3D spatial geometric representations of objects such as point clouds, 3D models, surfaces, and RGB-D data. Current deep learning approaches learn such features either using structured data representations (voxel grids and octrees) or from unstructured representations (graphs and point clouds). Learning features from such structured representations is limited by the restriction on resolution and tree depth while unstructured representations creates a challenge due to non-uniformity among data samples. In this paper, we propose an end-to-end multi-level learning approach on a multi-level voxel grid to overcome these drawbacks. To demonstrate the utility of the proposed multi-level learning, we use a multi-level voxel representation of 3D objects to perform object recognition. The multi-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

idealab-isu/GPView
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.