PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation

Danfei Xu; Dragomir Anguelov; Ashesh Jain

arXiv:1711.10871·cs.CV·August 28, 2018·55 cites

PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation

Danfei Xu, Dragomir Anguelov, Ashesh Jain

PDF

Open Access 3 Repos

TL;DR

PointFusion introduces a versatile deep learning approach that fuses image and point cloud data for accurate 3D object detection across diverse environments without dataset-specific tuning.

Contribution

It presents a simple, application-agnostic fusion network combining CNN and PointNet outputs for 3D bounding box estimation from multi-sensor data.

Findings

01

Outperforms or matches state-of-the-art on KITTI and SUN-RGBD datasets.

02

Works effectively across outdoor and indoor environments.

03

No dataset-specific tuning required.

Abstract

We present PointFusion, a generic 3D object detection method that leverages both image and 3D point cloud information. Unlike existing methods that either use multi-stage pipelines or hold sensor and dataset-specific assumptions, PointFusion is conceptually simple and application-agnostic. The image data and the raw point cloud data are independently processed by a CNN and a PointNet architecture, respectively. The resulting outputs are then combined by a novel fusion network, which predicts multiple 3D box hypotheses and their confidences, using the input 3D points as spatial anchors. We evaluate PointFusion on two distinctive datasets: the KITTI dataset that features driving scenes captured with a lidar-camera setup, and the SUN-RGBD dataset that captures indoor environments with RGB-D cameras. Our model is the first one that is able to perform better or on-par with the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Surveying and Cultural Heritage · Robotics and Sensor-Based Localization · Remote Sensing and LiDAR Applications