FuseSeg: LiDAR Point Cloud Segmentation Fusing Multi-Modal Data

Georg Krispel; Michael Opitz; Georg Waltner; Horst Possegger; Horst; Bischof

arXiv:1912.08487·cs.CV·December 20, 2019

FuseSeg: LiDAR Point Cloud Segmentation Fusing Multi-Modal Data

Georg Krispel, Michael Opitz, Georg Waltner, Horst Possegger, Horst, Bischof

PDF

TL;DR

FuseSeg introduces a fusion approach combining LiDAR and RGB data for improved point cloud segmentation, achieving up to 18% IoU gain and real-time processing at 50 fps.

Contribution

It presents a novel fusion method that integrates multi-modal data into a single network for enhanced segmentation accuracy.

Findings

01

Up to 18% IoU improvement on KITTI benchmark

02

Real-time segmentation at 50 fps

03

Effective fusion of LiDAR and RGB data

Abstract

We introduce a simple yet effective fusion method of LiDAR and RGB data to segment LiDAR point clouds. Utilizing the dense native range representation of a LiDAR sensor and the setup calibration, we establish point correspondences between the two input modalities. Subsequently, we are able to warp and fuse the features from one domain into the other. Therefore, we can jointly exploit information from both data sources within one single network. To show the merit of our method, we extend SqueezeSeg, a point cloud segmentation network, with an RGB feature branch and fuse it into the original structure. Our extension called FuseSeg leads to an improvement of up to 18% IoU on the KITTI benchmark. In addition to the improved accuracy, we also achieve real-time performance at 50 fps, five times as fast as the KITTI LiDAR data recording speed.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.