LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic   Segmentation

Lin Zhao; Hui Zhou; Xinge Zhu; Xiao Song; Hongsheng Li; Wenbing Tao

arXiv:2108.07511·cs.CV·August 18, 2021

LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation

Lin Zhao, Hui Zhou, Xinge Zhu, Xiao Song, Hongsheng Li, Wenbing Tao

PDF

TL;DR

LIF-Seg introduces a novel coarse-to-fine fusion network for LiDAR and camera data that effectively addresses alignment and fusion challenges, significantly improving 3D LiDAR semantic segmentation accuracy in autonomous driving.

Contribution

The paper proposes a new fusion strategy that fully utilizes image context and rectifies sensor alignment issues, advancing multi-modal fusion techniques for autonomous vehicle perception.

Findings

01

Outperforms existing methods on nuScenes dataset

02

Effectively addresses weak spatiotemporal synchronization

03

Demonstrates significant accuracy improvements

Abstract

Camera and 3D LiDAR sensors have become indispensable devices in modern autonomous driving vehicles, where the camera provides the fine-grained texture, color information in 2D space and LiDAR captures more precise and farther-away distance measurements of the surrounding environments. The complementary information from these two sensors makes the two-modality fusion be a desired option. However, two major issues of the fusion between camera and LiDAR hinder its performance, \ie, how to effectively fuse these two modalities and how to precisely align them (suffering from the weak spatiotemporal synchronization problem). In this paper, we propose a coarse-to-fine LiDAR and camera fusion-based network (termed as LIF-Seg) for LiDAR segmentation. For the first issue, unlike these previous works fusing the point cloud and image information in a one-to-one manner, the proposed method fully…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.