LiDAR-based Recurrent 3D Semantic Segmentation with Temporal Memory   Alignment

Fabian Duerr; Mario Pfaller; Hendrik Weigel; Juergen Beyerer

arXiv:2103.02263·cs.CV·March 4, 2021

LiDAR-based Recurrent 3D Semantic Segmentation with Temporal Memory Alignment

Fabian Duerr, Mario Pfaller, Hendrik Weigel, Juergen Beyerer

PDF

Open Access

TL;DR

This paper introduces a recurrent 3D semantic segmentation method for LiDAR point clouds that leverages temporal memory alignment using ego motion, significantly improving segmentation accuracy over single-frame approaches.

Contribution

It proposes a novel recurrent architecture with Temporal Memory Alignment for 3D semantic segmentation, exploiting temporal information in LiDAR sequences for the first time.

Findings

01

Ranks first on SemanticKITTI multiple scan benchmark.

02

Achieves state-of-the-art on single scan benchmark.

03

Temporal information significantly improves segmentation results.

Abstract

Understanding and interpreting a 3d environment is a key challenge for autonomous vehicles. Semantic segmentation of 3d point clouds combines 3d information with semantics and thereby provides a valuable contribution to this task. In many real-world applications, point clouds are generated by lidar sensors in a consecutive fashion. Working with a time series instead of single and independent frames enables the exploitation of temporal information. We therefore propose a recurrent segmentation architecture (RNN), which takes a single range image frame as input and exploits recursively aggregated temporal information. An alignment strategy, which we call Temporal Memory Alignment, uses ego motion to temporally align the memory between consecutive frames in feature space. A Residual Network and ConvGRU are investigated for the memory update. We demonstrate the benefits of the presented…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote Sensing and LiDAR Applications · 3D Shape Modeling and Analysis · Advanced Vision and Imaging