Object Detection on Dynamic Occupancy Grid Maps Using Deep Learning and   Automatic Label Generation

Stefan Hoermann; Philipp Henzler; Martin Bach; Klaus Dietmayer

arXiv:1802.02202·cs.CV·February 8, 2018

Object Detection on Dynamic Occupancy Grid Maps Using Deep Learning and Automatic Label Generation

Stefan Hoermann, Philipp Henzler, Martin Bach, Klaus Dietmayer

PDF

TL;DR

This paper presents a deep learning approach for object detection and pose estimation in dynamic occupancy grid maps, utilizing automatic label generation and a specialized loss function to handle data imbalance.

Contribution

It introduces a novel automatic labeling method and a loss function tailored for training deep networks on dynamic occupancy data, improving object detection in complex environments.

Findings

01

Achieved an average precision of 75.9% in object detection.

02

Demonstrated good generalization capabilities of the trained network.

03

Developed an offline algorithm for automatic label generation from sensor data.

Abstract

We tackle the problem of object detection and pose estimation in a shared space downtown environment. For perception multiple laser scanners with 360{\deg} coverage were fused in a dynamic occupancy grid map (DOGMa). A single-stage deep convolutional neural network is trained to provide object hypotheses comprising of shape, position, orientation and an existence score from a single input DOGMa. Furthermore, an algorithm for offline object extraction was developed to automatically label several hours of training data. The algorithm is based on a two-pass trajectory extraction, forward and backward in time. Typical for engineered algorithms, the automatic label generation suffers from misdetections, which makes hard negative mining impractical. Therefore, we propose a loss function counteracting the high imbalance between mostly static background and extremely rare dynamic grid cells.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.