Object Detection on Dynamic Occupancy Grid Maps Using Deep Learning and Automatic Label Generation
Stefan Hoermann, Philipp Henzler, Martin Bach, Klaus Dietmayer

TL;DR
This paper presents a deep learning approach for object detection and pose estimation in dynamic occupancy grid maps, utilizing automatic label generation and a specialized loss function to handle data imbalance.
Contribution
It introduces a novel automatic labeling method and a loss function tailored for training deep networks on dynamic occupancy data, improving object detection in complex environments.
Findings
Achieved an average precision of 75.9% in object detection.
Demonstrated good generalization capabilities of the trained network.
Developed an offline algorithm for automatic label generation from sensor data.
Abstract
We tackle the problem of object detection and pose estimation in a shared space downtown environment. For perception multiple laser scanners with 360{\deg} coverage were fused in a dynamic occupancy grid map (DOGMa). A single-stage deep convolutional neural network is trained to provide object hypotheses comprising of shape, position, orientation and an existence score from a single input DOGMa. Furthermore, an algorithm for offline object extraction was developed to automatically label several hours of training data. The algorithm is based on a two-pass trajectory extraction, forward and backward in time. Typical for engineered algorithms, the automatic label generation suffers from misdetections, which makes hard negative mining impractical. Therefore, we propose a loss function counteracting the high imbalance between mostly static background and extremely rare dynamic grid cells.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
