Long-Term Occupancy Grid Prediction Using Recurrent Neural Networks
Marcel Schreiber, Stefan Hoermann, Klaus Dietmayer

TL;DR
This paper presents a deep learning approach using ConvLSTMs for long-term occupancy grid prediction in urban driving scenarios, enabling multi-hour predictions of scene evolution with improved accuracy.
Contribution
It introduces a novel RNN-based architecture with recurrent skip connections for predicting small occluded objects and static regions in long sequences of real-world data.
Findings
Enhanced prediction accuracy over previous models
Ability to predict small occluded objects like pedestrians
Effective long-term scene evolution modeling
Abstract
We tackle the long-term prediction of scene evolution in a complex downtown scenario for automated driving based on Lidar grid fusion and recurrent neural networks (RNNs). A bird's eye view of the scene, including occupancy and velocity, is fed as a sequence to a RNN which is trained to predict future occupancy. The nature of prediction allows generation of multiple hours of training data without the need of manual labeling. Thus, the training strategy and loss function is designed for long sequences of real-world data (unbalanced, continuously changing situations, false labels, etc.). The deep CNN architecture comprises convolutional long short-term memories (ConvLSTMs) to separate static from dynamic regions and to predict dynamic objects in future frames. Novel recurrent skip connections show the ability to predict small occluded objects, i.e. pedestrians, and occluded static…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
