Enhancing Traffic Scene Predictions with Generative Adversarial Networks
Peter K\"onig, Sandra Aigner, Marco K\"orner

TL;DR
This paper introduces a two-stage pipeline combining video prediction and GAN-based enhancement to generate realistic traffic scene frames, improving object detection accuracy for autonomous driving.
Contribution
It presents a novel two-stage approach that enhances predicted traffic scene frames using GANs, leading to better object detection performance in autonomous driving scenarios.
Findings
Enhanced frames improve object detection accuracy by about 9%.
Traditional image metrics do not correlate well with detection performance.
The pipeline effectively generates realistic traffic scene predictions.
Abstract
We present a new two-stage pipeline for predicting frames of traffic scenes where relevant objects can still reliably be detected. Using a recent video prediction network, we first generate a sequence of future frames based on past frames. A second network then enhances these frames in order to make them appear more realistic. This ensures the quality of the predicted frames to be sufficient to enable accurate detection of objects, which is especially important for autonomously driving cars. To verify this two-stage approach, we conducted experiments on the Cityscapes dataset. For enhancing, we trained two image-to-image translation methods based on generative adversarial networks, one for blind motion deblurring and one for image super-resolution. All resulting predictions were quantitatively evaluated using both traditional metrics and a state-of-the-art object detection network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
