Robust Deep Neural Object Detection and Segmentation for Automotive   Driving Scenario with Compressed Image Data

Kristian Fischer; Christian Blum; Christian Herglotz; Andr\'e Kaup

arXiv:2205.06501·eess.IV·May 16, 2022·ISCA

Robust Deep Neural Object Detection and Segmentation for Automotive Driving Scenario with Compressed Image Data

Kristian Fischer, Christian Blum, Christian Herglotz, Andr\'e Kaup

PDF

TL;DR

This paper enhances the robustness of deep neural object detection and segmentation models for autonomous driving by training with compressed images, leading to improved accuracy and significant bitrate savings.

Contribution

It introduces a training method that incorporates compressed images to improve the robustness of Faster and Mask R-CNN models in automotive scenarios.

Findings

01

Up to 3.68% increase in average precision on compressed images.

02

Nearly 48% bitrate savings achieved.

03

Improved model robustness in real-world compressed data scenarios.

Abstract

Deep neural object detection or segmentation networks are commonly trained with pristine, uncompressed data. However, in practical applications the input images are usually deteriorated by compression that is applied to efficiently transmit the data. Thus, we propose to add deteriorated images to the training process in order to increase the robustness of the two state-of-the-art networks Faster and Mask R-CNN. Throughout our paper, we investigate an autonomous driving scenario by evaluating the newly trained models on the Cityscapes dataset that has been compressed with the upcoming video coding standard Versatile Video Coding (VVC). When employing the models that have been trained with the proposed method, the weighted average precision of the R-CNNs can be increased by up to 3.68 percentage points for compressed input images, which corresponds to bitrate savings of nearly 48 %.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRegion Proposal Network · Convolution · Softmax · RoIAlign · Mask R-CNN