Improving Online Performance Prediction for Semantic Segmentation
Marvin Klingner, Andreas B\"ar, Marcel Mross, Tim Fingscheidt

TL;DR
This paper introduces an improved online performance prediction method for semantic segmentation DNNs, utilizing auxiliary monocular depth estimation and multi-task learning to enhance accuracy in safety-critical applications.
Contribution
It proposes a novel multi-task training approach with shared encoder-decoder architecture and temporal statistics aggregation for better online performance prediction.
Findings
Enhanced prediction accuracy on KITTI dataset
Shared encoder improves computational efficiency
Temporal aggregation reduces prediction error
Abstract
In this work we address the task of observing the performance of a semantic segmentation deep neural network (DNN) during online operation, i.e., during inference, which is of high importance in safety-critical applications such as autonomous driving. Here, many high-level decisions rely on such DNNs, which are usually evaluated offline, while their performance in online operation remains unknown. To solve this problem, we propose an improved online performance prediction scheme, building on a recently proposed concept of predicting the primary semantic segmentation task's performance. This can be achieved by evaluating the auxiliary task of monocular depth estimation with a measurement supplied by a LiDAR sensor and a subsequent regression to the semantic segmentation performance. In particular, we propose (i) sequential training methods for both tasks in a multi-task training setup,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
