Fail-Safe Execution of Deep Learning based Systems through Uncertainty   Monitoring

Michael Weiss; Paolo Tonella

arXiv:2102.00902·cs.SE·February 3, 2021

Fail-Safe Execution of Deep Learning based Systems through Uncertainty Monitoring

Michael Weiss, Paolo Tonella

PDF

Open Access 2 Repos

TL;DR

This paper presents a method for enhancing the safety of deep learning systems by using uncertainty estimators to detect unreliable predictions and trigger safety mechanisms, validated through extensive empirical study.

Contribution

It introduces novel uncertainty metrics, discusses their advantages and disadvantages, and provides a publicly available tool for uncertainty estimation in DNNs for fail-safe system deployment.

Findings

01

Uncertainty estimators can effectively identify unreliable DNN predictions.

02

The proposed metrics outperform existing approaches in empirical assessments.

03

The approach improves the safety and reliability of deep learning systems in critical applications.

Abstract

Modern software systems rely on Deep Neural Networks (DNN) when processing complex, unstructured inputs, such as images, videos, natural language texts or audio signals. Provided the intractably large size of such input spaces, the intrinsic limitations of learning algorithms, and the ambiguity about the expected predictions for some of the inputs, not only there is no guarantee that DNN's predictions are always correct, but rather developers must safely assume a low, though not negligible, error probability. A fail-safe Deep Learning based System (DLS) is one equipped to handle DNN faults by means of a supervisor, capable of recognizing predictions that should not be trusted and that should activate a healing procedure bringing the DLS to a safe state. In this paper, we propose an approach to use DNN uncertainty estimators to implement such a supervisor. We first discuss the advantages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Software Reliability and Analysis Research · Risk and Safety Analysis