Can we Defend Against the Unknown? An Empirical Study About Threshold   Selection for Neural Network Monitoring

Khoi Tran Dang; Kevin Delmas; J\'er\'emie Guiochet; Joris Gu\'erin

arXiv:2405.08654·cs.LG·May 22, 2024

Can we Defend Against the Unknown? An Empirical Study About Threshold Selection for Neural Network Monitoring

Khoi Tran Dang, Kevin Delmas, J\'er\'emie Guiochet, Joris Gu\'erin

PDF

Open Access

TL;DR

This paper investigates how to select effective thresholds for neural network runtime monitors, especially under unforeseen threats, highlighting the importance of threshold choice in real-world safety-critical applications.

Contribution

It provides empirical insights into threshold selection challenges and explores the impact of incorporating generic threats into threshold optimization for improved robustness.

Findings

01

Threshold choice significantly affects monitor effectiveness against unseen threats.

02

Incorporating generic threats into threshold optimization can improve robustness.

03

Monitors trained on known threats may underperform on unforeseen threats.

Abstract

With the increasing use of neural networks in critical systems, runtime monitoring becomes essential to reject unsafe predictions during inference. Various techniques have emerged to establish rejection scores that maximize the separability between the distributions of safe and unsafe predictions. The efficacy of these approaches is mostly evaluated using threshold-agnostic metrics, such as the area under the receiver operating characteristic curve. However, in real-world applications, an effective monitor also requires identifying a good threshold to transform these scores into meaningful binary decisions. Despite the pivotal importance of threshold optimization, this problem has received little attention. A few studies touch upon this question, but they typically assume that the runtime data distribution mirrors the training distribution, which is a strong assumption as monitors are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Explainable Artificial Intelligence (XAI)