Outside the Box: Abstraction-Based Monitoring of Neural Networks
Thomas A. Henzinger, Anna Lukina, Christian Schilling

TL;DR
This paper introduces an abstraction-based framework for monitoring neural networks at runtime by analyzing hidden layer behaviors to detect novel inputs, addressing limitations of confidence-based methods.
Contribution
It presents a novel approach using program analysis abstractions to monitor neural networks' hidden layers for improved novelty detection.
Findings
Effective detection of novel inputs on image classification benchmarks.
Robustness to variability in unknown classes.
Flexible trade-off between false warnings and detection accuracy.
Abstract
Neural networks have demonstrated unmatched performance in a range of classification tasks. Despite numerous efforts of the research community, novelty detection remains one of the significant limitations of neural networks. The ability to identify previously unseen inputs as novel is crucial for our understanding of the decisions made by neural networks. At runtime, inputs not falling into any of the categories learned during training cannot be classified correctly by the neural network. Existing approaches treat the neural network as a black box and try to detect novel inputs based on the confidence of the output predictions. However, neural networks are not trained to reduce their confidence for novel inputs, which limits the effectiveness of these approaches. We propose a framework to monitor a neural network by observing the hidden layers. We employ a common abstraction from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
