Detecting Misclassification Errors in Neural Networks with a Gaussian Process Model
Xin Qiu, Risto Miikkulainen

TL;DR
This paper introduces RED, a Gaussian Process-based framework for detecting misclassification errors in neural networks, improving reliability and interpretability of confidence scores in real-world applications.
Contribution
The paper proposes a novel error detection framework, RED, that estimates uncertainty using Gaussian Processes, enhancing detection accuracy and interpretability over existing metrics.
Findings
RED outperforms other error detection methods on 125 UCI datasets.
The approach is effective with various classifiers and deep learning architectures.
RED helps understand the origin of errors, including out-of-distribution and adversarial samples.
Abstract
As neural network classifiers are deployed in real-world applications, it is crucial that their failures can be detected reliably. One practical solution is to assign confidence scores to each prediction, then use these scores to filter out possible misclassifications. However, existing confidence metrics are not yet sufficiently reliable for this role. This paper presents a new framework that produces a quantitative metric for detecting misclassification errors. This framework, RED, builds an error detector on top of the base classifier and estimates uncertainty of the detection scores using Gaussian Processes. Experimental comparisons with other error detection methods on 125 UCI datasets demonstrate that this approach is effective. Further implementations on two probabilistic base classifiers and two large deep learning architecture in vision tasks further confirm that the method is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
