Confidence estimation of classification based on the distribution of the neural network output layer
Abdel Aziz Taha, Leonhard Hennig, Petr Knoth

TL;DR
This paper introduces novel confidence estimation methods for neural network classifiers that leverage the output logit distribution, improving prediction filtering without altering existing models.
Contribution
The authors propose new confidence estimation techniques based on logit distributions that require no modifications to neural networks and enhance prediction accuracy across various tasks.
Findings
Significant accuracy improvements in relation extraction, named entity recognition, and image classification.
Methods effectively filter predictions to maximize precision with minimal recall loss.
No changes needed to existing neural network architectures.
Abstract
One of the most common problems preventing the application of prediction models in the real world is lack of generalization: The accuracy of models, measured in the benchmark does repeat itself on future data, e.g. in the settings of real business. There is relatively little methods exist that estimate the confidence of prediction models. In this paper, we propose novel methods that, given a neural network classification model, estimate uncertainty of particular predictions generated by this model. Furthermore, we propose a method that, given a model and a confidence level, calculates a threshold that separates prediction generated by this model into two subsets, one of them meets the given confidence level. In contrast to other methods, the proposed methods do not require any changes on existing neural networks, because they simply build on the output logit layer of a common neural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
