Confidence estimation of classification based on the distribution of the   neural network output layer

Abdel Aziz Taha; Leonhard Hennig; Petr Knoth

arXiv:2210.07745·cs.CL·October 19, 2022·5 cites

Confidence estimation of classification based on the distribution of the neural network output layer

Abdel Aziz Taha, Leonhard Hennig, Petr Knoth

PDF

Open Access

TL;DR

This paper introduces novel confidence estimation methods for neural network classifiers that leverage the output logit distribution, improving prediction filtering without altering existing models.

Contribution

The authors propose new confidence estimation techniques based on logit distributions that require no modifications to neural networks and enhance prediction accuracy across various tasks.

Findings

01

Significant accuracy improvements in relation extraction, named entity recognition, and image classification.

02

Methods effectively filter predictions to maximize precision with minimal recall loss.

03

No changes needed to existing neural network architectures.

Abstract

One of the most common problems preventing the application of prediction models in the real world is lack of generalization: The accuracy of models, measured in the benchmark does repeat itself on future data, e.g. in the settings of real business. There is relatively little methods exist that estimate the confidence of prediction models. In this paper, we propose novel methods that, given a neural network classification model, estimate uncertainty of particular predictions generated by this model. Furthermore, we propose a method that, given a model and a confidence level, calculates a threshold that separates prediction generated by this model into two subsets, one of them meets the given confidence level. In contrast to other methods, the proposed methods do not require any changes on existing neural networks, because they simply build on the output logit layer of a common neural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning