Explorations of the Softmax Space: Knowing When the Neural Network   Doesn't Know

Daniel Sikar; Artur d'Avila Garcez; Tillman Weyde

arXiv:2502.00456·cs.LG·May 1, 2025

Explorations of the Softmax Space: Knowing When the Neural Network Doesn't Know

Daniel Sikar, Artur d'Avila Garcez, Tillman Weyde

PDF

Open Access

TL;DR

This paper introduces a confidence measure for neural networks based on clustering softmax outputs, enabling the system to identify uncertain predictions and defer decisions, thereby improving reliability in critical applications.

Contribution

It proposes a novel clustering-based confidence measure using softmax vectors to determine when to defer predictions, applicable across different models and datasets.

Findings

01

Effective in identifying low-confidence predictions

02

Works consistently across datasets and models

03

Enables deferral of uncertain predictions to humans

Abstract

Ensuring the reliability of automated decision-making based on neural networks will be crucial as Artificial Intelligence systems are deployed more widely in critical situations. This paper proposes a new approach for measuring confidence in the predictions of any neural network that relies on the predictions of a softmax layer. We identify that a high-accuracy trained network may have certain outputs for which there should be low confidence. In such cases, decisions should be deferred and it is more appropriate for the network to provide a \textit{not known} answer to a corresponding classification task. Our approach clusters the vectors in the softmax layer to measure distances between cluster centroids and network outputs. We show that a cluster with centroid calculated simply as the mean softmax output for all correct predictions can serve as a suitable proxy in the evaluation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications