Money on the Table: Statistical information ignored by Softmax can improve classifier accuracy
Charles B. Delahunt, Courosh Mehanian, J. Nathan Kutz

TL;DR
This paper introduces a hybrid classifier that leverages the full class response distribution encoded in neural networks, improving accuracy by utilizing information ignored by the standard Softmax layer.
Contribution
The paper proposes a novel hybrid classifier, SPH, that enhances neural network accuracy by pooling class response distributions during testing, exploiting information ignored by Softmax.
Findings
SPH reduces test error by 6-23% across models.
Utilizes class response distributions for improved classification.
Works with trained models without retraining.
Abstract
Softmax is a standard final layer used in Neural Nets (NNs) to summarize information encoded in the trained NN and return a prediction. However, Softmax leverages only a subset of the class-specific structure encoded in the trained model and ignores potentially valuable information: During training, models encode an array of class response distributions, where is the distribution of the pre-Softmax readout neuron's responses to the class. Given a test sample, Softmax implicitly uses only the row of this array that corresponds to the readout neurons' responses to the sample's true class. Leveraging more of this array can improve classifier accuracy, because the likelihoods of two competing classes can be encoded in other rows of . To explore this potential resource, we develop a hybrid classifier (Softmax-Pooling Hybrid, ) that uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
MethodsSoftmax
