Money on the Table: Statistical information ignored by Softmax can improve classifier accuracy

Charles B. Delahunt; Courosh Mehanian; J. Nathan Kutz

arXiv:1901.09283·cs.LG·December 9, 2019·1 cites

TL;DR

This paper introduces a hybrid classifier that leverages the full class response distribution encoded in neural networks, improving accuracy by utilizing information ignored by the standard Softmax layer.

Contribution

The paper proposes a hybrid classifier, the Softmax-Pooling Hybrid (SPH), that improves neural network accuracy at test time by pooling over class response distributions, information that Softmax ignores.

Findings

01. SPH reduces test error by 6-23% across models.

02. SPH uses class response distributions to improve classification.

03. SPH works with already-trained models, without retraining.

Abstract

Softmax is a standard final layer used in Neural Nets (NNs) to summarize information encoded in the trained NN and return a prediction. However, Softmax leverages only a subset of the class-specific structure encoded in the trained model and ignores potentially valuable information: During training, models encode an array $D$ of class response distributions, where $D_{ij}$ is the distribution of the $j^{\text{th}}$ pre-Softmax readout neuron's responses to the $i^{\text{th}}$ class. Given a test sample, Softmax implicitly uses only the row of this array $D$ that corresponds to the readout neurons' responses to the sample's true class. Leveraging more of this array $D$ can improve classifier accuracy, because the likelihoods of two competing classes can be encoded in other rows of $D$. To explore this potential resource, we develop a hybrid classifier (Softmax-Pooling Hybrid, SPH) that uses…
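To make the array $D$ concrete, the sketch below estimates it from pre-Softmax readouts on a labeled set and compares a plain Softmax decision with a decision that consults every row of $D$. It is not the SPH method, whose description is truncated above. Summarizing each $D_{ij}$ by a mean and standard deviation, the function names `class_response_stats`, `softmax_predict`, and `row_match_predict`, and the mean-squared z-score rule are all assumptions made for illustration.

```python
import numpy as np

def class_response_stats(readouts, labels, n_classes):
    """Summarize D_ij by (mu[i, j], sigma[i, j]): the mean and standard
    deviation of readout neuron j over samples of class i.
    The Gaussian-style summary is an illustrative assumption."""
    mu = np.stack([readouts[labels == i].mean(axis=0) for i in range(n_classes)])
    sigma = np.stack([readouts[labels == i].std(axis=0) for i in range(n_classes)])
    return mu, sigma + 1e-8  # small offset avoids division by zero

def softmax_predict(readout):
    """Standard decision: the class with the largest Softmax probability."""
    e = np.exp(readout - readout.max())  # shift for numerical stability
    return int(np.argmax(e / e.sum()))

def row_match_predict(readout, mu, sigma):
    """Hypothetical alternative (not the paper's SPH): choose the class i
    whose row of D best explains the whole readout vector, scored by the
    mean squared z-score across readout neurons."""
    z = (readout[None, :] - mu) / sigma
    return int(np.argmin((z ** 2).mean(axis=1)))

# Synthetic stand-in for the pre-Softmax readouts of a trained network.
rng = np.random.default_rng(0)
n_classes, n_per_class = 3, 200
labels = np.repeat(np.arange(n_classes), n_per_class)
readouts = rng.normal(size=(labels.size, n_classes)) + 2.0 * np.eye(n_classes)[labels]

mu, sigma = class_response_stats(readouts, labels, n_classes)
sample = readouts[0]
pred_softmax = softmax_predict(sample)
pred_rows = row_match_predict(sample, mu, sigma)
```

Both predictors take the same readout vector, so a rule of this kind can be applied to an already-trained network without retraining. Only the statistics `mu` and `sigma` need to be computed, here from a labeled set held out for that purpose.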

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications

Methods: Softmax