TL;DR
This paper introduces IsoMax loss, a novel isotropic, distance-based loss function that replaces SoftMax to improve out-of-distribution detection in neural networks without sacrificing accuracy or efficiency.
Contribution
The paper proposes IsoMax loss, a new loss function that enhances OOD detection by producing high entropy outputs without changing model architecture or training procedures.
Findings
IsoMax loss improves OOD detection performance.
Models with IsoMax loss maintain accuracy and inference speed.
No additional data or hyperparameter tuning needed.
Abstract
In this paper, we argue that the unsatisfactory out-of-distribution (OOD) detection performance of neural networks is mainly due to the SoftMax loss anisotropy and propensity to produce low entropy probability distributions in disagreement with the principle of maximum entropy. Current out-of-distribution (OOD) detection approaches usually do not directly fix the SoftMax loss drawbacks, but rather build techniques to circumvent it. Unfortunately, those methods usually produce undesired side effects (e.g., classification accuracy drop, additional hyperparameters, slower inferences, and collecting extra data). In the opposite direction, we propose replacing SoftMax loss with a novel loss function that does not suffer from the mentioned weaknesses. The proposed IsoMax loss is isotropic (exclusively distance-based) and provides high entropy posterior probability distributions. Replacing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax
