DropMax: Adaptive Variational Softmax
Hae Beom Lee, Juho Lee, Saehoon Kim, Eunho Yang, Sung Ju Hwang

TL;DR
DropMax introduces an adaptive stochastic softmax classifier that drops non-target classes during training, improving accuracy by focusing on confusing classes through learned dropout probabilities.
Contribution
It presents DropMax, a novel stochastic softmax method with learned adaptive dropout for non-target classes, enhancing classification performance.
Findings
Significantly improved accuracy over standard softmax.
Learns to focus on confusing classes via adaptive dropout.
Builds an ensemble of classifiers through stochastic regularization.
Abstract
We propose DropMax, a stochastic version of softmax classifier which at each iteration drops non-target classes according to dropout probabilities adaptively decided for each instance. Specifically, we overlay binary masking variables over class output probabilities, which are input-adaptively learned via variational inference. This stochastic regularization has an effect of building an ensemble classifier out of exponentially many classifiers with different decision boundaries. Moreover, the learning of dropout rates for non-target classes on each instance allows the classifier to focus more on classification against the most confusing classes. We validate our model on multiple public datasets for classification, on which it obtains significantly improved accuracy over the regular softmax classifier and other baselines. Further analysis of the learned dropout probabilities shows that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Data Stream Mining Techniques · Anomaly Detection Techniques and Applications
MethodsSoftmax · Dropout
