von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning
Tyler R. Scott, Andrew C. Gallagher, Michael C. Mozer

TL;DR
This paper empirically compares different embedding geometries for softmax classification losses, introduces a von Mises-Fisher based probabilistic classifier, and offers guidance on selecting optimal loss functions for various tasks.
Contribution
It systematically evaluates embedding geometries for softmax losses and proposes a novel von Mises-Fisher loss that improves calibration and competitiveness.
Findings
Spherical losses exhibit unique properties leading to a new probabilistic classifier.
The von Mises-Fisher loss is competitive with state-of-the-art methods.
The paper provides practical guidance on choosing embedding geometries and losses.
Abstract
Recent work has argued that classification losses utilizing softmax cross-entropy are superior not only for fixed-set classification tasks, but also by outperforming losses developed specifically for open-set tasks including few-shot learning and retrieval. Softmax classifiers have been studied using different embedding geometries -- Euclidean, hyperbolic, and spherical -- and claims have been made about the superiority of one or another, but they have not been systematically compared with careful controls. We conduct an empirical investigation of embedding geometry on softmax losses for a variety of fixed-set classification and image retrieval tasks. An interesting property observed for the spherical losses lead us to propose a probabilistic classifier based on the von Mises-Fisher distribution, and we show that it is competitive with state-of-the-art methods while producing improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Statistical Methods and Inference
MethodsSoftmax
