On PyTorch Implementation of Density Estimators for von Mises-Fisher and Its Mixture
Minyoung Kim

TL;DR
This paper provides a detailed PyTorch implementation of the von Mises-Fisher density model and its mixture, addressing numerical issues and demonstrating applications in synthetic data and image clustering.
Contribution
It offers practical recipes for training vMF models and mixtures, including solutions for high-dimensional Bessel function evaluation problems using arbitrary precision arithmetic.
Findings
Effective handling of Bessel function evaluation issues in high dimensions
Successful application of vMF models to synthetic data and image clustering
Provision of publicly available code for reproducibility
Abstract
The von Mises-Fisher (vMF) is a well-known density model for directional random variables. The recent surge of the deep embedding methodologies for high-dimensional structured data such as images or texts, aimed at extracting salient directional information, can make the vMF model even more popular. In this article, we will review the vMF model and its mixture, provide detailed recipes of how to train the models, focusing on the maximum likelihood estimators, in Python/PyTorch. In particular, implementation of vMF typically suffers from the notorious numerical issue of the Bessel function evaluation in the density normalizer, especially when the dimensionality is high, and we address the issue using the MPMath library that supports arbitrary precision. For the mixture learning, we provide both minibatch-based large-scale SGD learning, as well as the EM algorithm which is a full batch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
MethodsStochastic Gradient Descent
