A unified framework for hard and soft clustering with regularized optimal transport
Jean-Fr\'ed\'eric Diebold, Nicolas Papadakis, Arnaud Dessein and, Charles-Alban Deledalle

TL;DR
This paper introduces a unified clustering framework using regularized optimal transport that encompasses hard and soft clustering, with a parameter controlling the nature of the clustering, and provides convergence analysis and empirical benefits.
Contribution
The paper presents a novel optimal transport-based clustering method that unifies hard and soft clustering, including a generalized EM algorithm with convergence guarantees.
Findings
Using b1>1 improves inference performance.
Approaching b1a0bca0bca0bca0bc enhances inference accuracy.
Approaching b1a0bca0bca0bca0bc improves classification.
Abstract
In this paper, we formulate the problem of inferring a Finite Mixture Model from discrete data as an optimal transport problem with entropic regularization of parameter . Our method unifies hard and soft clustering, the Expectation-Maximization (EM) algorithm being exactly recovered for . The family of clustering algorithm we propose rely on the resolution of nonconvex problems using alternating minimization. We study the convergence property of our generalized EM algorithms and show that each step in the minimization process has a closed form solution when inferring finite mixture models of exponential families. Experiments highlight the benefits of taking a parameter to improve the inference performance and for classification.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning and Algorithms · Markov Chains and Monte Carlo Methods
