Sliced Wasserstein Distance for Learning Gaussian Mixture Models

Soheil Kolouri; Gustavo K. Rohde; Heiko Hoffmann

arXiv:1711.05376·cs.CV·November 17, 2017

Sliced Wasserstein Distance for Learning Gaussian Mixture Models

Soheil Kolouri, Gustavo K. Rohde, Heiko Hoffmann

PDF

2 Repos

TL;DR

This paper introduces a novel GMM estimation method using sliced Wasserstein distance, which offers better robustness and fidelity in high-dimensional data compared to traditional EM algorithms.

Contribution

It proposes a new GMM parameter estimation approach based on sliced Wasserstein distance, improving robustness and high-dimensional data modeling.

Findings

01

More robust to random initializations

02

Better high-dimensional data distribution estimation

03

Energy landscape is more well-behaved

Abstract

Gaussian mixture models (GMM) are powerful parametric tools with many applications in machine learning and computer vision. Expectation maximization (EM) is the most popular algorithm for estimating the GMM parameters. However, EM guarantees only convergence to a stationary point of the log-likelihood function, which could be arbitrarily worse than the optimal solution. Inspired by the relationship between the negative log-likelihood function and the Kullback-Leibler (KL) divergence, we propose an alternative formulation for estimating the GMM parameters using the sliced Wasserstein distance, which gives rise to a new algorithm. Specifically, we propose minimizing the sliced-Wasserstein distance between the mixture model and the data distribution with respect to the GMM parameters. In contrast to the KL-divergence, the energy landscape for the sliced-Wasserstein distance is more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.