On the Optimization Landscape of Maximum Mean Discrepancy

Itai Alon; Amir Globerson; Ami Wiesel

arXiv:2110.13452·cs.LG·May 7, 2024

On the Optimization Landscape of Maximum Mean Discrepancy

Itai Alon, Amir Globerson, Ami Wiesel

PDF

Open Access

TL;DR

This paper analyzes the optimization landscape of Maximum Mean Discrepancy (MMD) in generative models, showing that in certain cases the landscape is benign and allows for global optimization via gradient methods.

Contribution

It provides the first theoretical analysis demonstrating conditions under which MMD-based generative models can be globally optimized despite non-convexity.

Findings

01

MMD landscape is benign for low-rank Gaussian distributions.

02

Gradient methods can globally minimize MMD in these cases.

03

Analysis extends to mixtures of Gaussians.

Abstract

Generative models have been successfully used for generating realistic signals. Because the likelihood function is typically intractable in most of these models, the common practice is to use "implicit" models that avoid likelihood calculation. However, it is hard to obtain theoretical guarantees for such models. In particular, it is not understood when they can globally optimize their non-convex objectives. Here we provide such an analysis for the case of Maximum Mean Discrepancy (MMD) learning of generative models. We prove several optimality results, including for a Gaussian distribution with low rank covariance (where likelihood is inapplicable) and a mixture of Gaussians. Our analysis shows that that the MMD optimization landscape is benign in these cases, and therefore gradient based methods will globally minimize the MMD objective.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Machine Learning and Data Classification