Quantitative Convergence of Wasserstein Gradient Flows of Kernel Mean Discrepancies
L\'ena\"ic Chizat, Maria Colombo, Roberto Colombo, Xavier Fern\'andez-Real

TL;DR
This paper analyzes the convergence rates of Wasserstein gradient flows of Kernel Mean Discrepancy functionals, with applications to neural network training dynamics and particle systems, providing explicit rates and numerical validation.
Contribution
It establishes the first quantitative convergence rates for Wasserstein gradient flows of KMD functionals, including neural network training and particle systems, with explicit polynomial and exponential rates.
Findings
Exponential convergence for s=1 under minimal assumptions
Polynomial convergence rates for s>1 depending on regularity
Numerical experiments validating theoretical rates
Abstract
We study the quantitative convergence of Wasserstein gradient flows of Kernel Mean Discrepancy (KMD) (also known as Maximum Mean Discrepancy (MMD)) functionals. Our setting covers in particular the training dynamics of shallow neural networks in the infinite-width and continuous time limit, as well as interacting particle systems with pairwise Riesz kernel interaction in the mean-field and overdamped limit. Our main analysis concerns the model case of KMD functionals given by the squared Sobolev distance for any and a fixed probability measure on the -dimensional torus. First, inspired by Yudovich theory for the -Euler equation, we establish existence and uniqueness in natural weak regularity classes. Next, we show that for the flow converges globally at an exponential rate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Approximation and Integration · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques
