Optimal and Diffusion Transports in Machine Learning
Gabriel Peyr\'e

TL;DR
This paper surveys methods that model the evolution of probability distributions in machine learning, focusing on diffusion and optimal transport techniques, and their applications in generative models, neural network training, and language models.
Contribution
It provides a unified overview of diffusion and optimal transport approaches, highlighting their mathematical structures, challenges, and applications in modern machine learning.
Findings
Diffusion methods underpin modern generative AI.
Optimal transport minimizes displacement cost in distribution interpolation.
Both approaches are applicable to sampling, neural network optimization, and language model dynamics.
Abstract
Several problems in machine learning are naturally expressed as the design and analysis of time-evolving probability distributions. This includes sampling via diffusion methods, optimizing the weights of neural networks, and analyzing the evolution of token distributions across layers of large language models. While the targeted applications differ (samples, weights, tokens), their mathematical descriptions share a common structure. A key idea is to switch from the Eulerian representation of densities to their Lagrangian counterpart through vector fields that advect particles. This dual view introduces challenges, notably the non-uniqueness of Lagrangian vector fields, but also opportunities to craft density evolutions and flows with favorable properties in terms of regularity, stability, and computational tractability. This survey presents an overview of these methods, with emphasis on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Machine Learning in Materials Science
