Optimal and Diffusion Transports in Machine Learning

Gabriel Peyré

arXiv:2512.06797 · math.OC · December 9, 2025

TL;DR

This paper surveys methods that model the evolution of probability distributions in machine learning, focusing on diffusion and optimal transport techniques, and their applications in generative models, neural network training, and language models.

Contribution

It provides a unified overview of diffusion and optimal transport approaches, highlighting their mathematical structures, challenges, and applications in modern machine learning.

Findings

01. Diffusion methods underpin modern generative AI.

02. Optimal transport interpolates between distributions along paths that minimize displacement cost.

03. Both approaches apply to sampling, neural network optimization, and language model dynamics.
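
To make the second finding concrete, here is a minimal sketch in one dimension, where the optimal coupling for the squared-distance cost between two equal-size samples is obtained by sorting. The distributions, sample sizes, and the helper `interpolate` are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two empirical distributions with the same number of samples (illustrative choice).
source = rng.normal(loc=-2.0, scale=1.0, size=1000)
target = rng.normal(loc=3.0, scale=0.5, size=1000)

# In 1D, matching sorted samples is optimal for the squared-distance cost.
src_sorted = np.sort(source)
tgt_sorted = np.sort(target)


def interpolate(t):
    """Displacement interpolation: each particle moves on a straight line to its match."""
    return (1.0 - t) * src_sorted + t * tgt_sorted


# Mean squared displacement under the sorted (optimal) matching
# versus the arbitrary matching given by the sampling order.
ot_cost = np.mean((src_sorted - tgt_sorted) ** 2)
arbitrary_cost = np.mean((source - target) ** 2)

midpoint = interpolate(0.5)
print(f"sorted matching cost:    {ot_cost:.3f}")
print(f"arbitrary matching cost: {arbitrary_cost:.3f}")
print(f"midpoint mean:           {midpoint.mean():.3f}")
```

The sorted matching never costs more than the arbitrary one, and the intermediate samples returned by `interpolate` form a path of distributions between the two endpoints. In higher dimensions sorting is no longer available and a dedicated solver is needed.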

Abstract

Several problems in machine learning are naturally expressed as the design and analysis of time-evolving probability distributions. This includes sampling via diffusion methods, optimizing the weights of neural networks, and analyzing the evolution of token distributions across layers of large language models. While the targeted applications differ (samples, weights, tokens), their mathematical descriptions share a common structure. A key idea is to switch from the Eulerian representation of densities to their Lagrangian counterpart through vector fields that advect particles. This dual view introduces challenges, notably the non-uniqueness of Lagrangian vector fields, but also opportunities to craft density evolutions and flows with favorable properties in terms of regularity, stability, and computational tractability. This survey presents an overview of these methods, with emphasis on…
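
The Eulerian and Lagrangian viewpoints, and the non-uniqueness the abstract mentions, can be illustrated with a small particle simulation. The sketch below is an assumption-laden toy example, not the paper's method: it advects samples of a 2D standard Gaussian with two different vector fields, a pure contraction and the same contraction plus a rotation. The particles follow different trajectories, yet both clouds have approximately the same covariance, consistent with both fields inducing the same density evolution, because the rotation leaves an isotropic Gaussian unchanged. The function names, the rotation speed, and the explicit Euler integrator are all illustrative choices.

```python
import numpy as np


def advect(x0, velocity, t_end=1.0, n_steps=1000):
    """Move particles along dx/dt = velocity(x) with explicit Euler steps."""
    x = x0.copy()
    dt = t_end / n_steps
    for _ in range(n_steps):
        x = x + dt * velocity(x)
    return x


rng = np.random.default_rng(0)
x0 = rng.standard_normal((20000, 2))  # particles sampled from N(0, I)

# Generator of planar rotations; x @ J.T applies J to each particle (row).
J = np.array([[0.0, -1.0], [1.0, 0.0]])


def contraction(x):
    # v(x) = -x shrinks the cloud toward the origin.
    return -x


def contraction_plus_rotation(x):
    # Same contraction plus a rotation with (assumed) angular speed 3.
    return -x + 3.0 * x @ J.T


xa = advect(x0, contraction)
xb = advect(x0, contraction_plus_rotation)

# Eulerian summary: both clouds are close to N(0, exp(-2) I) at time 1.
cov_a = np.cov(xa.T)
cov_b = np.cov(xb.T)
print("covariance, contraction only:\n", cov_a)
print("covariance, with rotation:\n", cov_b)

# Lagrangian view: individual particles end up in different places.
print("mean distance between paired particles:", np.linalg.norm(xa - xb, axis=1).mean())
```

The freedom to add such density-preserving components is what allows one to select, among all fields producing a given density path, one with better regularity or lower transport cost.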

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Machine Learning in Materials Science