Implicit Bias of the JKO Scheme

Peter Halmos; Boris Hanin

arXiv:2511.14827·stat.ML·March 5, 2026

Implicit Bias of the JKO Scheme

Peter Halmos, Boris Hanin

PDF

Open Access

TL;DR

This paper analyzes the implicit bias of the JKO scheme in Wasserstein gradient flows, revealing second-order modifications that influence the scheme's convergence and stability, with implications for understanding functional biases in probability measure optimization.

Contribution

It characterizes the second-order implicit bias of the JKO scheme, showing how it modifies the energy functional and affects the flow's behavior, providing new insights into its stability and convergence properties.

Findings

01

JKO scheme approximates Wasserstein gradient flow with second-order accuracy using a modified energy functional.

02

Implicit bias of the scheme involves adding a curvature-dependent term to the energy functional.

03

Numerical examples demonstrate the impact of the second-order bias on Langevin dynamics and sampling tasks.

Abstract

Wasserstein gradient flow provides a general framework for minimizing an energy functional $J$ over the space of probability measures on a Riemannian manifold $(M, g)$ . Its canonical time-discretization, the Jordan-Kinderlehrer-Otto (JKO) scheme, produces for any step size $η > 0$ a sequence of probability distributions $ρ_{k}^{η}$ that approximate to first order in $η$ Wasserstein gradient flow on $J$ . But the JKO scheme also has many other remarkable properties not shared by other first order integrators, e.g. it preserves energy dissipation and exhibits unconditional stability for $λ$ -geodesically convex functionals $J$ . To better understand the JKO scheme we characterize its implicit bias at second order in $η$ . We show that $ρ_{k}^{η}$ are approximated to order $η^{2}$ by Wasserstein gradient flow on a modified energy \[ J^{\eta}(\rho) = J(\rho) -…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeometric Analysis and Curvature Flows · Statistical Mechanics and Entropy · Stochastic Gradient Optimization Techniques