Implicit Bias of the JKO Scheme
Peter Halmos, Boris Hanin

TL;DR
This paper analyzes the implicit bias of the JKO scheme in Wasserstein gradient flows, revealing second-order modifications that influence the scheme's convergence and stability, with implications for understanding functional biases in probability measure optimization.
Contribution
It characterizes the second-order implicit bias of the JKO scheme, showing how it modifies the energy functional and affects the flow's behavior, providing new insights into its stability and convergence properties.
Findings
JKO scheme approximates Wasserstein gradient flow with second-order accuracy using a modified energy functional.
Implicit bias of the scheme involves adding a curvature-dependent term to the energy functional.
Numerical examples demonstrate the impact of the second-order bias on Langevin dynamics and sampling tasks.
Abstract
Wasserstein gradient flow provides a general framework for minimizing an energy functional over the space of probability measures on a Riemannian manifold . Its canonical time-discretization, the Jordan-Kinderlehrer-Otto (JKO) scheme, produces for any step size a sequence of probability distributions that approximate to first order in Wasserstein gradient flow on . But the JKO scheme also has many other remarkable properties not shared by other first order integrators, e.g. it preserves energy dissipation and exhibits unconditional stability for -geodesically convex functionals . To better understand the JKO scheme we characterize its implicit bias at second order in . We show that are approximated to order by Wasserstein gradient flow on a modified energy \[ J^{\eta}(\rho) = J(\rho) -…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometric Analysis and Curvature Flows · Statistical Mechanics and Entropy · Stochastic Gradient Optimization Techniques
