Fast and Large-Scale Unbalanced Optimal Transport via its Semi-Dual and Adaptive Gradient Methods
Ferdinand Genans (SU, LPSM (UMR\_8001))

TL;DR
This paper develops scalable gradient-based algorithms for unbalanced optimal transport by analyzing the semi-dual formulation, revealing favorable local geometry, and proposing adaptive methods with provable convergence rates for large-scale applications.
Contribution
It introduces an adaptive gradient approach for unbalanced optimal transport's semi-dual, demonstrating improved condition numbers and convergence rates for large-scale problems.
Findings
SGD achieves a convergence rate of O(n/εT] in stochastic regimes.
Local condition number scales as O(1/ε), independent of n.
Proposed adaptive Nesterov method attains near-optimal local complexity.
Abstract
Unbalanced Optimal Transport (UOT) has emerged as a robust relaxation of standard Optimal Transport, particularly effective for handling outliers and mass variations. However, scalable algorithms for UOT, specifically those based on Gradient Descent (SGD), remain largely underexplored. In this work, we address this gap by analyzing the semi-dual formulation of Entropic UOT and demonstrating its suitability for adaptive gradient methods. While the semi-dual is a standard tool for large-scale balanced OT, its geometry in the unbalanced setting appears ill-conditioned under standard analysis. Specifically, worst-case bounds on the marginal penalties using divergence suggest a condition number scaling with , implying poor scalability. In contrast, we show that the local condition number actually scales as , effectively removing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs · Markov Chains and Monte Carlo Methods
