Robust First and Second-Order Differentiation for Regularized Optimal Transport
Xingjie Li, Fei Lu, Molei Tao, Felix X.-F. Ye

TL;DR
This paper develops a stable, scalable method for computing first and second-order derivatives of regularized optimal transport distances, enabling faster and more accurate optimization in applications like shuffled regression.
Contribution
It provides the first analytical derivation of the Hessian for entropic OT and addresses numerical instability issues, facilitating efficient second-order optimization methods.
Findings
Second-order methods converge much faster than first-order methods.
Analytical Hessian computation improves stability and scalability.
Enhanced accuracy in parameter estimation for shuffled regression.
Abstract
Applications such as unbalanced and fully shuffled regression can be approached by optimizing regularized optimal transport (OT) distances, such as the entropic OT and Sinkhorn distances. A common approach for this optimization is to use a first-order optimizer, which requires the gradient of the OT distance. For faster convergence, one might also resort to a second-order optimizer, which additionally requires the Hessian. The computations of these derivatives are crucial for efficient and accurate optimization. However, they present significant challenges in terms of memory consumption and numerical instability, especially for large datasets and small regularization strengths. We circumvent these issues by analytically computing the gradients for OT distances and the Hessian for the entropic OT distance, which was not previously used due to intricate tensor-wise calculations and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical methods in inverse problems · Spectral Theory in Mathematical Physics
