Learning Generative Models with Sinkhorn Divergences
Aude Genevay, Gabriel Peyr\'e, Marco Cuturi

TL;DR
This paper introduces a computationally feasible method for training large-scale generative models using Sinkhorn divergences, which are smoothed optimal transport losses that improve stability and efficiency.
Contribution
It presents the first practical approach to optimize generative models with OT-based losses by combining entropic smoothing and automatic differentiation of Sinkhorn iterations.
Findings
Enables GPU-accelerated training of generative models with OT losses.
Provides a family of losses interpolating between Wasserstein and MMD.
Improves stability and computational efficiency in high-dimensional settings.
Abstract
The ability to compare two degenerate probability distributions (i.e. two probability distributions supported on two distinct low-dimensional manifolds living in a much higher-dimensional space) is a crucial problem arising in the estimation of generative models for high-dimensional observations such as those arising in computer vision or natural language. It is known that optimal transport metrics can represent a cure for this problem, since they were specifically designed as an alternative to information divergences to handle such problematic scenarios. Unfortunately, training generative machines using OT raises formidable computational and statistical challenges, because of (i) the computational burden of evaluating OT losses, (ii) the instability and lack of smoothness of these losses, (iii) the difficulty to estimate robustly these losses and their gradients in high dimension. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Face recognition and analysis
