Flowing Datasets with Wasserstein over Wasserstein Gradient Flows
Cl\'ement Bonet, Christophe Vauthier, Anna Korba

TL;DR
This paper introduces a novel framework for modeling and optimizing datasets as probability distributions over distributions using Wasserstein metrics, enabling advanced gradient flows for transfer learning and dataset distillation.
Contribution
It proposes the Wasserstein over Wasserstein (WoW) distance and gradient flows on this space, facilitating new techniques for dataset manipulation and learning tasks.
Findings
Effective transfer learning via WoW gradient flows
Improved dataset distillation using Wasserstein-based functionals
Novel tractable metrics for probability distributions over distributions
Abstract
Many applications in machine learning involve data represented as probability distributions. The emergence of such data requires radically novel techniques to design tractable gradient flows on probability distributions over this type of (infinite-dimensional) objects. For instance, being able to flow labeled datasets is a core task for applications ranging from domain adaptation to transfer learning or dataset distillation. In this setting, we propose to represent each class by the associated conditional distribution of features, and to model the dataset as a mixture distribution supported on these classes (which are themselves probability distributions), meaning that labeled datasets can be seen as probability distributions over probability distributions. We endow this space with a metric structure from optimal transport, namely the Wasserstein over Wasserstein (WoW) distance, derive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
