Flowing Datasets with Wasserstein over Wasserstein Gradient Flows

Cl\'ement Bonet; Christophe Vauthier; Anna Korba

arXiv:2506.07534·cs.LG·June 10, 2025

Flowing Datasets with Wasserstein over Wasserstein Gradient Flows

Cl\'ement Bonet, Christophe Vauthier, Anna Korba

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel framework for modeling and optimizing datasets as probability distributions over distributions using Wasserstein metrics, enabling advanced gradient flows for transfer learning and dataset distillation.

Contribution

It proposes the Wasserstein over Wasserstein (WoW) distance and gradient flows on this space, facilitating new techniques for dataset manipulation and learning tasks.

Findings

01

Effective transfer learning via WoW gradient flows

02

Improved dataset distillation using Wasserstein-based functionals

03

Novel tractable metrics for probability distributions over distributions

Abstract

Many applications in machine learning involve data represented as probability distributions. The emergence of such data requires radically novel techniques to design tractable gradient flows on probability distributions over this type of (infinite-dimensional) objects. For instance, being able to flow labeled datasets is a core task for applications ranging from domain adaptation to transfer learning or dataset distillation. In this setting, we propose to represent each class by the associated conditional distribution of features, and to model the dataset as a mixture distribution supported on these classes (which are themselves probability distributions), meaning that labeled datasets can be seen as probability distributions over probability distributions. We endow this space with a metric structure from optimal transport, namely the Wasserstein over Wasserstein (WoW) distance, derive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clbonet/Flowing_Datasets_with_WoW_Gradient_Flows
jaxOfficial

Videos

Flowing Datasets with Wasserstein over Wasserstein Gradient Flows· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning