Unsupervised Ground Metric Learning using Wasserstein Singular Vectors

Geert-Jan Huizing; Laura Cantini; Gabriel Peyr\'e

arXiv:2102.06278·stat.ML·July 20, 2022

Unsupervised Ground Metric Learning using Wasserstein Singular Vectors

Geert-Jan Huizing, Laura Cantini, Gabriel Peyr\'e

PDF

Open Access 1 Repo

TL;DR

This paper introduces an unsupervised method to learn meaningful ground metrics for optimal transport by computing Wasserstein singular vectors, enabling data-driven analysis without labeled data.

Contribution

It proposes a novel unsupervised approach to simultaneously compute sample and feature distances as Wasserstein singular vectors, with scalable algorithms for high-dimensional data.

Findings

01

Successfully applied to single-cell RNA-sequencing data

02

Provides criteria for existence and uniqueness of singular vectors

03

Offers scalable methods for high-dimensional OT computations

Abstract

Defining meaningful distances between samples in a dataset is a fundamental problem in machine learning. Optimal Transport (OT) lifts a distance between features (the "ground metric") to a geometrically meaningful distance between samples. However, there is usually no straightforward choice of ground metric. Supervised ground metric learning approaches exist but require labeled data. In absence of labels, only ad-hoc ground metrics remain. Unsupervised ground metric learning is thus a fundamental problem to enable data-driven applications of OT. In this paper, we propose for the first time a canonical answer by simultaneously computing an OT distance between samples and between features of a dataset. These distance matrices emerge naturally as positive singular vectors of the function mapping ground metrics to OT distances. We provide criteria to ensure the existence and uniqueness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gjhuizing/wsingular
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSingle-cell and spatial transcriptomics · Generative Adversarial Networks and Image Synthesis · Stochastic Gradient Optimization Techniques