Scalable Optimal Transport in High Dimensions for Graph Distances,   Embedding Alignment, and More

Johannes Gasteiger; Marten Lienen; Stephan G\"unnemann

arXiv:2107.06876·cs.LG·April 6, 2022·1 cites

Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More

Johannes Gasteiger, Marten Lienen, Stephan G\"unnemann

PDF

Open Access 1 Video

TL;DR

This paper introduces log-linear time approximations for optimal transport in high-dimensional spaces, enabling faster and more accurate applications in embedding alignment and graph distance regression.

Contribution

It proposes LSH-based sparse and Nyström-based locally corrected approximations for OT, improving scalability and accuracy in high-dimensional deep learning tasks.

Findings

01

Speeds up word embedding alignment by 3x and improves accuracy by 3.1 percentage points.

02

Proposes Graph Transport Network (GTN) that outperforms previous models by 48%.

03

Achieves log-linear scaling in graph distance regression.

Abstract

The current best practice for computing optimal transport (OT) is via entropy regularization and Sinkhorn iterations. This algorithm runs in quadratic time as it requires the full pairwise cost matrix, which is prohibitively expensive for large sets of objects. In this work we propose two effective log-linear time approximations of the cost matrix: First, a sparse approximation based on locality-sensitive hashing (LSH) and, second, a Nystr\"om approximation with LSH-based sparse corrections, which we call locally corrected Nystr\"om (LCN). These approximations enable general log-linear time algorithms for entropy-regularized OT that perform well even for the complex, high-dimensional spaces common in deep learning. We analyse these approximations theoretically and evaluate them experimentally both directly and end-to-end as a component for real-world applications. Using our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, and More· slideslive

Taxonomy

TopicsCaching and Content Delivery · Advanced Graph Neural Networks · Recommender Systems and Techniques

MethodsEntropy Regularization