Distance-Matrix Wasserstein Statistics for Scalable Gromov--Wasserstein Learning

Ao Xu; Tieru Wu

arXiv:2605.14981·cs.LG·May 15, 2026

Distance-Matrix Wasserstein Statistics for Scalable Gromov--Wasserstein Learning

Ao Xu, Tieru Wu

PDF

TL;DR

This paper introduces Distance-Matrix Wasserstein (DMW), a scalable and theoretically grounded method for approximating Gromov--Wasserstein distances using sampled distance matrices, with applications in graph and shape analysis.

Contribution

The authors propose DMW, a hierarchy of Wasserstein statistics that approximates GW distances efficiently, with proven bounds and scalable computation methods.

Findings

01

DMW provides a lower bound and relaxation of GW distances.

02

Finite-sample bounds depend on data manifold, not ambient dimension.

03

Experiments demonstrate DMW's effectiveness in graph classification and two-sample testing.

Abstract

Gromov--Wasserstein (GW) distances compare graphs, shapes, and point clouds through internal distances, without requiring a common coordinate system. This invariance is powerful, but discrete GW is a nonconvex quadratic optimal transport problem and is difficult to estimate at scale. We propose \emph{Distance-Matrix Wasserstein} (DMW), a hierarchy of Wasserstein statistics comparing laws of random finite distance matrices. Rather than optimizing a global point-level alignment, DMW samples $n$ points from each space, records their pairwise distances, and transports the resulting matrix laws. We prove that DMW is a relaxation and lower bound of GW, and establish a reverse approximation inequality: the GW--DMW gap is controlled by the Wasserstein error of approximating each original measure with $n$ samples. Hence population DMW converges to GW as sampled subspaces become dense. We further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.