Sliced Inner Product Gromov-Wasserstein Distances

Xiaoyun Gong; Gabriel Rioux; Ziv Goldfeld

arXiv:2605.08546·stat.ML·May 12, 2026

Sliced Inner Product Gromov-Wasserstein Distances

Xiaoyun Gong, Gabriel Rioux, Ziv Goldfeld

PDF

TL;DR

This paper introduces a sliced inner product Gromov-Wasserstein distance that improves scalability and invariance properties for aligning high-dimensional heterogeneous datasets, with applications in text clustering and language models.

Contribution

It proposes a novel sliced IGW distance with rotational invariance and provides theoretical and computational analysis for high-dimensional data alignment.

Findings

01

Validated the theoretical properties through numerical experiments.

02

Demonstrated effectiveness in heterogeneous text data clustering.

03

Applied to compare language model representations.

Abstract

The Gromov-Wasserstein (GW) problem provides a framework for aligning heterogeneous datasets by matching their intrinsic geometry, but its statistical and computational scaling remains an issue for high-dimensional problems. Slicing techniques offer an appealing route to scalability, but, unlike Wasserstein distances, GW problems do not generally admit closed-form solutions in one-dimension. We resolve this problem for the GW problem with inner product cost (IGW), propose a sliced IGW distance that enjoys a natural rotational invariance property, and comprehensively study its structural and computational properties. Numerical experiments validating our theory are presented, followed by applications to heterogeneous clustering of text data and language model representation comparison.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.