Relative-Translation Invariant Wasserstein Distance
Binshuai Wang, Qiwei Di, Ming Yin, Mengdi Wang, Quanquan Gu, Peng Wei

TL;DR
This paper introduces the relative-translation invariant Wasserstein distance ($RW_p$), a new metric for comparing probability distributions under translation shifts, with efficient computation methods and practical validation in real-world tasks.
Contribution
The paper proposes the $RW_p$ distance, extending optimal transport to be translation-invariant, and develops an efficient Sinkhorn algorithm for its computation, validated through experiments.
Findings
$RW_2$ is decomposable and translation-invariant.
The $RW_2$ Sinkhorn algorithm improves computational efficiency.
$RW_2$ is robust to distribution translations in practical tasks.
Abstract
We introduce a new family of distances, relative-translation invariant Wasserstein distances (), for measuring the similarity of two probability distributions under distribution shift. Generalizing it from the classical optimal transport model, we show that distances are also real distance metrics defined on the quotient set and invariant to distribution translations. When , the distance enjoys more exciting properties, including decomposability of the optimal transport model, translation-invariance of the distance, and a Pythagorean relationship between and the classical quadratic Wasserstein distance (). Based on these properties, we show that a distribution shift, measured by distance, can be explained in the bias-variance perspective. In addition, we propose a variant of the Sinkhorn algorithm,…
Peer Reviews
Decision·Submitted to ICLR 2025
- The authors propose relative translation optimal transport (ROT) which induces a Wasserstein like distance on the quotient space of probability measures induced by the translation relation. - For quadratic ROT, a decomposition in terms of Wasserstein distance (horizontal function) summed with a quadratic term including the shifting parameter $s$. - The authors test ROT approach on weather detection dataset. - I checked the proofs of the main results and they sound correct.
- In Definition 4, the relative transplantation invariant Wasserstein, $RW_p(\mu, \nu)$ is given with respect to an $s$-shifting of the source distribution $\mu$. However in Theorem 2, $RW_p$ is proper distance on the quotient set of shifting probabilities. Since the main results of the paper are based with respect to a shifting of the source, I’am wondering about the properties of $RW_p$ outside the quotient set. - The quadratic $RW_2$ is very closed to the translation property of Wasserstein
- The paper presents the properties of finding the Wasserstein distance when we are allowed to shift the distributions. - The paper also presents algorithms for approximating the $RW_p$.
- I think the proof of triangle inequality has some mistakes. My main concern is where you set $W_p(\eta, \eta')$ to 0 from line 2 to line 3. If $[W_p]$ refers to the Wasserstein distance between the classes of $\mu$ and $\nu$, then essentially $RW_p=[W_p]$ and from line 1 to line 2 of the Equation, you are assuming the triangle inequality holds for your distance. If $[W_p]$ just means the Wasserstein distance, then $W_p(\eta, \eta')$ might not be 0. (It seems reasonable that $RW_p$ is a metric,
- Novelty, paper presents a novel shifting-invariant Wasserstein-based metric, which extends the original Wasserstein distance to be invariant under shifting translations - Clarity and Mathematical Soundness, this paper is generally well-written with clear explanations. The authors provide solid mathematical proofs for the metric properties and the relation to Wasserstein distance. - Algorithm Development, this paper proposed two algorithm implementations for the $RW_p$ distances, the $RW_2$ Sin
- This paper does not provide an analysis of the convergence rates of the $RW_p$ distances as a distribution measure. - This paper lacks theoretical and experimental comparisons with Gromov-Wasserstein (GW) distance that has similar invariance properties. The GW distance is also translation-invariant and compares distributions based on the shapes, which makes it a good benchmark for comparison in the experiment section [1]. A recent work [2] proposed a robust p-Wasserstein distance (RPW), that
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFibroblast Growth Factor Research
MethodsSparse Evolutionary Training
