Measuring dependence between random vectors via optimal transport
Gilles Mordant, Johan Segers

TL;DR
This paper introduces Wasserstein distance-based coefficients to measure dependence between random vectors, providing new estimators with explicit asymptotic properties and applications to EEG data analysis.
Contribution
It proposes novel dependence coefficients based on 2-Wasserstein distance and quasi-Gaussian assumptions, with explicit formulas and estimators that are asymptotically normal.
Findings
New dependence coefficients are normalized between 0 and 1.
Estimators are asymptotically normal with explicit variance formulas.
Application to EEG data demonstrates practical utility.
Abstract
To quantify the dependence between two random vectors of possibly different dimensions, we propose to rely on the properties of the 2-Wasserstein distance. We first propose two coefficients that are based on the Wasserstein distance between the actual distribution and a reference distribution with independent components. The coefficients are normalized to take values between 0 and 1, where 1 represents the maximal amount of dependence possible given the two multivariate margins. We then make a quasi-Gaussian assumption that yields two additional coefficients rooted in the same ideas as the first two. These different coefficients are more amenable for distributional results and admit attractive formulas in terms of the joint covariance or correlation matrix. Furthermore, maximal dependence is proved to occur at the covariance matrix with minimal von Neumann entropy given the covariance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities · Advanced Statistical Methods and Models · Statistical Methods and Inference
