Distribution-free Measures of Association based on Optimal Transport
Nabarun Deb, Promit Ghosal, and Bodhisattva Sen

TL;DR
This paper introduces a new class of nonparametric, interpretable measures of association based on optimal transport and reproducing kernel Hilbert spaces, capable of testing independence with finite-sample guarantees.
Contribution
It develops a novel, distribution-free framework for measuring dependence and testing independence using geometric graphs and optimal transport, unifying and extending existing correlation measures.
Findings
Measures are 0 if and only if variables are independent.
Measures are 1 if and only if one variable is a measurable function of the other.
The proposed tests are exact, finite-sample distribution-free, and applicable in high dimensions.
Abstract
In this paper we propose and study a class of nonparametric, yet interpretable measures of association between two random vectors and taking values in and respectively (). These nonparametric measures -- defined using the theory of reproducing kernel Hilbert spaces coupled with optimal transport -- capture the strength of dependence between and and have the property that they are 0 if and only if the variables are independent and 1 if and only if one variable is a measurable function of the other. Further, these population measures can be consistently estimated using the general framework of geometric graphs which include -nearest neighbor graphs and minimum spanning trees. Additionally, these measures can also be readily used to construct an exact finite sample distribution-free test of mutual independence between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Probability and Risk Models · Nonlinear Differential Equations Analysis
