Universal Dependency Analysis
Hoang-Vu Nguyen, Jilles Vreeken

TL;DR
This paper introduces UDS, a universal, non-parametric correlation measure based on cumulative entropy, capable of assessing linear and non-linear correlations across any data subspace efficiently.
Contribution
The paper presents UDS, a novel correlation measure that is universal, non-parametric, and applicable to subspaces of any dimensionality, with an efficient computation method.
Findings
UDS outperforms existing correlation measures in experiments.
UDS effectively captures both linear and non-linear correlations.
The measure is applicable across diverse data distributions and types.
Abstract
Most data is multi-dimensional. Discovering whether any subset of dimensions, or subspaces, of such data is significantly correlated is a core task in data mining. To do so, we require a measure that quantifies how correlated a subspace is. For practical use, such a measure should be universal in the sense that it captures correlation in subspaces of any dimensionality and allows to meaningfully compare correlation scores across different subspaces, regardless how many dimensions they have and what specific statistical properties their dimensions possess. Further, it would be nice if the measure can non-parametrically and efficiently capture both linear and non-linear correlations. In this paper, we propose UDS, a multivariate correlation measure that fulfills all of these desiderata. In short, we define \uds based on cumulative entropy and propose a principled normalization scheme to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAffine Coupling · Normalizing Flows
