Data-Driven Representations for Testing Independence: Modeling, Analysis and Connection with Mutual Information Estimation
Mauricio E. Gonzalez, Jorge F. Silva, Miguel Videla, and Marcos E., Orchard

TL;DR
This paper introduces a data-driven partitioning approach for testing independence between continuous variables, connecting it with mutual information estimation and providing theoretical guarantees and experimental validation.
Contribution
It proposes a novel data-dependent tree-structured partition method for independence testing, linking it with mutual information estimation and establishing consistency and complexity bounds.
Findings
The TSP scheme achieves consistent independence testing.
Finite-sample bounds demonstrate the method's effectiveness.
Experimental results show advantages over non-data-driven strategies.
Abstract
This work addresses testing the independence of two continuous and finite-dimensional random variables from the design of a data-driven partition. The empirical log-likelihood statistic is adopted to approximate the sufficient statistics of an oracle test against independence (that knows the two hypotheses). It is shown that approximating the sufficient statistics of the oracle test offers a learning criterion for designing a data-driven partition that connects with the problem of mutual information estimation. Applying these ideas in the context of a data-dependent tree-structured partition (TSP), we derive conditions on the TSP's parameters to achieve a strongly consistent distribution-free test of independence over the family of probabilities equipped with a density. Complementing this result, we present finite-length results that show our TSP scheme's capacity to detect the scenario…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
