TL;DR
This paper develops new statistical tests for independence that account for biased sampling mechanisms, such as truncation and censoring, improving detection power in complex real-world data scenarios.
Contribution
It generalizes quasi-independence to biased sampling contexts and introduces two novel tests based on Hoeffding's statistic with bootstrap and permutation methods.
Findings
Tests perform well in simulations for biased sampling scenarios.
Methods improve power over existing approaches.
Validated on real datasets with truncation and bias mechanisms.
Abstract
Testing for association or dependence between pairs of random variables is a fundamental problem in statistics. In some applications, data are subject to selection bias that causes dependence between observations even when it is absent from the population. An important example is truncation models, in which observed pairs are restricted to a specific subset of the X-Y plane. Standard tests for independence are not suitable in such cases, and alternative tests that take the selection bias into account are required. To deal with this issue, we generalize the notion of quasi-independence with respect to the sampling mechanism, and study the problem of detecting any deviations from it. We develop two test statistics motivated by the classic Hoeffding's statistic, and use two approaches to compute their distribution under the null: (i) a bootstrap-based approach, and (ii) a permutation-test…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
