A Distribution-Free Test of Independence and Its Application to Variable Selection
Hengjian Cui, Wei Zhong

TL;DR
This paper introduces a new distribution-free independence test based on mean variance index, suitable for high-dimensional data, with applications in gene selection and demonstrating superior power over existing methods.
Contribution
It proposes a novel mean variance based independence test that is distribution-free, efficient, and applicable to high-dimensional categorical-continuous variable analysis.
Findings
The test is distribution-free and invariant under transformations.
It has higher power than existing tests in simulations.
Successfully applied to identify significant genes in cancer data.
Abstract
Motivated by the importance of measuring the association between the response and predictors in high dimensional data, In this article, we propose a new mean variance test of independence between a categorical random variable and a continuous one based on mean variance index. The mean variance index is zero if and only if two variables are independent. Under the independence, we derive an explicit form of its asymptotic null distribution, which provides us with an efficient and fast way to compute the empirical p-value in practice. The number of classes of the categorical variable is allowed to diverge slowly to the infinity. It is essentially a rank test and thus distribution-free. No assumption on the distributions of two random variables is required and the test statistic is invariant under one-to-one transformations. It is resistent to heavy-tailed distributions and extreme values.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gene expression and cancer classification · Bayesian Methods and Mixture Models
