Permutation-Based Rank Test in the Presence of Discretization and Application in Causal Discovery with Mixed Data
Xinshuai Dong, Ignavier Ng, Boyang Sun, Haoyue Dai, Guang-Yuan Hao, Shunxing Fan, Peter Spirtes, Yumou Qiu, Kun Zhang

TL;DR
This paper introduces a permutation-based rank test designed for mixed data with discretized variables, enabling accurate causal discovery while controlling error rates.
Contribution
It proposes the MPRT method that accounts for discretization effects, extending rank tests to practical scenarios with mixed and discretized data.
Findings
Effectively controls Type I error with discretized data
Validated on synthetic and real-world datasets
Enhances causal discovery accuracy
Abstract
Recent advances have shown that statistical tests for the rank of cross-covariance matrices play an important role in causal discovery. These rank tests include partial correlation tests as special cases and provide further graphical information about latent variables. Existing rank tests typically assume that all the continuous variables can be perfectly measured, and yet, in practice many variables can only be measured after discretization. For example, in psychometric studies, the continuous level of certain personality dimensions of a person can only be measured after being discretized into order-preserving options such as disagree, neutral, and agree. Motivated by this, we propose Mixed data Permutation-based Rank Test (MPRT), which properly controls the statistical errors even when some or all variables are discretized. Theoretically, we establish the exchangeability and estimate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Bayesian Modeling and Causal Inference · Data Mining Algorithms and Applications
