Fisher exact scanning for dependency
Li Ma, Jialiang Mao

TL;DR
Fisher exact scanning (FES) is a novel method that extends Fisher's exact test to larger contingency tables and continuous data, enabling efficient, exact dependency testing and characterization in massive datasets without resampling.
Contribution
FES introduces a scalable, exact dependency testing framework using a multivariate hypergeometric model and Bayesian network representation, eliminating resampling and enabling effective multiple testing correction.
Findings
FES provides exact p-value distributions for dependency tests.
FES demonstrates linear computational complexity with sample size.
FES outperforms existing methods in statistical and computational efficiency.
Abstract
We introduce a method---called Fisher exact scanning (FES)---for testing and identifying variable dependency that generalizes Fisher's exact test on contingency tables to contingency tables and continuous sample spaces. FES proceeds through scanning over the sample space using windows in the form of tables of various sizes, and on each window completing a Fisher's exact test. Based on a factorization of Fisher's multivariate hypergeometric (MHG) likelihood into the product of the univariate hypergeometric likelihoods, we show that there exists a coarse-to-fine, sequential generative representation for the MHG model in the form of a Bayesian network, which in turn implies the mutual independence (up to deviation due to discreteness) among the Fisher's exact tests completed under FES. This allows an exact characterization of the joint null distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Bayesian Modeling and Causal Inference · Bayesian Methods and Mixture Models
