Generalized Goodness-Of-Fit Tests for Correlated Data
Hong Zhang, Zheyang Wu

TL;DR
This paper develops generalized goodness-of-fit tests for correlated data, introducing a novel p-value calculation, data transformation techniques, and a robust testing strategy called digGOF, with applications demonstrated in genetic studies.
Contribution
It introduces the digGOF testing strategy that combines double-adaptation and innovated transformation within the gGOF framework for correlated data analysis.
Findings
IT enhances detection of sparse signals in correlated data
digGOF shows robust performance across various correlation structures
Methods are validated through simulations and genetic studies
Abstract
This paper concerns the problem of applying the generalized goodness-of-fit (gGOF) type tests for analyzing correlated data. The gGOF family broadly covers the maximum-based testing procedures by ordered input -values, such as the false discovery rate procedure, the Kolmogorov-Smirnov type statistics, the -divergence family, etc. Data analysis framework and a novel -value calculation approach is developed under the Gaussian mean model and the generalized linear model (GLM). We reveal the influence of data transformations to the signal-to-noise ratio and the statistical power under both sparse and dense signal patterns and various correlation structures. In particular, the innovated transformation (IT), which is shown equivalent to the marginal model-fitting under the GLM, is often preferred for detecting sparse signals in correlated data. We propose a testing strategy called…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Genetic Associations and Epidemiology · Statistical Methods in Clinical Trials
