A Flexible Quasi-Copula Distribution for Statistical Modeling
Sarah S. Ji, Benjamin B. Chu, Hua Zhou, Kenneth Lange

TL;DR
This paper introduces a new flexible distribution class that simplifies parameter estimation in complex correlated data models, demonstrated through a genome-wide association study.
Contribution
It derives a novel class of probability density functions enabling explicit moments and distributions, improving modeling of non-Gaussian longitudinal data.
Findings
Effective modeling of non-Gaussian longitudinal data
Successful application to genome-wide association analysis
Enhanced computational scalability
Abstract
Copulas, generalized estimating equations, and generalized linear mixed models promote the analysis of grouped data where non-normal responses are correlated. Unfortunately, parameter estimation remains challenging in these three frameworks. Based on prior work of Tonda, we derive a new class of probability density functions that allow explicit calculation of moments, marginal and conditional distributions, and the score and observed information needed in maximum likelihood estimation. We also illustrate how the new distribution flexibly models longitudinal data following a non-Gaussian distribution. Finally, we conduct a tri-variate genome-wide association analysis on dichotomized systolic and diastolic blood pressure and body mass index data from the UK-Biobank, showcasing the modeling prowess and computational scalability of the new distribution.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Distribution Estimation and Applications · Statistical Methods and Inference
