Modeling Zero-Inflated Correlated Dental Data through Gaussian Copulas and Approximate Bayesian Computation
Anish Mukherjee, Jeremy T. Gaskins, Shoumi Sarkar, Steven, Levy, Somnath Datta

TL;DR
This paper introduces a novel statistical model combining zero-inflated count data regression with Gaussian copulas and approximate Bayesian computation to analyze correlated longitudinal dental data, revealing insights into caries risk factors.
Contribution
It develops a new spatio-temporal zero-inflated count model with population-level interpretation using Gaussian copulas and ABC inference, applied to dental health data.
Findings
Identified significant covariate effects on dental caries.
Characterized the correlation structure across teeth and time.
Provided insights into risk factors for dental caries.
Abstract
We develop a new longitudinal count data regression model that accounts for zero-inflation and spatio-temporal correlation across responses. This project is motivated by an analysis of Iowa Fluoride Study (IFS) data, a longitudinal cohort study with data on caries (cavity) experience scores measured for each tooth across five time points. To that end, we use a hurdle model for zero-inflation with two parts: the presence model indicating whether a count is non-zero through logistic regression and the severity model that considers the non-zero counts through a shifted Negative Binomial distribution allowing overdispersion. To incorporate dependence across measurement occasion and teeth, these marginal models are embedded within a Gaussian copula that introduces spatio-temporal correlations. A distinct advantage of this formulation is that it allows us to determine covariate effects with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
