Variance partitioning in multilevel models for count data
George Leckie, William Browne, Harvey Goldstein, Juan Merlo, Peter, Austin

TL;DR
This paper derives exact variance partitioning coefficients for multilevel count data models, including negative binomial and three-level models, aiding researchers in understanding clustering effects.
Contribution
It introduces new exact VPC/ICC formulas for flexible negative binomial and multi-level count models, addressing a gap in current statistical methodology.
Findings
Derived exact VPC/ICC formulas for negative binomial models.
Extended formulas to three-level and random-coefficient models.
Applied methods to student absenteeism data.
Abstract
A first step when fitting multilevel models to continuous responses is to explore the degree of clustering in the data. Researchers fit variance-component models and then report the proportion of variation in the response that is due to systematic differences between clusters. Equally they report the response correlation between units within a cluster. These statistics are popularly referred to as variance partition coefficients (VPCs) and intraclass correlation coefficients (ICCs). When fitting multilevel models to categorical (binary, ordinal, or nominal) and count responses, these statistics prove more challenging to calculate. For categorical response models, researchers appeal to their latent response formulations and report VPCs/ICCs in terms of latent continuous responses envisaged to underly the observed categorical responses. For standard count response models, however, there…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Urban, Neighborhood, and Segregation Studies · Psychometric Methodologies and Testing
