A comparison of methods for the analysis of binomial proportion data in behavioral research
Alberto Ferrari, Mario Comelli

TL;DR
This paper evaluates various statistical methods for analyzing binomial proportion data in behavioral research, comparing their power, error rates, and applicability through simulations and real data applications.
Contribution
It provides a comprehensive comparison of linear, Poisson, beta-binomial, and GLMM methods, offering practical guidelines for researchers analyzing clustered binary data.
Findings
GLMMs and beta-binomial regression are highly effective for clustered binary data.
Linear regression can be reliable for hypothesis testing despite non-normality.
Poisson regression often suffers from model misspecification when used for proportions.
Abstract
In behavioral and psychiatric research, data consisting of a per-subject proportion of "successes" and "failures" over a finite number of trials often arise. This kind of clustered binary data are usually non-normally distributed, which can cause issues with parameter estimation and predictions if the usual general linear model is applied and sample size is small. Here we studied the performances of some of the available analytic methods applicable to the analysis of proportion data; namely linear regression, Poisson regression, beta-binomial regression and Generalized Linear Mixed Models (GLMMs). We report the conclusions from a simulation study evaluating power and Type I error rates of these models in scenarios akin to those met by behavioral researchers and differing in sample size, cluster size and fixed effects parameters; plus, we describe results from the application of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
