Maximum Likelihood Estimation of a Proportion from a Sample of Triplets
Rafael Weissbach, Eric Scholz

TL;DR
This paper develops a maximum likelihood method to estimate a proportion from triplet samples, accounting for dependencies within triplets, and demonstrates its application to caries prevalence data in children.
Contribution
It introduces a novel MLE approach that models intra-triplet correlations without assuming a specific distribution for success counts.
Findings
Standard errors are significantly inflated when dependencies are ignored.
The method provides consistent and asymptotically normal estimates.
Application to real data illustrates the importance of accounting for dependencies.
Abstract
When estimating a proportion and only a sample of triplets is given, dependencies within the triplets are to be accounted for. Without assuming a distribution for the success count of the triplet, together with the proportion, as second and third parameter the correlations of 1st and 2nd order enter the model. We apply maximum likelihood estimation, and derive consistency by using that the triplet count is multinomially distributed, combined with the continuous mapping theorem. The asymptotic normality follows with the delta-method, resulting in closed-form expressions for the standard errors. As application we study caries prevalence of pre-school children from a sample to nursing schools. We compare the standard errors with those for assuming erroneously independence within the nursing schools. As to be suspected, the design `inflates' the standard error markedly.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Survey Sampling and Estimation Techniques
