Compound Estimation for Binomials
Yan Chen, Lihua Lei

TL;DR
This paper introduces a novel compound decision approach for estimating binomial means, leveraging an approximate SURE for improved accuracy without Gaussian approximations, applicable to small samples and heterogeneous data.
Contribution
It develops an approximate SURE for binomial mean estimation within a compound decision framework, enabling asymptotic optimality and valid inference for machine learning-assisted shrinkage estimators.
Findings
Effective in small sample and heterogeneous settings
Demonstrated on datasets involving firms, education, and innovation
Outperforms traditional methods in accuracy and inference
Abstract
Many applications involve estimating the mean of multiple binomial outcomes as a common problem -- assessing intergenerational mobility of census tracts, estimating prevalence of infectious diseases across countries, and measuring click-through rates for different demographic groups. The most standard approach is to report the plain average of each outcome. Despite simplicity, the estimates are noisy when the sample sizes or mean parameters are small. In contrast, the Empirical Bayes (EB) methods are able to boost the average accuracy by borrowing information across tasks. Nevertheless, the EB methods require a Bayesian model where the parameters are sampled from a prior distribution which, unlike the commonly-studied Gaussian case, is unidentified due to discreteness of binomial measurements. Even if the prior distribution is known, the computation is difficult when the sample sizes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Markov Chains and Monte Carlo Methods
