Compound Estimation for Binomials

Yan Chen; Lihua Lei

arXiv:2512.25042·econ.EM·January 1, 2026

Compound Estimation for Binomials

Yan Chen, Lihua Lei

PDF

Open Access

TL;DR

This paper introduces a novel compound decision approach for estimating binomial means, leveraging an approximate SURE for improved accuracy without Gaussian approximations, applicable to small samples and heterogeneous data.

Contribution

It develops an approximate SURE for binomial mean estimation within a compound decision framework, enabling asymptotic optimality and valid inference for machine learning-assisted shrinkage estimators.

Findings

01

Effective in small sample and heterogeneous settings

02

Demonstrated on datasets involving firms, education, and innovation

03

Outperforms traditional methods in accuracy and inference

Abstract

Many applications involve estimating the mean of multiple binomial outcomes as a common problem -- assessing intergenerational mobility of census tracts, estimating prevalence of infectious diseases across countries, and measuring click-through rates for different demographic groups. The most standard approach is to report the plain average of each outcome. Despite simplicity, the estimates are noisy when the sample sizes or mean parameters are small. In contrast, the Empirical Bayes (EB) methods are able to boost the average accuracy by borrowing information across tasks. Nevertheless, the EB methods require a Bayesian model where the parameters are sampled from a prior distribution which, unlike the commonly-studied Gaussian case, is unidentified due to discreteness of binomial measurements. Even if the prior distribution is known, the computation is difficult when the sample sizes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Markov Chains and Monte Carlo Methods