Large-Scale Estimation under Unknown Heteroskedasticity
Sheng Chao Ho

TL;DR
This paper develops nonparametric empirical Bayes methods for estimating heterogeneous means and variances, providing nearly optimal estimators with applications to evaluating teacher quality.
Contribution
It extends Tweedie's formulae to unknown heteroskedastic settings and proposes feasible estimators with near-parametric regret bounds.
Findings
Allowing for heteroskedasticity improves teacher quality estimates.
Feasible estimators achieve near-optimal regret bounds.
Application demonstrates importance of variance heterogeneity in practice.
Abstract
This paper studies nonparametric empirical Bayes methods in a heterogeneous parameters framework that features unknown means and variances. We provide extended Tweedie's formulae that express the (infeasible) optimal estimators of heterogeneous parameters, such as unit-specific means or quantiles, in terms of the density of certain sufficient statistics. These are used to propose feasible versions with nearly parametric regret bounds of the order of . The estimators are employed in a study of teachers' value-added, where we find that allowing for heterogeneous variances across teachers is crucial for delivery optimal estimates of teacher quality and detecting low-performing teachers.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
