Maximum likelihood estimators uniformly minimize distribution variance among distribution unbiased estimators in exponential families
Paul Vos, Qiang Wu

TL;DR
This paper demonstrates that maximum likelihood estimators in exponential families uniquely minimize distribution variance among unbiased estimators when viewed as random distributions, extending classical concepts with a distributional perspective.
Contribution
It introduces a distributional framework for estimation, defining distribution expectation and variance, and proves the MLE's optimality and robustness as the minimum distribution variance unbiased estimator.
Findings
MLE is distribution unbiased in exponential families.
MLE uniquely minimizes distribution variance among unbiased estimators.
MLE remains robust for model misspecification via KL projection.
Abstract
We employ a parameter-free distribution estimation framework where estimators are random distributions and utilize the Kullback-Leibler (KL) divergence as a loss function. Wu and Vos [J. Statist. Plann. Inference 142 (2012) 1525-1536] show that when an estimator obtained from an i.i.d. sample is viewed as a random distribution, the KL risk of the estimator decomposes in a fashion parallel to the mean squared error decomposition when the estimator is a real-valued random variable. In this paper, we explore how conditional versions of distribution expectation () can be defined so that a distribution version of the Rao-Blackwell theorem holds. We define distributional expectation and variance () that also provide a decomposition of KL risk in exponential and mixture families. For exponential families, we show that the maximum likelihood estimator (viewed as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
