Improved multivariate normal mean estimation with unknown covariance when p is greater than n
Didier Ch\'etelat, Martin T. Wells

TL;DR
This paper introduces new estimators for the mean of a high-dimensional normal distribution with unknown covariance, which outperform the usual estimator when the number of variables exceeds the sample size.
Contribution
It develops a novel class of estimators that dominate the standard estimator in the p>n setting, including new risk unbiased estimators and analysis of their performance.
Findings
Proposed estimators outperform the usual estimator in high-dimensional settings.
Developed unbiased risk estimators for the p>n case.
Identified relationships between domination effectiveness, p, and n.
Abstract
We consider the problem of estimating the mean vector of a p-variate normal distribution under invariant quadratic loss, , when the covariance is unknown. We propose a new class of estimators that dominate the usual estimator . The proposed estimators of depend upon X and an independent Wishart matrix S with n degrees of freedom, however, S is singular almost surely when p>n. The proof of domination involves the development of some new unbiased estimators of risk for the p>n setting. We also find some relationships between the amount of domination and the magnitudes of n and p.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
