Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries
Stanislav Minsker

TL;DR
This paper introduces a new estimator for the mean of a random matrix with heavy-tailed entries that achieves sub-Gaussian concentration, improving robustness over traditional methods especially in covariance estimation.
Contribution
The authors develop a novel mean estimator for matrices with heavy-tailed entries that guarantees sub-Gaussian concentration, extending robustness in covariance and matrix completion tasks.
Findings
Estimator achieves sub-Gaussian concentration under finite second moment assumption.
Applicable to covariance estimation and matrix completion problems.
Improves robustness against outliers in heavy-tailed data.
Abstract
Estimation of the covariance matrix has attracted a lot of attention of the statistical research community over the years, partially due to important applications such as Principal Component Analysis. However, frequently used empirical covariance estimator (and its modifications) is very sensitive to outliers in the data. As P. J. Huber wrote in 1964, "...This raises a question which could have been asked already by Gauss, but which was, as far as I know, only raised a few years ago (notably by Tukey): what happens if the true distribution deviates slightly from the assumed normal one? As is now well known, the sample mean then may have a catastrophically bad performance..." Motivated by this question, we develop a new estimator of the (element-wise) mean of a random matrix, which includes covariance estimation problem as a special case. Assuming that the entries of a matrix possess…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
