Mean Estimation with Sub-Gaussian Rates in Polynomial Time
Samuel B. Hopkins

TL;DR
This paper introduces a polynomial-time algorithm for mean estimation of heavy-tailed multivariate data that achieves sub-Gaussian confidence intervals under minimal assumptions, improving over previous methods.
Contribution
The paper presents the first polynomial-time algorithm for mean estimation with sub-Gaussian confidence intervals under only finite mean and covariance assumptions.
Findings
Achieves sub-Gaussian confidence intervals in polynomial time
Uses a new semidefinite programming relaxation of a high-dimensional median
Outperforms previous estimators that required stronger assumptions or exponential time
Abstract
We study polynomial time algorithms for estimating the mean of a heavy-tailed multivariate random vector. We assume only that the random vector has finite mean and covariance. In this setting, the radius of confidence intervals achieved by the empirical mean are large compared to the case that is Gaussian or sub-Gaussian. We offer the first polynomial time algorithm to estimate the mean with sub-Gaussian-size confidence intervals under such mild assumptions. Our algorithm is based on a new semidefinite programming relaxation of a high-dimensional median. Previous estimators which assumed only existence of finitely-many moments of either sacrifice sub-Gaussian performance or are only known to be computable via brute-force search procedures requiring time exponential in the dimension.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
