A Fast Spectral Algorithm for Mean Estimation with Sub-Gaussian Rates
Zhixian Lei, Kyle Luh, Prayaag Venkat, Fred Zhang

TL;DR
This paper introduces a spectral algorithm for mean estimation of heavy-tailed distributions that achieves optimal sub-Gaussian error bounds with significantly improved computational efficiency over previous SDP-based methods.
Contribution
The authors develop a novel spectral algorithm that surpasses SDP-based approaches in runtime, leveraging eigenvector computations for efficient mean estimation under heavy-tailed data.
Findings
Achieves optimal sub-Gaussian error bounds.
Runs in time rac{rac{n^2 d}{ ilde{}}}
Improves previous runtime from rac{rac{n^{3.5}+ n^2d}{ ilde{}}}.
Abstract
We study the algorithmic problem of estimating the mean of heavy-tailed random vector in , given i.i.d. samples. The goal is to design an efficient estimator that attains the optimal sub-gaussian error bound, only assuming that the random vector has bounded mean and covariance. Polynomial-time solutions to this problem are known but have high runtime due to their use of semi-definite programming (SDP). Conceptually, it remains open whether convex relaxation is truly necessary for this problem. In this work, we show that it is possible to go beyond SDP and achieve better computational efficiency. In particular, we provide a spectral algorithm that achieves the optimal statistical performance and runs in time , improving upon the previous fastest runtime by Cherapanamjeri el al. (COLT '19). Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Machine Learning and Algorithms
