The Geometric Median and Applications to Robust Mean Estimation
Stanislav Minsker, Nate Strawn

TL;DR
This paper investigates the statistical and numerical properties of the geometric median, providing bounds and inequalities that improve robust mean estimation, especially for heavy-tailed distributions, with practical optimization and numerical insights.
Contribution
It offers new bounds for the geometric median's deviation from the mean, dimension-independent results for certain distributions, and practical stopping rules for numerical approximation.
Findings
Dimension-independent bounds for specific distribution classes
Exponential deviation inequalities for sample median
Effective stopping rules for numerical median approximation
Abstract
This paper is devoted to the statistical and numerical properties of the geometric median, and its applications to the problem of robust mean estimation via the median of means principle. Our main theoretical results include (a) an upper bound for the distance between the mean and the median for general absolutely continuous distributions in R^d, and examples of specific classes of distributions for which these bounds do not depend on the ambient dimension d; (b) exponential deviation inequalities for the distance between the sample and the population versions of the geometric median, which again depend only on the trace-type quantities and not on the ambient dimension. As a corollary, we deduce improved bounds for the (geometric) median of means estimator that hold for large classes of heavy-tailed distributions. Finally, we address the error of numerical approximation, which is an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
