Simple and Optimal Sublinear Algorithms for Mean Estimation
Beatrice Bertolotti, Matteo Russo, Chris Schwiegelshohn, Sudarshan Shyam

TL;DR
This paper introduces new sublinear algorithms for multivariate mean estimation that achieve optimal sample complexity and runtime, with practical methods for approximating the mean using geometric medians and coordinate-wise medians.
Contribution
It presents the first optimal sublinear algorithms for multivariate mean estimation, including novel estimators and a fast gradient descent method for geometric median computation.
Findings
Optimal sample complexity achieved for mean estimation
Proposed gradient descent algorithm outperforms existing methods
Empirical results show geometric median-based approach is most effective
Abstract
We study the sublinear multivariate mean estimation problem in -dimensional Euclidean space. Specifically, we aim to find the mean of a ground point set , which minimizes the sum of squared Euclidean distances of the points in to . We first show that a multiplicative approximation to can be found with probability using many independent uniform random samples, and provide a matching lower bound. Furthermore, we give two estimators with optimal sample complexity that can be computed in optimal running time for extracting a suitable approximate mean: 1. The coordinate-wise median of sample means of sample size . As a corollary, we also show improved convergence rates for this estimator for estimating means of multivariate distributions. 2. The geometric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsControl Systems and Identification
