Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error
Jonah Brown-Cohen

TL;DR
This paper develops faster, provably efficient algorithms for estimating the worst-case expected error in statistical estimation under known data collection processes, improving computational practicality and establishing lower bounds.
Contribution
It introduces online convex optimization-based algorithms for approximating optimal semilinear estimators with explicit runtime guarantees.
Findings
Achieves a .5-approximation using SDPs for .5-normalized data.
Computes top eigenvector iteratively for -normalized data without losing approximation quality.
Identifies conditions under which any estimator has constant worst-case expected error.
Abstract
The study of statistical estimation without distributional assumptions on data values, but with knowledge of data collection methods was recently introduced by Chen, Valiant and Valiant (NeurIPS 2020). In this framework, the goal is to design estimators that minimize the worst-case expected error. Here the expectation is over a known, randomized data collection process from some population, and the data values corresponding to each element of the population are assumed to be worst-case. Chen, Valiant and Valiant show that, when data values are -normalized, there is a polynomial time algorithm to compute an estimator for the mean with worst-case expected error that is within a factor of the optimum within the natural class of semilinear estimators. However, their algorithm is based on optimizing a somewhat complex concave objective function over a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNumerical Methods and Algorithms · Statistical and numerical algorithms · Sparse and Compressive Sensing Techniques
