Optimal Robust Estimation under Local and Global Corruptions: Stronger Adversary and Smaller Error
Thanasis Pittas, Ankit Pensia

TL;DR
This paper demonstrates that optimal robust estimation is achievable in polynomial time under a combined contamination model with stronger local perturbations, extending the applicability of stability-based estimators to more complex real-world data corruptions.
Contribution
It introduces a polynomial-time algorithm achieving information-theoretic optimal error under a stronger local perturbation model, expanding the robustness of existing estimators.
Findings
Optimal error achieved under stronger local perturbations.
Stability-based estimators remain effective in combined contamination models.
Algorithms for distribution learning and PCA are developed for the new model.
Abstract
Algorithmic robust statistics has traditionally focused on the contamination model where a small fraction of the samples are arbitrarily corrupted. We consider a recent contamination model that combines two kinds of corruptions: (i) small fraction of arbitrary outliers, as in classical robust statistics, and (ii) local perturbations, where samples may undergo bounded shifts on average. While each noise model is well understood individually, the combined contamination model poses new algorithmic challenges, with only partial results known. Existing efficient algorithms are limited in two ways: (i) they work only for a weak notion of local perturbations, and (ii) they obtain suboptimal error for isotropic subgaussian distributions (among others). The latter limitation led [NGS24, COLT'24] to hypothesize that improving the error might, in fact, be computationally hard. Perhaps…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAgricultural risk and resilience · Advanced Statistical Methods and Models
