Non-asymptotic analysis and inference for an outlyingness induced winsorized mean
Yijun Zuo

TL;DR
This paper introduces an outlyingness induced winsorized mean estimator that achieves optimal robustness against 50% contamination, maintains sub-Gaussian performance, and is computationally efficient in finite samples.
Contribution
It proposes a novel winsorized mean estimator with maximal robustness and sub-Gaussian performance, improving robustness and efficiency over existing estimators.
Findings
Resists up to 50% data contamination without breakdown.
Achieves sub-Gaussian performance in uncontaminated samples.
Computable in linear time with bounded error in finite samples.
Abstract
Robust estimation of a mean vector, a topic regarded as obsolete in the traditional robust statistics community, has recently surged in machine learning literature in the last decade. The latest focus is on the sub-Gaussian performance and computability of the estimators in a non-asymptotic setting. Numerous traditional robust estimators are computationally intractable, which partly contributes to the renewal of the interest in the robust mean estimation. Robust centrality estimators, however, include the trimmed mean and the sample median. The latter has the best robustness but suffers a low-efficiency drawback. Trimmed mean and median of means, %as robust alternatives to the sample mean, and achieving sub-Gaussian performance have been proposed and studied in the literature. This article investigates the robustness of leading sub-Gaussian estimators of mean and reveals that none…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Advanced Statistical Process Monitoring
