Minimax Statistical Estimation under Wasserstein Contamination
Patrick Chao, Edgar Dobriban

TL;DR
This paper develops a minimax theory for Wasserstein-$r$ contamination models in statistical estimation, demonstrating that classical estimators like the mean and least squares are nearly optimal under these robust contamination settings.
Contribution
It introduces a comprehensive minimax framework for Wasserstein-$r$ contaminations, analyzing fundamental problems and identifying optimal estimators and contamination strategies.
Findings
Exact minimax risks are derived for location estimation and linear regression.
Classical estimators like the mean and least squares are shown to be nearly optimal.
Optimal density estimation rates are established with adjusted bandwidths.
Abstract
Contaminations are a key concern in modern statistical learning, as small but systematic perturbations of all datapoints can substantially alter estimation results. Here, we study Wasserstein- contaminations () in an norm (), in which each observation may undergo an adversarial perturbation with bounded cost, complementing the classical Huber model, corresponding to total variation norm, where only a fraction of observations is arbitrarily corrupted. We study both independent and joint (coordinated) contaminations and develop a minimax theory under losses. Our analysis encompasses several fundamental problems: location estimation, linear regression, and pointwise nonparametric density estimation. For joint contaminations in location estimation and for prediction in linear regression, we obtain the exact minimax risk, identify least…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization
MethodsFocus
