Minimax Statistical Estimation under Wasserstein Contamination

Patrick Chao; Edgar Dobriban

arXiv:2308.01853·stat.ML·November 24, 2025·1 cites

Minimax Statistical Estimation under Wasserstein Contamination

Patrick Chao, Edgar Dobriban

PDF

Open Access 2 Repos

TL;DR

This paper develops a minimax theory for Wasserstein-$r$ contamination models in statistical estimation, demonstrating that classical estimators like the mean and least squares are nearly optimal under these robust contamination settings.

Contribution

It introduces a comprehensive minimax framework for Wasserstein-$r$ contaminations, analyzing fundamental problems and identifying optimal estimators and contamination strategies.

Findings

01

Exact minimax risks are derived for location estimation and linear regression.

02

Classical estimators like the mean and least squares are shown to be nearly optimal.

03

Optimal density estimation rates are established with adjusted bandwidths.

Abstract

Contaminations are a key concern in modern statistical learning, as small but systematic perturbations of all datapoints can substantially alter estimation results. Here, we study Wasserstein- $r$ contaminations ( $r \geq 1$ ) in an $ℓ_{q}$ norm ( $q \in [1, \infty]$ ), in which each observation may undergo an adversarial perturbation with bounded cost, complementing the classical Huber model, corresponding to total variation norm, where only a fraction of observations is arbitrarily corrupted. We study both independent and joint (coordinated) contaminations and develop a minimax theory under $ℓ_{q}^{r}$ losses. Our analysis encompasses several fundamental problems: location estimation, linear regression, and pointwise nonparametric density estimation. For joint contaminations in location estimation and for prediction in linear regression, we obtain the exact minimax risk, identify least…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization

MethodsFocus