Robust Estimation under the Wasserstein Distance
Sloan Nietert, Rachel Cummings, and Ziv Goldfeld

TL;DR
This paper introduces a robust distribution estimation method under Wasserstein distance using partial optimal transport and minimum distance estimation, achieving minimax-optimal risk and improving generative modeling with contaminated data.
Contribution
It develops a new structural analysis of partial OT, a novel dual form with a sup-norm penalty, and applies these to enhance WGAN robustness against adversarial corruptions.
Findings
Achieves minimax-optimal robust estimation risk in many settings.
Provides a scalable method for generative modeling with contaminated datasets.
Demonstrates effectiveness through numerical experiments.
Abstract
We study the problem of robust distribution estimation under the Wasserstein distance, a popular discrepancy measure between probability distributions rooted in optimal transport (OT) theory. Given samples from an unknown distribution , of which are adversarially corrupted, we seek an estimate for with minimal Wasserstein error. To address this task, we draw upon two frameworks from OT and robust statistics: partial OT (POT) and minimum distance estimation (MDE). We prove new structural properties for POT and use them to show that MDE under a partial Wasserstein distance achieves the minimax-optimal robust estimation risk in many settings. Along the way, we derive a novel dual form for POT that adds a sup-norm penalty to the classic Kantorovich dual for standard OT. Since the popular Wasserstein generative adversarial network (WGAN) framework implements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Probabilistic and Robust Engineering Design
