Robust Hypothesis Testing with Wasserstein Uncertainty Sets

Liyan Xie; Rui Gao; and Yao Xie

arXiv:2105.14348·math.ST·June 1, 2021·6 cites

Robust Hypothesis Testing with Wasserstein Uncertainty Sets

Liyan Xie, Rui Gao, and Yao Xie

PDF

Open Access

TL;DR

This paper introduces a robust hypothesis testing framework using Wasserstein distance to improve performance under limited data scenarios, with applications in healthcare and anomaly detection.

Contribution

It develops a new non-parametric testing method based on distributionally robust optimization with Wasserstein uncertainty sets, including a tractable convex reformulation.

Findings

01

The proposed test is more robust with limited samples.

02

Convex reformulation enables efficient computation.

03

Method performs well on synthetic and real datasets.

Abstract

We consider a data-driven robust hypothesis test where the optimal test will minimize the worst-case performance regarding distributions that are close to the empirical distributions with respect to the Wasserstein distance. This leads to a new non-parametric hypothesis testing framework based on distributionally robust optimization, which is more robust when there are limited samples for one or both hypotheses. Such a scenario often arises from applications such as health care, online change-point detection, and anomaly detection. We study the computational and statistical properties of the proposed test by presenting a tractable convex reformulation of the original infinite-dimensional variational problem exploiting Wasserstein's properties and characterizing the radii selection for the uncertainty sets. We also demonstrate the good performance of our method on synthetic and real data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Process Monitoring