A Data-Driven Approach to Robust Hypothesis Testing Using Sinkhorn   Uncertainty Sets

Jie Wang; Yao Xie

arXiv:2202.04258·stat.ML·May 17, 2022

A Data-Driven Approach to Robust Hypothesis Testing Using Sinkhorn Uncertainty Sets

Jie Wang, Yao Xie

PDF

Open Access

TL;DR

This paper introduces a data-driven robust hypothesis testing method using Sinkhorn distance to define uncertainty sets, resulting in more flexible detectors that outperform traditional Wasserstein-based tests in small-sample scenarios.

Contribution

It proposes a novel Sinkhorn distance-based approach for robust hypothesis testing, extending the support of least favorable distributions beyond training samples.

Findings

01

Outperforms Wasserstein robust tests in experiments

02

Provides more flexible detectors for small-sample scenarios

03

Validated on synthetic and real datasets

Abstract

Hypothesis testing for small-sample scenarios is a practically important problem. In this paper, we investigate the robust hypothesis testing problem in a data-driven manner, where we seek the worst-case detector over distributional uncertainty sets centered around the empirical distribution from samples using Sinkhorn distance. Compared with the Wasserstein robust test, the corresponding least favorable distributions are supported beyond the training samples, which provides a more flexible detector. Various numerical experiments are conducted on both synthetic and real datasets to validate the competitive performances of our proposed method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications