Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data

Feng Liu; Wenkai Xu; Jie Lu; Danica J. Sutherland

arXiv:2106.07636·stat.ML·January 6, 2022·1 cites

Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data

Feng Liu, Wenkai Xu, Jie Lu, Danica J. Sutherland

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces meta two-sample testing (M2ST), leveraging auxiliary data to learn kernels that enable effective testing on new tasks with limited data, outperforming traditional methods.

Contribution

It proposes novel meta-learning algorithms for kernel-based two-sample tests that work well with scarce data, supported by theoretical and empirical validation.

Findings

01

Meta testing schemes outperform direct kernel learning in limited data scenarios.

02

The tailored approach achieves higher test power than generic schemes.

03

Theoretical analysis identifies conditions for successful meta-testing.

Abstract

Modern kernel-based two-sample tests have shown great success in distinguishing complex, high-dimensional distributions with appropriate learned kernels. Previous work has demonstrated that this kernel learning procedure succeeds, assuming a considerable number of observed samples from each distribution. In realistic scenarios with very limited numbers of data samples, however, it can be challenging to identify a kernel powerful enough to distinguish complex distributions. We address this issue by introducing the problem of meta two-sample testing (M2ST), which aims to exploit (abundant) auxiliary data on related tasks to find an algorithm that can quickly identify a powerful test on new target tasks. We propose two specific algorithms for this task: a generic scheme which improves over baselines and a more tailored approach which performs even better. We provide both theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fengliu90/MetaTesting
pytorchOfficial

Videos

Meta Two-Sample Testing: Learning Kernels for Testing with Limited Data· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and Data Classification