Signature Maximum Mean Discrepancy Two-Sample Statistical Tests

Andrew Alden; Blanka Horvath; Zacharia Issa

arXiv:2506.01718·stat.ML·June 3, 2025

Signature Maximum Mean Discrepancy Two-Sample Statistical Tests

Andrew Alden, Blanka Horvath, Zacharia Issa

PDF

Open Access

TL;DR

This paper explores the signature Maximum Mean Discrepancy (sig-MMD), a kernel-based method for comparing path space distributions, analyzing its practical application, challenges like Type 2 errors, and mitigation techniques.

Contribution

It introduces the sig-MMD for two-sample testing on path space, providing practical examples, identifying challenges, and proposing solutions for effective use.

Findings

01

sig-MMD can be used to compare distributions on path space

02

Type 2 errors can occur in limited data settings

03

Techniques to reduce false negatives are proposed

Abstract

Maximum Mean Discrepancy (MMD) is a widely used concept in machine learning research which has gained popularity in recent years as a highly effective tool for comparing (finite-dimensional) distributions. Since it is designed as a kernel-based method, the MMD can be extended to path space valued distributions using the signature kernel. The resulting signature MMD (sig-MMD) can be used to define a metric between distributions on path space. Similarly to the original use case of the MMD as a test statistic within a two-sample testing framework, the sig-MMD can be applied to determine if two sets of paths are drawn from the same stochastic process. This work is dedicated to understanding the possibilities and challenges associated with applying the sig-MMD as a statistical tool in practice. We introduce and explain the sig-MMD, and provide easily accessible and verifiable examples for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference