Refereeing the Referees: Evaluating Two-Sample Tests for Validating   Generators in Precision Sciences

Samuele Grossi; Marco Letizia; Riccardo Torre

arXiv:2409.16336·stat.ML·September 26, 2024

Refereeing the Referees: Evaluating Two-Sample Tests for Validating Generators in Precision Sciences

Samuele Grossi, Marco Letizia, Riccardo Torre

PDF

TL;DR

This paper introduces a robust, computationally efficient methodology for evaluating high-dimensional generative models using two-sample tests based on integral probability measures, with applications in scientific fields like particle physics.

Contribution

It presents a novel sliced Kolmogorov-Smirnov statistic and compares various two-sample tests, demonstrating their effectiveness and efficiency in high-dimensional model validation.

Findings

01

One-dimensional tests achieve sensitivity comparable to multivariate metrics.

02

Proposed tests are significantly faster computationally.

03

Methodology is applicable to complex scientific datasets like particle physics.

Abstract

We propose a robust methodology to evaluate the performance and computational efficiency of non-parametric two-sample tests, specifically designed for high-dimensional generative models in scientific applications such as in particle physics. The study focuses on tests built from univariate integral probability measures: the sliced Wasserstein distance and the mean of the Kolmogorov-Smirnov statistics, already discussed in the literature, and the novel sliced Kolmogorov-Smirnov statistic. These metrics can be evaluated in parallel, allowing for fast and reliable estimates of their distribution under the null hypothesis. We also compare these metrics with the recently proposed unbiased Fr\'echet Gaussian Distance and the unbiased quadratic Maximum Mean Discrepancy, computed with a quartic polynomial kernel. We evaluate the proposed tests on various distributions, focusing on their…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.