E-values for k-Sample Tests With Exponential Families
Yunda Hao, Peter Gr\"unwald, Tyron Lardy, Long Long, Reuben Adams

TL;DR
This paper introduces and compares e-variables for k-sample tests within exponential families, providing theoretical insights and efficient algorithms, with results showing their similar behavior under small effects and family-dependent growth rates.
Contribution
The paper develops new e-variables for k-sample testing in exponential families, extending previous methods and providing practical algorithms for their computation.
Findings
E-variables behave similarly under small effects across families.
For Gaussian and Poisson, certain e-variables coincide.
Family-dependent differences in growth rates of e-variables are observed.
Abstract
We develop and compare e-variables for testing whether samples of data are drawn from the same distribution, the alternative being that they come from different elements of an exponential family. We consider the GRO (growth-rate optimal) e-variables for (1) a `small' null inside the same exponential family, and (2) a `large' nonparametric null, as well as (3) an e-variable arrived at by conditioning on the sum of the sufficient statistics. (2) and (3) are efficiently computable, and extend ideas from Turner et al. [2021] and Wald [1947] respectively from Bernoulli to general exponential families. We provide theoretical and simulation-based comparisons of these e-variables in terms of their logarithmic growth rate, and find that for small effects all four e-variables behave surprisingly similarly; for the Gaussian location and Poisson families, e-variables (1) and (3) coincide; for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Software Reliability and Analysis Research · Statistical Methods and Bayesian Inference
