Exact and arbitrarily accurate non-parametric two-sample tests based on rank spacings
Dan D. Erdmann-Pham, Jonathan Terhorst, Yun S. Song

TL;DR
This paper develops exact and finite-sample characterizations of rank-based non-parametric two-sample tests, introducing new algorithms and tail bounds, and confirming conjectures about their efficiency compared to classical tests.
Contribution
It provides exact distributions for rank spacings, algorithms for their computation, and new theoretical results on test efficiency and tail bounds.
Findings
Exact distributions for p=1 case.
Fast algorithms for p>1 distributions.
Proved efficiency conjecture for rank-based tests.
Abstract
A common method for deriving non-parametric tests is to reformulate a parametric test in terms of sample ranks. Despite being distribution free (even in finite samples), the resulting tests often display remarkable asymptotic power properties, typically matching the efficiency of their parametric counterpart. Empirically, these favorable power properties have been shown to persist in non-asymptotic regimes as well, prompting the need for finite-sample characterizations of the corresponding rank-based statistics. Here, we provide such characterization for the family of weighted -norms of rank spacings, which includes the classical tests of Mann-Whitney, Dixon, and various generalizations thereof. For , we provide exact expressions for the involved distributions, while for we describe the associated moment sequences and derive an algorithm to recover the distributions of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Distribution Estimation and Applications · Statistical Methods and Inference · Random Matrices and Applications
