Equitability, interval estimation, and statistical power
Yakir A. Reshef, David N. Reshef, Pardis C. Sabeti, Michael M., Mitzenmacher

TL;DR
This paper introduces the concept of equitability in dependence measures, formalizes it through interpretable intervals, and demonstrates its utility in identifying meaningful relationships in high-dimensional data.
Contribution
It formalizes equitability via interpretable intervals and links it to hypothesis testing, enabling more effective detection of relationships of varying strengths.
Findings
Equitable statistics have small interpretable intervals.
Equitability improves power to distinguish relationships of different strengths.
The paper provides practical methods to evaluate equitability.
Abstract
For analysis of a high-dimensional dataset, a common approach is to test a null hypothesis of statistical independence on all variable pairs using a non-parametric measure of dependence. However, because this approach attempts to identify any non-trivial relationship no matter how weak, it often identifies too many relationships to be useful. What is needed is a way of identifying a smaller set of relationships that merit detailed further analysis. Here we formally present and characterize equitability, a property of measures of dependence that aims to overcome this challenge. Notionally, an equitable statistic is a statistic that, given some measure of noise, assigns similar scores to equally noisy relationships of different types [Reshef et al. 2011]. We begin by formalizing this idea via a new object called the interpretable interval, which functions as an interval estimate of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
