The Top-K Tau-Path Screen for Monotone Association
Srinath Sampath, Adriano Caloiaro, Wayne Johnson, Joseph S. Verducci

TL;DR
This paper introduces the top-K tau-path (TKTP) method, which efficiently identifies subpopulations with monotone associations in large datasets, improving upon previous methods by enabling analysis of thousands of variable pairs.
Contribution
The paper presents a novel TKTP algorithm that enhances the detection of subpopulations with monotone associations, scalable to large datasets, with detailed computational analysis and practical demonstrations.
Findings
Algorithm efficiently identifies subpopulations with high association.
Scalability to thousands of variable pairs and large sample sizes.
Simulation studies confirm accuracy across various settings.
Abstract
A pair of variables that tend to rise and fall either together or in opposition are said to be monotonically associated. For certain phenomena, this tendency is causally restricted to a subpopulation, as, for example, an allergic reaction to an irritant. Previously, Yu et al. (2011) devised a method of rearranging observations to test paired data to see if such an association might be present in a subpopulation. However, the computational intensity of the method limited its application to relatively small samples of data, and the test itself only judges if association is present in some subpopulation; it does not clearly identify the subsample that came from this subpopulation, especially when the whole sample tests positive. The present paper adds a "top-K" feature (Sampath and Verducci (2013)) based on a multistage ranking model, that identifies a concise subsample that is likely to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Bayesian Methods and Mixture Models · Hypothalamic control of reproductive hormones
