Change Point Analysis of Multivariate Data: Using Multivariate Rank-based Distribution-free Nonparametric Testing via Measure Transportation with Applications in Tumor Microarrays and Dementia
Amanda Ng

TL;DR
This paper introduces a distribution-free, nonparametric algorithm for detecting multiple change points in multivariate data using measure transportation-based ranks, with applications in medical datasets like tumor microarrays and dementia progression.
Contribution
It develops a novel, assumption-light algorithm for multivariate change point detection based on measure transportation ranks, and provides theoretical properties and practical implementation in R.
Findings
Successfully applied to tumor microarray data, epilepsy ECoG data, and dementia score trajectories.
Estimates number and locations of change points without parametric assumptions.
Algorithm implemented in the R package recp, available on GitHub.
Abstract
In this paper, I propose a general algorithm for multiple change point analysis via multivariate distribution-free nonparametric testing based on the concept of ranks that are defined by measure transportation. Multivariate ranks and the usual one-dimensional ranks both share an important property: they are both distribution-free. This finding allows for the creation of nonparametric tests that are distribution-free under the null hypothesis. This method has applications in a variety of fields, and in this paper I implement this algorithm to a microarray dataset for individuals with bladder tumors, an ECoG snapshot for a patient with epilepsy, and in the context of trajectories of CASI scores by education level and dementia status. Each change point denotes a shift in the rate of change of Cognitive Abilities score over years, indicating the existence of preclinical dementia. Here I…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gene expression and cancer classification · Bayesian Methods and Mixture Models
