Change point detection in high dimensional data with U-statistics
B. Cooper Boniece, Lajos Horv\'ath, Peter Jacobs

TL;DR
This paper introduces a new high-dimensional change point detection method using U-statistics based on $L_p$ norms, demonstrating its theoretical consistency and practical effectiveness in various scenarios.
Contribution
It develops a flexible, nonparametric testing procedure for change points in high-dimensional data, with proven asymptotic properties and superior performance in simulations.
Findings
Outperforms existing methods in high-dimensional change point detection
Proven asymptotic distribution under dependence structures
Effective in real-world Twitter data analysis
Abstract
We consider the problem of detecting distributional changes in a sequence of high dimensional data. Our approach combines two separate statistics stemming from norms whose behavior is similar under but potentially different under , leading to a testing procedure that that is flexible against a variety of alternatives. We establish the asymptotic distribution of our proposed test statistics separately in cases of weakly dependent and strongly dependent coordinates as , where denotes sample size and is the dimension, and establish consistency of testing and estimation procedures in high dimensions under one-change alternative settings. Computational studies in single and multiple change point scenarios demonstrate our method can outperform other nonparametric approaches in the literature for certain alternatives in high dimensions. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference
