Optimal nonparametric change point detection and localization
Oscar Hernan Madrid Padilla, Yi Yu, Daren Wang, and Alessandro Rinaldo

TL;DR
This paper develops and analyzes nonparametric change point detection methods for univariate data, extending binary segmentation algorithms to detect distributional changes using Kolmogorov--Smirnov distance, with proven optimality and phase transition insights.
Contribution
It generalizes binary segmentation algorithms to nonparametric settings for distributional change detection and establishes their near minimax optimality and phase transition phenomena.
Findings
Wild binary segmentation is nearly minimax rate-optimal.
A phase transition separates feasible and infeasible localization regimes.
Extensive numerical experiments support theoretical results.
Abstract
We study change point detection and localization for univariate data in fully nonparametric settings in which, at each time point, we acquire an i.i.d. sample from an unknown distribution. We quantify the magnitude of the distributional changes at the change points using the Kolmogorov--Smirnov distance. We allow all the relevant parameters -- the minimal spacing between two consecutive change points, the minimal magnitude of the changes in the Kolmogorov--Smirnov distance, and the number of sample points collected at each time point -- to change with the length of time series. We generalize the renowned binary segmentation (e.g. Scott and Knott, 1974) algorithm and its variant, the wild binary segmentation of Fryzlewicz (2014), both originally designed for univariate mean change point detection problems, to our nonparametric settings and exhibit rates of consistency for both of them.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Control Systems and Identification · Advanced Statistical Process Monitoring
