Change Point Inference for Non-Euclidean Data Sequences using Distance Profiles
Paromita Dubey, Minxing Zheng

TL;DR
This paper proposes a non-parametric, distance profile-based scan statistic for detecting change points in non-Euclidean data sequences, with theoretical guarantees and practical applications across diverse data types.
Contribution
It introduces a universal, tuning-parameter-free change point detection method for data in metric spaces, with rigorous asymptotic analysis and real-data demonstrations.
Findings
Effective in multivariate and distributional data
Theoretically optimal change point localization
Outperforms existing methods in simulations
Abstract
We introduce a powerful scan statistic and the corresponding test for detecting the presence and pinpointing the location of a change point within the distribution of a data sequence with the data elements residing in a separable metric space . These change points mark abrupt shifts in the distribution of the data sequence as characterized using distance profiles, where the distance profile of an element is the distribution of distances from as dictated by the data. This approach is tuning parameter free, fully non-parametric and universally applicable to diverse data types, including distributional and network data, as long as distances between the data objects are available. We obtain an explicit characterization of the asymptotic distribution of the test statistic under the null hypothesis of no change points, rigorous guarantees on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Anomaly Detection Techniques and Applications · Data Management and Algorithms
