Wasserstein-based identification of metastable states in time series data via change point detection and segment clustering
David Gentile, Joshua Huang, James M. Murphy

TL;DR
This paper introduces a Wasserstein metric-based method for change point detection and clustering in multi-dimensional time series, effectively identifying metastable states without parametric assumptions, with applications in molecular dynamics and acoustics.
Contribution
The paper presents a scalable, non-parametric approach using Wasserstein metrics for change point detection and segment clustering in high-dimensional time series data.
Findings
Successfully identifies metastable states in synthetic and real-world data
Scales linearly with data size and dimension
Effective in molecular dynamics and underwater acoustics applications
Abstract
Change point detection for time series analysis is a difficult and important problem in applied statistics, for which a variety of approaches have been developed in the past several decades. Here, the Wasserstein metric is employed as a tool for change-point identification in multi-dimensional time series data in order to identify clusters in time series in an unsupervised way. We leverage the simplicity of the optimal transport cost in the 1-dimensional setting to quickly identify both a segmentation (family of change points for a trajectory) and a clustering for the data when the number of segments is much smaller than the number of data points, making no parametric assumptions about the particular distributions involved. Our change point detection method scales linearly in the size of the data and in the dimension of the samples. We test our approach on idealized synthetic data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometric Analysis and Curvature Flows · Tensor decomposition and applications · Time Series Analysis and Forecasting
