Computationally efficient change point detection for high-dimensional regression
Florencia Leonardi, Peter B\"uhlmann

TL;DR
This paper introduces a computationally efficient method for detecting multiple change points in high-dimensional regression data, combining theoretical guarantees with practical algorithms like dynamic programming and binary search.
Contribution
It proposes a novel joint estimator for change points and segment parameters, with scalable algorithms and proven theoretical properties in high-dimensional settings.
Findings
The binary search algorithm achieves near-equivalent accuracy to dynamic programming.
The estimator satisfies oracle inequalities even with many change points.
Performance validated on simulated and real datasets.
Abstract
Large-scale sequential data is often exposed to some degree of inhomogeneity in the form of sudden changes in the parameters of the data-generating process. We consider the problem of detecting such structural changes in a high-dimensional regression setting. We propose a joint estimator of the number and the locations of the change points and of the parameters in the corresponding segments. The estimator can be computed using dynamic programming or, as we emphasize here, it can be approximated using a binary search algorithm with computational operations while still enjoying essentially the same theoretical properties; here denotes the computational cost of computing the Lasso for sample size . We establish oracle inequalities for the estimator as well as for its binary search approximation, covering also the case with a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Control Systems and Identification · Statistical and numerical algorithms
