Parsimonious Time Series Clustering

Carmela Iorio; Gianluca Frasso; Antonio D'Ambrosio; Roberta Siciliano

arXiv:1509.00729·stat.ME·September 3, 2015

Open Access

TL;DR

This paper presents a parsimonious model-based approach for clustering noisy, sparse time course data: each series is smoothed with P-splines and clustered on its spline coefficients. Accuracy and efficiency are evaluated on simulations and on Drosophila melanogaster gene expression data.

Contribution

It introduces a spline-based clustering framework that reduces the computational burden, handles noisy and sparse series, and can in principle be used within the most common clustering frameworks.

Findings

01 Effective in clustering gene expression data

02 Improves computational efficiency

03 Maintains high clustering accuracy

Abstract

We introduce a parsimonious model-based framework for clustering time course data. In these applications the computational burden often becomes an issue because of the number of available observations. The measured time series can also be very noisy and sparse, and a suitable model describing them can be hard to define. We propose to model the observed measurements with P-spline smoothers and to cluster the functional objects as summarized by the optimal spline coefficients. In principle, this idea can be adopted within all the most common clustering frameworks. In this work we discuss applications based on a k-means algorithm. We evaluate the accuracy and efficiency of our proposal through simulations and an application to Drosophila melanogaster gene expression data.
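The idea in the abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation, and it rests on several assumptions that are not in the source: all series share one time grid, the smoothing parameter `lam` is fixed by hand rather than chosen optimally, the penalty is a second-order difference penalty on a cubic B-spline basis, and k-means is a plain Lloyd iteration with a farthest-point initialization. The function names (`bspline_basis`, `pspline_coefficients`, `kmeans`) and the simulated sine and cosine curves are hypothetical.

```python
import numpy as np

def bspline_basis(x, xl, xr, nseg=10, deg=3):
    """B-spline basis on equally spaced knots (Cox-de Boor recursion, deg >= 1)."""
    dx = (xr - xl) / nseg
    knots = xl + dx * np.arange(-deg, nseg + deg + 1)
    x = np.asarray(x, dtype=float)[:, None]
    # Degree 0: indicator function of each knot interval.
    B = ((x >= knots[None, :-1]) & (x < knots[None, 1:])).astype(float)
    for d in range(1, deg + 1):
        left = (x - knots[None, :-d - 1]) / (d * dx) * B[:, :-1]
        right = (knots[None, d + 1:] - x) / (d * dx) * B[:, 1:]
        B = left + right
    return B  # shape (len(x), nseg + deg)

def pspline_coefficients(Y, t, lam=1.0, nseg=10, deg=3, pord=2):
    """Penalized least-squares spline coefficients, one row per series.

    Assumption: every row of Y is observed on the same time grid t,
    and lam is fixed instead of being tuned per series.
    """
    B = bspline_basis(t, t.min(), t.max(), nseg, deg)
    D = np.diff(np.eye(B.shape[1]), n=pord, axis=0)  # difference penalty matrix
    A = B.T @ B + lam * D.T @ D
    return np.linalg.solve(A, B.T @ Y.T).T

def kmeans(X, k, n_iter=100):
    """Plain Lloyd k-means with a deterministic farthest-point initialization."""
    centers = [X[0]]
    for _ in range(1, k):
        d = np.min([((X - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(X[d.argmax()])
    centers = np.array(centers)
    for _ in range(n_iter):
        dist = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        new = np.array([X[labels == j].mean(axis=0) if np.any(labels == j)
                        else centers[j] for j in range(k)])
        if np.allclose(new, centers):
            break
        centers = new
    return labels, centers

# Simulated example: two groups of noisy curves on a common grid.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 30)
truth = np.repeat([0, 1], 20)
means = np.vstack([np.sin(2 * np.pi * t), np.cos(2 * np.pi * t)])
Y = means[truth] + 0.2 * rng.standard_normal((40, t.size))

coef = pspline_coefficients(Y, t)    # 40 series summarized by 13 coefficients each
labels, centers = kmeans(coef, k=2)  # cluster the coefficients, not the raw data
```

Clustering operates on the 13 spline coefficients per series rather than on the 30 raw observations, which is where the parsimony comes from in this sketch. The reduction would be larger for longer series.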

Taxonomy

Topics: Time Series Analysis and Forecasting · Fermentation and Sensory Analysis · Metabolomics and Mass Spectrometry Studies