An Empirical Evaluation of Time-Series Feature Sets

Trent Henderson; Ben D. Fulcher

arXiv:2110.10914 · cs.LG · October 22, 2021

TL;DR

This paper empirically compares seven time-series feature sets (hctsa, feasts, tsfeatures, Kats, tsfresh, TSFEL, and catch22) on computational speed, within-set redundancy, and between-set overlap.

Contribution

The first systematic comparison of these feature sets: it measures computation time per feature, the redundancy of features within each set, and the overlap between sets, giving practitioners a basis for choosing among them.

Findings

01

catch22 and TSFEL are the fastest feature sets (~0.1 ms per feature).

02

TSFEL and tsfresh show the highest within-set redundancy; in TSFEL, four principal components (PCs) capture 90% of the variance across its 390 features.

03

hctsa is the most comprehensive feature set, while tsfresh is the most distinctive.
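
The redundancy finding above rests on counting how many principal components are needed to reach 90% of the variance in a feature matrix. A minimal sketch of that count follows, assuming a matrix with one row per time series and one column per feature. The function name `n_components_for_variance`, the z-scoring step, and the synthetic data are illustrative assumptions, not the authors' pipeline.

```python
import numpy as np


def n_components_for_variance(features: np.ndarray, threshold: float = 0.90) -> int:
    """Smallest number of PCs whose cumulative explained variance reaches `threshold`.

    `features` has shape (n_series, n_features). Columns are z-scored first
    (an assumption of this sketch); constant columns are dropped.
    """
    std = features.std(axis=0)
    keep = std > 0
    z = (features[:, keep] - features[:, keep].mean(axis=0)) / std[keep]

    # Squared singular values of the centered matrix are proportional
    # to the variance captured by each principal component.
    singular_values = np.linalg.svd(z, compute_uv=False)
    explained = singular_values**2 / np.sum(singular_values**2)
    cumulative = np.cumsum(explained)

    # First index where the cumulative share reaches the threshold, as a count.
    count = int(np.searchsorted(cumulative, threshold)) + 1
    return min(count, len(cumulative))


# Synthetic illustration (hypothetical data, not from the paper):
rng = np.random.default_rng(0)

# 50 features that are all mixtures of 3 latent factors: highly redundant.
latent = rng.normal(size=(200, 3))
loadings = rng.normal(size=(3, 50))
redundant = latent @ loadings + 0.01 * rng.normal(size=(200, 50))

# 50 mutually independent features: little redundancy.
independent = rng.normal(size=(200, 50))

n_redundant = n_components_for_variance(redundant)
n_independent = n_components_for_variance(independent)
print(n_redundant, n_independent)
```

A small count relative to the number of features indicates a redundant set; a count close to the number of features indicates that the features carry largely independent information.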

Abstract

Solving time-series problems with features has been rising in popularity due to the availability of software for feature extraction. Feature-based time-series analysis can now be performed using many different feature sets, including hctsa (7730 features: Matlab), feasts (42 features: R), tsfeatures (63 features: R), Kats (40 features: Python), tsfresh (up to 1558 features: Python), TSFEL (390 features: Python), and the C-coded catch22 (22 features: Matlab, R, Python, and Julia). There is substantial overlap in the types of methods included in these sets (e.g., properties of the autocorrelation function and Fourier power spectrum), but they are yet to be systematically compared. Here we compare these seven sets on computational speed, assess the redundancy of features contained in each, and evaluate the overlap and redundancy between them. We take an empirical approach to feature…
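
The speed comparison described in the abstract is reported as time per feature. A minimal sketch of such a measurement is below, assuming three hand-written toy features in place of a real feature set. The names `TOY_FEATURES` and `time_per_feature_ms` are hypothetical, and none of the seven libraries is called here.

```python
import time

import numpy as np


def mean_feature(x: np.ndarray) -> float:
    return float(np.mean(x))


def std_feature(x: np.ndarray) -> float:
    return float(np.std(x))


def lag1_autocorr(x: np.ndarray) -> float:
    """Lag-1 autocorrelation of a mean-centered series."""
    centered = x - x.mean()
    denom = float(np.dot(centered, centered))
    if denom == 0.0:
        return 0.0
    return float(np.dot(centered[:-1], centered[1:]) / denom)


# Toy stand-in for a feature set (hypothetical, for illustration only).
TOY_FEATURES = {
    "mean": mean_feature,
    "std": std_feature,
    "lag1_autocorr": lag1_autocorr,
}


def time_per_feature_ms(x: np.ndarray, feature_fns: dict, repeats: int = 20) -> float:
    """Average wall-clock time per feature evaluation, in milliseconds."""
    start = time.perf_counter()
    for _ in range(repeats):
        for fn in feature_fns.values():
            fn(x)
    elapsed = time.perf_counter() - start
    return 1000.0 * elapsed / (repeats * len(feature_fns))


series = np.random.default_rng(1).normal(size=1000)
ms_per_feature = time_per_feature_ms(series, TOY_FEATURES)
print(f"{ms_per_feature:.4f} ms per feature")
```

Dividing total time by the number of features makes sets of very different sizes comparable, though the result still depends on series length and hardware.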

Code & Models

Repositories

hendersontrent/feature-set-comp
Official

Taxonomy

Methods: Principal Components Analysis