TL;DR
This paper empirically compares seven time-series feature sets in terms of speed, redundancy, and overlap, providing insights to optimize their use in various applications.
Contribution
It systematically evaluates and compares multiple time-series feature sets on speed, redundancy, and overlap, offering guidance for selecting appropriate features.
Findings
Catch22 and TSFEL are the fastest feature sets (~0.1ms per feature).
TSFEL and tsfresh exhibit high redundancy, with 90% variance captured by four PCs.
hctsa is the most comprehensive feature set, while tsfresh is the most distinctive.
Abstract
Solving time-series problems with features has been rising in popularity due to the availability of software for feature extraction. Feature-based time-series analysis can now be performed using many different feature sets, including hctsa (7730 features: Matlab), feasts (42 features: R), tsfeatures (63 features: R), Kats (40 features: Python), tsfresh (up to 1558 features: Python), TSFEL (390 features: Python), and the C-coded catch22 (22 features: Matlab, R, Python, and Julia). There is substantial overlap in the types of methods included in these sets (e.g., properties of the autocorrelation function and Fourier power spectrum), but they are yet to be systematically compared. Here we compare these seven sets on computational speed, assess the redundancy of features contained in each, and evaluate the overlap and redundancy between them. We take an empirical approach to feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPrincipal Components Analysis
