An efficient aggregation method for the symbolic representation of temporal data
Xinye Chen, Stefan G\"uttel

TL;DR
The paper introduces fABBA, an improved symbolic aggregation method for temporal data that reduces computational complexity, eliminates the need for predefining symbols, and outperforms existing methods like ABBA, SAX, and 1d-SAX in accuracy and speed.
Contribution
The paper presents fABBA, a novel variant of ABBA that uses a sorting-based aggregation, enhancing efficiency and flexibility in symbolic time series representation.
Findings
fABBA significantly reduces runtime compared to ABBA.
fABBA outperforms SAX and 1d-SAX in reconstruction accuracy.
fABBA can also compress other data types like images.
Abstract
Symbolic representations are a useful tool for the dimension reduction of temporal data, allowing for the efficient storage of and information retrieval from time series. They can also enhance the training of machine learning algorithms on time series data through noise reduction and reduced sensitivity to hyperparameters. The adaptive Brownian bridge-based aggregation (ABBA) method is one such effective and robust symbolic representation, demonstrated to accurately capture important trends and shapes in time series. However, in its current form the method struggles to process very large time series. Here we present a new variant of the ABBA method, called fABBA. This variant utilizes a new aggregation approach tailored to the piecewise representation of time series. By replacing the k-means clustering used in ABBA with a sorting-based aggregation technique, and thereby avoiding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Data Visualization and Analytics · Neural Networks and Applications
Methodsk-Means Clustering
