Empirical Studies on Symbolic Aggregation Approximation Under Statistical Perspectives for Knowledge Discovery in Time Series
Wei Song, Zhiguang Wang, Yangdong Ye, Ming Fan

TL;DR
This paper empirically investigates the statistical properties of Symbolic Aggregation Approximation (SAX) in time series analysis, introducing a new measurement and providing insights into its effectiveness and applicability.
Contribution
The study introduces a novel statistical measure, IEC, to evaluate SAX's adequacy and offers an analytical framework for analyzing symbolic dynamics in time series.
Findings
SAX reduces complexity while preserving core information.
IEC score helps determine SAX suitability for datasets.
Statistical tools enhance understanding of symbolic representations.
Abstract
Symbolic Aggregation approXimation (SAX) has been the de facto standard representation methods for knowledge discovery in time series on a number of tasks and applications. So far, very little work has been done in empirically investigating the intrinsic properties and statistical mechanics in SAX words. In this paper, we applied several statistical measurements and proposed a new statistical measurement, i.e. information embedding cost (IEC) to analyze the statistical behaviors of the symbolic dynamics. Our experiments on the benchmark datasets and the clinical signals demonstrate that SAX can always reduce the complexity while preserving the core information embedded in the original time series with significant embedding efficiency. Our proposed IEC score provide a priori to determine if SAX is adequate for specific dataset, which can be generalized to evaluate other symbolic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Advanced Text Analysis Techniques · Complex Systems and Time Series Analysis
