Employing Two-Dimensional Word Embedding for Difficult Tabular Data Stream Classification
Pawe{\l} Zyblewski

TL;DR
This paper introduces SSTML, a novel method that transforms difficult data streams into images using multi-dimensional encoding and applies ResNet-18 for classification, outperforming existing algorithms in accuracy.
Contribution
It pioneers the use of multi-dimensional encoding for data stream classification, integrating image-based deep learning with a new encoding algorithm for improved performance.
Findings
SSTML achieves statistically significant better accuracy than state-of-the-art methods.
SSTML maintains comparable processing times to existing algorithms.
The approach effectively handles concept drift and data imbalance in streams.
Abstract
Rapid technological advances are inherently linked to the increased amount of data, a substantial portion of which can be interpreted as data stream, capable of exhibiting the phenomenon of concept drift and having a high imbalance ratio. Consequently, developing new approaches to classifying difficult data streams is a rapidly growing research area. At the same time, the proliferation of deep learning and transfer learning, as well as the success of convolutional neural networks in computer vision tasks, have contributed to the emergence of a new research trend, namely Multi-Dimensional Encoding (MDE), focusing on transforming tabular data into a homogeneous form of a discrete digital signal. This paper proposes Streaming Super Tabular Machine Learning (SSTML), thereby exploring for the first time the potential of MDE in the difficult data stream classification task. SSTML encodes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Time Series Analysis and Forecasting · Anomaly Detection Techniques and Applications
