Survey and Taxonomy: The Role of Data-Centric AI in Transformer-Based Time Series Forecasting
Jingjing Xu, Caesar Wu, Yuan-Fang Li, Gregoire Danoy, Pascal Bouvry

TL;DR
This paper surveys the integration of data-centric AI principles with transformer-based time series forecasting, highlighting the importance of data quality and preprocessing for model performance.
Contribution
It provides a comprehensive taxonomy and literature review on how data-centric AI enhances transformer-based time series forecasting models.
Findings
Data-centric AI is crucial for optimizing transformer-based TSF models.
Existing research emphasizes data preprocessing's impact on model accuracy.
The survey identifies gaps and future directions in integrating data-centric approaches with TSF.
Abstract
Alongside the continuous process of improving AI performance through the development of more sophisticated models, researchers have also focused their attention to the emerging concept of data-centric AI, which emphasizes the important role of data in a systematic machine learning training process. Nonetheless, the development of models has also continued apace. One result of this progress is the development of the Transformer Architecture, which possesses a high level of capability in multiple domains such as Natural Language Processing (NLP), Computer Vision (CV) and Time Series Forecasting (TSF). Its performance is, however, heavily dependent on input data preprocessing and output data evaluation, justifying a data-centric approach to future research. We argue that data-centric AI is essential for training AI models, particularly for transformer-based TSF models efficiently. However,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting · Power Transformer Diagnostics and Insulation · Neural Networks and Applications
MethodsAttention Is All You Need · Label Smoothing · Adam · Linear Layer · Byte Pair Encoding · Layer Normalization · Softmax · Position-Wise Feed-Forward Layer · Dense Connections · Multi-Head Attention
