Fusing Large Language Models with Temporal Transformers for Time Series Forecasting
Chen Su, Yuanhe Tian, Qinyu Liu, Jun Zhang, Yan Song

TL;DR
This paper introduces a hybrid Transformer architecture that combines large language models and traditional Transformers to improve time series forecasting by integrating semantic and temporal information.
Contribution
It presents a novel fusion method that leverages LLMs' semantic understanding with Transformers' temporal modeling for enhanced forecasting accuracy.
Findings
Outperforms existing models on benchmark datasets
Effectively captures semantic and temporal patterns
Improves forecasting accuracy significantly
Abstract
Recently, large language models (LLMs) have demonstrated powerful capabilities in performing various tasks and thus are applied by recent studies to time series forecasting (TSF) tasks, which predict future values with the given historical time series. Existing LLM-based approaches transfer knowledge learned from text data to time series prediction using prompting or fine-tuning strategies. However, LLMs are proficient at reasoning over discrete tokens and semantic patterns but are not initially designed to model continuous numerical time series data. The gaps between text and time series data lead LLMs to achieve inferior performance to a vanilla Transformer model that is directly trained on TSF data. However, the vanilla Transformers often struggle to learn high-level semantic patterns. In this paper, we design a novel Transformer-based architecture that complementarily leverages LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Text Analysis Techniques · Time Series Analysis and Forecasting
