Time-LLM: Time Series Forecasting by Reprogramming Large Language Models

Ming Jin; Shiyu Wang; Lintao Ma; Zhixuan Chu; James Y. Zhang; Xiaoming; Shi; Pin-Yu Chen; Yuxuan Liang; Yuan-Fang Li; Shirui Pan; Qingsong Wen

arXiv:2310.01728·cs.LG·January 30, 2024·128 cites

Time-LLM: Time Series Forecasting by Reprogramming Large Language Models

Ming Jin, Shiyu Wang, Lintao Ma, Zhixuan Chu, James Y. Zhang, Xiaoming, Shi, Pin-Yu Chen, Yuxuan Liang, Yuan-Fang Li, Shirui Pan, Qingsong Wen

PDF

Open Access 3 Repos 1 Video 3 Reviews

TL;DR

Time-LLM introduces a novel reprogramming framework that leverages large language models for time series forecasting, effectively aligning data modalities and enhancing reasoning capabilities, outperforming specialized models in various scenarios.

Contribution

The paper proposes a new method to repurpose large language models for time series forecasting by reprogramming inputs and introducing Prompt-as-Prefix to improve reasoning.

Findings

01

Outperforms state-of-the-art forecasting models.

02

Excels in few-shot and zero-shot learning scenarios.

03

Effectively aligns time series data with language models.

Abstract

Time series forecasting holds significant importance in many real-world dynamic systems and has been extensively studied. Unlike natural language process (NLP) and computer vision (CV), where a single large model can tackle multiple tasks, models for time series forecasting are often specialized, necessitating distinct designs for different tasks and applications. While pre-trained foundation models have made impressive strides in NLP and CV, their development in time series domains has been constrained by data sparsity. Recent studies have revealed that large language models (LLMs) possess robust pattern recognition and reasoning abilities over complex sequences of tokens. However, the challenge remains in effectively aligning the modalities of time series data and natural language to leverage these capabilities. In this work, we present Time-LLM, a reprogramming framework to repurpose…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 3· reject, not good enoughConfidence 5

Strengths

- This paper provides a summary of the metrics for pre-trained large-language models, including generalizability, data efficiency, reasoning, and multimodal knowledge. - The details of the proposed method are presented clearly and are easy to follow.

Weaknesses

My main concerns include: - While LLM is a hot topic in the deep learning community, it is still unconvincing to directly transfer the knowledge of natural language in LLMs to time series tasks. Note that i) text and time series are distinct data modalities, and ii) the pre-trained LLMs are not pre-trained with text-time-series pairs. - Furthermore, the first contribution *“introducing a novel concept of reprogramming large language models for time series forecasting without altering the pre-t

Reviewer 02Rating 8· accept, good paperConfidence 4

Strengths

1. The paper is very well written, clear, with sufficient details to ensure reproducibility. I really liked that desiderata that the authors identified to enable LLMs to produce forecast. 2. The experiments were well designed with some limitations in rigour which I will discuss in the next section. I really liked the paper, it was well written, well motivated and performant.

Weaknesses

Following are some things to improve in the paper. I think in general the experiments can be made more rigorous. 1. **Baselines**: I understand that the authors are following the experiment protocol followed by TimesNet, but there are several limitations: (1) Statistical methods such as AutoARIMA, AutoTHETA, AutoETS, Naive and Seasonal Naive, etc. were not compared with. These methods are important and very performant in practice, (2) N-BEATS and N-HITS were only compared during short-horizon f

Reviewer 03Rating 8· accept, good paperConfidence 4

Strengths

1. The paper is articulate and systematically structured, making the motivation and methodology behind the proposed solution evident. 2. The approach of modality alignment from time series to natural language is both innovative and promising, offering a new perspective for future research. 3. The empirical evaluation is thorough, encompassing an analysis of different LLM variations, an ablation study, computational efficiency considerations, and model interpretation.

Weaknesses

**Major** 1. The choice of datasets for evaluation is restrictive, as the ETT datasets involve similar metrics monitored under different conditions. Their mutual similarities might overinflate the perceived performance of Time-LLM. Inclusion of diverse datasets such as Weather, Electricity, and Traffic, commonly featured in literature, would offer a more holistic assessment. Moreover, there's an emerging consensus that long-term forecasting benchmarks have a preference for univariate models, pot

Code & Models

Repositories

Videos

Time-LLM: Time Series Forecasting by Reprogramming Large Language Models· slideslive

Taxonomy

TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Forecasting Techniques and Applications

MethodsALIGN