Are Language Models Actually Useful for Time Series Forecasting?

Mingtian Tan; Mike A. Merrill; Vinayak Gupta; Tim Althoff; Thomas; Hartvigsen

arXiv:2406.16964·cs.LG·October 29, 2024·24 cites

Are Language Models Actually Useful for Time Series Forecasting?

Mingtian Tan, Mike A. Merrill, Vinayak Gupta, Tim Althoff, Thomas, Hartvigsen

PDF

Open Access 3 Repos 1 Models 1 Video

TL;DR

This paper critically evaluates the effectiveness of large language models in time series forecasting, revealing that simpler models often outperform or match LLMs, questioning their practical utility in this domain.

Contribution

The study systematically compares LLM-based forecasting methods with simpler models, demonstrating that LLMs do not provide significant advantages and are often less efficient.

Findings

01

Removing LLM components does not harm performance

02

Pretrained LLMs do not outperform models trained from scratch

03

Simple attention-based encoders perform similarly to LLMs

Abstract

Large language models (LLMs) are being applied to time series forecasting. But are language models actually useful for time series? In a series of ablation studies on three recent and popular LLM-based time series forecasting methods, we find that removing the LLM component or replacing it with a basic attention layer does not degrade forecasting performance -- in most cases, the results even improve! We also find that despite their significant computational cost, pretrained LLMs do no better than models trained from scratch, do not represent the sequential dependencies in time series, and do not assist in few-shot settings. Additionally, we explore time series encoders and find that patching and attention structures perform similarly to LLM-based forecasters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
lwaekfjlk/Time-Series-Library
model

Videos

Are Language Models Actually Useful for Time Series Forecasting?· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques

MethodsSoftmax · Attention Is All You Need · Activation Patching