One Fits All:Power General Time Series Analysis by Pretrained LM

Tian Zhou; PeiSong Niu; Xue Wang; Liang Sun; Rong Jin

arXiv:2302.11939·cs.LG·October 17, 2023·117 cites

One Fits All:Power General Time Series Analysis by Pretrained LM

Tian Zhou, PeiSong Niu, Xue Wang, Liang Sun, Rong Jin

PDF

Open Access 3 Repos

TL;DR

This paper demonstrates that pre-trained language and vision transformers can be effectively adapted for diverse time series analysis tasks without modification, achieving state-of-the-art results across multiple benchmarks.

Contribution

It introduces a method to leverage existing pre-trained models for time series analysis by using a frozen transformer, enabling universal application across tasks.

Findings

01

Pre-trained models achieve comparable or superior performance in time series tasks.

02

Self-attention in transformers behaves similarly to PCA, aiding domain adaptation.

03

The approach simplifies transfer learning for time series analysis.

Abstract

Although we have witnessed great success of pre-trained models in natural language processing (NLP) and computer vision (CV), limited progress has been made for general time series analysis. Unlike NLP and CV where a unified model can be used to perform different tasks, specially designed approach still dominates in each time series analysis task such as classification, anomaly detection, forecasting, and few-shot learning. The main challenge that blocks the development of pre-trained model for time series analysis is the lack of a large amount of data for training. In this work, we address this challenge by leveraging language or CV models, pre-trained from billions of tokens, for time series analysis. Specifically, we refrain from altering the self-attention and feedforward layers of the residual blocks in the pre-trained language or image model. This model, known as the Frozen…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Topic Modeling

MethodsAttention Is All You Need · Linear Layer · Adam · Multi-Head Attention · Residual Connection · Layer Normalization · Softmax · Label Smoothing · Position-Wise Feed-Forward Layer · Dense Connections