MultiResFormer: Transformer with Adaptive Multi-Resolution Modeling for General Time Series Forecasting

Linfeng Du; Ji Xin; Alex Labach; Saba Zuberi; Maksims Volkovs; Rahul G. Krishnan

arXiv:2311.18780·cs.LG·February 9, 2024·1 cites

TL;DR

MultiResFormer is a Transformer for time series forecasting that adaptively selects patch lengths from periodicities detected in the input, so that it can capture temporal dependencies at several resolutions. It is reported to outperform patch-based Transformer and CNN baselines.

Contribution

The paper proposes a Transformer architecture that models temporal variations adaptively, encoding the series with multiple patch lengths in parallel to improve forecasting accuracy.

Findings

01. Outperforms state-of-the-art patch-based Transformers in long-term forecasting.

02. Consistently outperforms CNN baselines by a large margin.

03. Uses fewer parameters than comparable models.

Abstract

Transformer-based models have greatly pushed the boundaries of time series forecasting recently. Existing methods typically encode time series data into patches using one or a fixed set of patch lengths. This, however, could result in a lack of ability to capture the variety of intricate temporal dependencies present in real-world multi-periodic time series. In this paper, we propose MultiResFormer, which dynamically models temporal variations by adaptively choosing optimal patch lengths. Concretely, at the beginning of each layer, time series data is encoded into several parallel branches, each using a detected periodicity, before going through the transformer encoder block. We conduct extensive evaluations on long- and short-term forecasting datasets comparing MultiResFormer with state-of-the-art baselines. MultiResFormer outperforms patch-based Transformer baselines on…
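The abstract describes encoding the series into parallel branches, each patched at a detected periodicity. The following is a minimal sketch of that idea, not the authors' implementation. It assumes periodicities are detected by picking the strongest FFT amplitudes, which the abstract does not state; the function names `detect_periods` and `to_patches`, the parameter `k`, and the zero-padding choice are all hypothetical. The Transformer encoder block that would process each branch is omitted.

```python
import numpy as np

def detect_periods(x, k=3):
    """Return up to k candidate periods from the strongest FFT amplitudes.

    Assumption: FFT peak picking stands in for whatever detection
    the paper actually uses.
    """
    n = len(x)
    amp = np.abs(np.fft.rfft(x - x.mean()))
    amp[0] = 0.0  # ignore the DC component
    top = np.argsort(amp)[::-1][:k]  # frequency indices, strongest first
    periods = []
    for f in top:
        if f == 0:
            continue
        p = int(round(n / f))  # period length in time steps
        if p not in periods:
            periods.append(p)
    return periods

def to_patches(x, patch_len):
    """Zero-pad x to a multiple of patch_len, then reshape to (num_patches, patch_len)."""
    pad = (-len(x)) % patch_len
    padded = np.concatenate([x, np.zeros(pad)])
    return padded.reshape(-1, patch_len)

# Toy series with two periodic components (periods 24 and 8).
t = np.arange(96)
series = np.sin(2 * np.pi * t / 24) + 0.5 * np.sin(2 * np.pi * t / 8)

periods = detect_periods(series, k=2)
# One branch per detected period, each with its own patch length.
branches = {p: to_patches(series, p) for p in periods}
for p, patches in branches.items():
    print(p, patches.shape)
```

Each entry of `branches` is a sequence of patches at a different resolution. In a model of this kind, each branch would be embedded and passed through an encoder block before the branch outputs are combined.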

Taxonomy

Topics: Time Series Analysis and Forecasting · Stock Market Forecasting Methods · Data Visualization and Analytics

Methods: Sparse Evolutionary Training · Multi-Head Attention · Attention Is All You Need · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer