MultiResFormer: Transformer with Adaptive Multi-Resolution Modeling for General Time Series Forecasting
Linfeng Du, Ji Xin, Alex Labach, Saba Zuberi, Maksims Volkovs, Rahul, G. Krishnan

TL;DR
MultiResFormer introduces an adaptive multi-resolution transformer model that dynamically selects optimal patch lengths to better capture complex temporal dependencies in time series forecasting, outperforming existing methods.
Contribution
It proposes a novel transformer architecture that adaptively models temporal variations with multiple patch lengths, enhancing forecasting accuracy.
Findings
Outperforms state-of-the-art patch-based transformers in long-term forecasting.
Consistently outperforms CNN baselines by a large margin.
Uses fewer parameters than comparable models.
Abstract
Transformer-based models have greatly pushed the boundaries of time series forecasting recently. Existing methods typically encode time series data into using one or a fixed set of patch lengths. This, however, could result in a lack of ability to capture the variety of intricate temporal dependencies present in real-world multi-periodic time series. In this paper, we propose MultiResFormer, which dynamically models temporal variations by adaptively choosing optimal patch lengths. Concretely, at the beginning of each layer, time series data is encoded into several parallel branches, each using a detected periodicity, before going through the transformer encoder block. We conduct extensive evaluations on long- and short-term forecasting datasets comparing MultiResFormer with state-of-the-art baselines. MultiResFormer outperforms patch-based Transformer baselines on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Data Visualization and Analytics
MethodsSparse Evolutionary Training · Multi-Head Attention · Attention Is All You Need · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer
