ETSformer: Exponential Smoothing Transformers for Time-series Forecasting
Gerald Woo, Chenghao Liu, Doyen Sahoo, Akshat Kumar, Steven Hoi

TL;DR
ETSFormer introduces a novel Transformer architecture for time-series forecasting that incorporates exponential smoothing principles, enhancing interpretability, decomposition capabilities, and efficiency for long-term predictions.
Contribution
The paper proposes ETSFormer, a Transformer model utilizing exponential smoothing attention and frequency attention to improve time-series decomposition and forecasting accuracy.
Findings
Outperforms existing methods on multiple benchmarks
Demonstrates improved interpretability and decomposition
Achieves better long-term forecasting accuracy
Abstract
Transformers have been actively studied for time-series forecasting in recent years. While often showing promising results in various scenarios, traditional Transformers are not designed to fully exploit the characteristics of time-series data and thus suffer some fundamental limitations, e.g., they generally lack of decomposition capability and interpretability, and are neither effective nor efficient for long-term forecasting. In this paper, we propose ETSFormer, a novel time-series Transformer architecture, which exploits the principle of exponential smoothing in improving Transformers for time-series forecasting. In particular, inspired by the classical exponential smoothing methods in time-series forecasting, we propose the novel exponential smoothing attention (ESA) and frequency attention (FA) to replace the self-attention mechanism in vanilla Transformers, thus improving both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Data Stream Mining Techniques
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Absolute Position Encodings · Byte Pair Encoding · Label Smoothing · Dense Connections · Residual Connection · Softmax · Layer Normalization
