TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables
Yuxuan Wang, Haixu Wu, Jiaxiang Dong, Guo Qin, Haoran Zhang, Yong Liu,, Yunzhong Qiu, Jianmin Wang, Mingsheng Long

TL;DR
TimeXer introduces a Transformer-based model that effectively incorporates exogenous variables to improve time series forecasting accuracy, demonstrating state-of-the-art results on multiple real-world datasets.
Contribution
The paper presents a novel Transformer architecture, TimeXer, designed to integrate exogenous variables into time series forecasting, which is more practical and effective than existing methods.
Findings
Achieves state-of-the-art performance on 12 benchmarks.
Demonstrates strong generality across different datasets.
Exhibits scalability for large-scale forecasting tasks.
Abstract
Deep models have demonstrated remarkable performance in time series forecasting. However, due to the partially-observed nature of real-world applications, solely focusing on the target of interest, so-called endogenous variables, is usually insufficient to guarantee accurate forecasting. Notably, a system is often recorded into multiple variables, where the exogenous variables can provide valuable external information for endogenous variables. Thus, unlike well-established multivariate or univariate forecasting paradigms that either treat all the variables equally or ignore exogenous information, this paper focuses on a more practical setting: time series forecasting with exogenous variables. We propose a novel approach, TimeXer, to ingest external information to enhance the forecasting of endogenous variables. With deftly designed embedding layers, TimeXer empowers the canonical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Complex Systems and Time Series Analysis
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Dropout · Multi-Head Attention · Softmax · Dense Connections · Label Smoothing · Adam
