Time Tracker: Mixture-of-Experts-Enhanced Foundation Time Series Forecasting Model with Decoupled Training Pipelines
Aobo Liang, Yan Sun, Xiaohou Shi, Ke Li

TL;DR
Time Tracker introduces a mixture-of-experts transformer model with decoupled training pipelines, effectively capturing diverse temporal patterns and inter-series dependencies in multivariate time series forecasting, achieving state-of-the-art results.
Contribution
It proposes a novel mixture-of-experts transformer with Any-variate Attention and a graph learning module to better model complex multivariate time series data.
Findings
Achieves state-of-the-art forecasting accuracy
Improves model generalization across diverse time series
Effectively captures inter-series dependencies
Abstract
In the past few years, time series foundation models have achieved superior predicting accuracy. However, real-world time series often exhibit significant diversity in their temporal patterns across different time spans and domains, making it challenging for a single model architecture to fit all complex scenarios. In addition, time series data may have multiple variables exhibiting complex correlations between each other. Recent mainstream works have focused on modeling times series in a channel-independent manner in both pretraining and finetuning stages, overlooking the valuable inter-series dependencies. To this end, we propose Time Tracker for better predictions on multivariate time series data. Firstly, we leverage sparse mixture of experts (MoE) within Transformers to handle the modeling of diverse time series patterns, thereby alleviating the learning difficulties of a single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Data Visualization and Analytics · Forecasting Techniques and Applications
MethodsSoftmax · Attention Is All You Need
