MoDEx: Mixture of Depth-specific Experts for Multivariate Long-term Time Series Forecasting

Hyekyung Yoon; Minhyuk Lee; Imseung Park; Myungjoo Kang

arXiv:2602.00624·cs.LG·February 3, 2026

MoDEx: Mixture of Depth-specific Experts for Multivariate Long-term Time Series Forecasting

Hyekyung Yoon, Minhyuk Lee, Imseung Park, Myungjoo Kang

PDF

Open Access

TL;DR

MoDEx introduces a novel mixture of depth-specific experts for multivariate long-term time series forecasting, leveraging layer sensitivity to improve accuracy and efficiency across various benchmarks.

Contribution

The paper proposes MoDEx, a lightweight mixture of depth-specific MLP experts, inspired by layer sensitivity analysis, to enhance long-term time series forecasting performance.

Findings

01

Achieves state-of-the-art accuracy on seven benchmarks.

02

Ranks first in 78% of evaluated cases.

03

Uses fewer parameters and computational resources.

Abstract

Multivariate long-term time series forecasting (LTSF) supports critical applications such as traffic-flow management, solar-power scheduling, and electricity-transformer monitoring. The existing LTSF paradigms follow a three-stage pipeline of embedding, backbone refinement, and long-horizon prediction. However, the behaviors of individual backbone layers remain underexplored. We introduce layer sensitivity, a gradient-based metric inspired by GradCAM and effective receptive field theory, which quantifies both positive and negative contributions of each time point to a layer's latent features. Applying this metric to a three-layer MLP backbone reveals depth-specific specialization in modeling temporal dynamics in the input sequence. Motivated by these insights, we propose MoDEx, a lightweight Mixture of Depth-specific Experts, which replaces complex backbones with depth-specific MLP…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic Prediction and Management Techniques · Time Series Analysis and Forecasting · Stock Market Forecasting Methods