DDT: A Dual-Masking Dual-Expert Transformer for Energy Time-Series Forecasting
Mingnan Zhu, Qixuan Zhang, Yixuan Cheng, Fangzhou Gu, Shiming Lin

TL;DR
The paper introduces DDT, a dual-masking dual-expert transformer that improves energy time-series forecasting by ensuring causal consistency and modeling complex dependencies, outperforming existing methods on multiple datasets.
Contribution
The paper presents a novel dual-masking mechanism combined with a dual-expert architecture, advancing the accuracy and robustness of energy time-series forecasting models.
Findings
Outperforms state-of-the-art baselines across all datasets and prediction horizons.
Introduces a dual-masking mechanism that ensures causal consistency and adaptively focuses on salient information.
Employs a dual-expert system to separately model temporal dynamics and variable correlations.
Abstract
Accurate energy time-series forecasting is crucial for ensuring grid stability and promoting the integration of renewable energy, yet it faces significant challenges from complex temporal dependencies and the heterogeneity of multi-source data. To address these issues, we propose DDT, a novel and robust deep learning framework for high-precision time-series forecasting. At its core, DDT introduces two key innovations. First, we design a dual-masking mechanism that synergistically combines a strict causal mask with a data-driven dynamic mask. This novel design ensures theoretical causal consistency while adaptively focusing on the most salient historical information, overcoming the rigidity of traditional masking techniques. Second, our architecture features a dual-expert system that decouples the modeling of temporal dynamics and cross-variable correlations into parallel, specialized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting · Solar Radiation and Photovoltaics · Forecasting Techniques and Applications
