Short-Term Electricity Load Forecasting Using the Temporal Fusion Transformer: Effect of Grid Hierarchies and Data Sources
Elena Giacomazzi, Felix Haag, Konstantin Hopf

TL;DR
This study evaluates the Temporal Fusion Transformer (TFT) for short-term electricity load forecasting across different grid levels and horizons, finding notable improvements at substation and week-ahead levels compared to LSTM models.
Contribution
It demonstrates the application of TFT to multi-level, multi-horizon load forecasting and compares its performance to LSTM models, highlighting its strengths and limitations.
Findings
TFT performs similarly to LSTM for day-ahead grid forecasting.
TFT significantly improves accuracy at substation level with aggregation.
TFT outperforms LSTM in week-ahead load forecasting.
Abstract
Recent developments related to the energy transition pose particular challenges for distribution grids. Hence, precise load forecasts become more and more important for effective grid management. Novel modeling approaches such as the Transformer architecture, in particular the Temporal Fusion Transformer (TFT), have emerged as promising methods for time series forecasting. To date, just a handful of studies apply TFTs to electricity load forecasting problems, mostly considering only single datasets and a few covariates. Therefore, we examine the potential of the TFT architecture for hourly short-term load forecasting across different time horizons (day-ahead and week-ahead) and network levels (grid and substation level). We find that the TFT architecture does not offer higher predictive performance than a state-of-the-art LSTM model for day-ahead forecasting on the entire grid. However,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting · Smart Grid and Power Systems · Grey System Theory Applications
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dense Connections · Multi-Head Attention · Adam · Residual Connection · Absolute Position Encodings · Softmax · Layer Normalization
