Benchmarks and Custom Package for Energy Forecasting
Zhixian Wang, Qingsong Wen, Chaoli Zhang, Liang Sun, Leandro Von, Krannichfeldt, Shirui Pan, and Yi Wang

TL;DR
This paper introduces new energy forecasting benchmarks, datasets, and a customizable framework that considers external factors and task-specific costs, advancing the evaluation and development of energy prediction models.
Contribution
It provides large-scale energy datasets, a flexible forecasting framework, and extensive benchmarking of 21 methods across multiple metrics, addressing unique challenges in energy forecasting.
Findings
New renewable energy dataset with meteorological data
Framework links forecasting errors to grid dispatch costs
Benchmark results for 21 forecasting methods
Abstract
Energy (load, wind, photovoltaic) forecasting is significant in the power industry as it can provide a reference for subsequent tasks such as power grid dispatch, thus bringing huge economic benefits. However, there are many differences between energy forecasting and traditional time series forecasting. On the one hand, traditional time series mainly focus on capturing characteristics like trends and cycles. In contrast, the energy series is largely influenced by many external factors, such as meteorological and calendar variables. On the other hand, energy forecasting aims to minimize the cost of subsequent tasks such as power grid dispatch, rather than simply pursuing prediction accuracy. In addition, the scale of energy data can also significantly impact the predicted results. In this paper, we collected large-scale load datasets and released a new renewable energy dataset that…
Peer Reviews
Decision·ICLR 2025 Conference Withdrawn Submission
An important and impactful application - framework can handle feature engineering, missing data, common pre-processing pipeline - can be customized to integrate more models and loss functions. designed to handle custom loss functions. - multiple evaluation metrics, loss, public datasets have been integrated.
The work is a very timely application for energy forecasting. The reviewer has some concern regarding the results and reproducability on new datasets and integration of new models. Comments: 1. Are the model features been handled as extrinsic and intrinsic features?, e.g. Weekday vs weekend, #occupancies, static features 2. If the framework code can follow OOP format to aggregate all models, would be more convenient. I was looking for the base model in the repository, could not find it. Not su
In this work, the authors develop an open-source forecasting package for energy forecasting. They also provide a dataset that includes different types of renewable energy generation and key meteorological factors.
While the authors claim that energy forecasting is different from general time series forecasting, they do not provide a detailed explanation of these differences. The feature engineering techniques are heavily centered on temperature and calendar variables. Other external factors that might affect energy forecasting, such as economic indicators or unexpected events (e.g., pandemics), are not extensively explored. The effectiveness of the proposed temperature-calendar feature engineering and
- Provision of a Renewable Energy dataset. - Important collection of Energy datasets. - Combination of various techniques, models, and metrics.
- Clarity could be improved to better understand the different parts/experiments of the proposal. - Paper flow could be enhanced. - Lack of details for some parts, even with the appendix. - Needs thorough proofreading. - Missing important state-of-the-art (SOTA) Time Series Forecasting baselines.
- The paper's package allows for customized forecasting pipelines - The paper collects many datasets and benchmarked on 21 models.
My major concern for this paper is as follows. - Scale of Datasets: The datasets included are relatively small and may not fully capture the scale or complexity of modern power systems. For instance, the load and renewable datasets lack the granularity typically needed at operational levels, such as bus-level load forecasting or wind farm-level generation forecasts. A large-scale dataset, such as the ARPA-E PERFORM dataset provided by NREL, which includes hundreds of time series, would serve a
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting · Time Series Analysis and Forecasting · Neural Networks and Applications
