NSW-EPNews: A News-Augmented Benchmark for Electricity Price Forecasting with LLMs
Zhaoge Bi, Linghan Huang, Haolin Jin, Qingwen Zeng, Huaming Chen

TL;DR
This paper introduces NSW-EPNews, a comprehensive benchmark combining time-series data, weather, and news to evaluate the performance of traditional models and large language models in electricity price forecasting.
Contribution
It presents the first multimodal benchmark dataset for electricity price prediction that includes textual news, weather, and numerical data, enabling evaluation of LLMs in this domain.
Findings
Traditional models gain little from news features.
State-of-the-art LLMs show modest performance improvements.
LLMs often hallucinate or fabricate data in forecasts.
Abstract
Electricity price forecasting is a critical component of modern energy-management systems, yet existing approaches heavily rely on numerical histories and ignore contemporaneous textual signals. We introduce NSW-EPNews, the first benchmark that jointly evaluates time-series models and large language models (LLMs) on real-world electricity-price prediction. The dataset includes over 175,000 half-hourly spot prices from New South Wales, Australia (2015-2024), daily temperature readings, and curated market-news summaries from WattClarity. We frame the task as 48-step-ahead forecasting, using multimodal input, including lagged prices, vectorized news and weather features for classical models, and prompt-engineered structured contexts for LLMs. Our datasets yields 3.6k multimodal prompt-output pairs for LLM evaluation using specific templates. Through compresive benchmark design, we identify…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnergy Load and Power Forecasting
