Optimizing Day-Ahead Energy Trading with Proximal Policy Optimization and Blockchain
Navneet Verma, Ying Xie

TL;DR
This paper presents a novel framework combining reinforcement learning and blockchain to optimize day-ahead energy trading, enhancing grid stability, security, and transparency in decentralized renewable energy markets.
Contribution
It introduces a new system architecture integrating PPO-based RL with blockchain for secure, transparent energy trading and demonstrates its effectiveness with real-world data.
Findings
Demand-supply balancing within 2% accuracy
Near-optimal supply costs maintained
Robust battery storage policies developed
Abstract
The increasing penetration of renewable energy sources in day-ahead energy markets introduces challenges in balancing supply and demand, ensuring grid resilience, and maintaining trust in decentralized trading systems. This paper proposes a novel framework that integrates the Proximal Policy Optimization (PPO) algorithm, a state-of-the-art reinforcement learning method, with blockchain technology to optimize automated trading strategies for prosumers in day-ahead energy markets. We introduce a comprehensive framework that employs RL agent for multi-objective energy optimization and blockchain for tamper-proof data and transaction management. Simulations using real-world data from the Electricity Reliability Council of Texas (ERCOT) demonstrate the effectiveness of our approach. The RL agent achieves demand-supply balancing within 2\% and maintains near-optimal supply costs for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Electric Power System Optimization · Integrated Energy Systems Optimization
