Towards Optimal Pricing of Demand Response -- A Nonparametric Constrained Policy Optimization Approach
Jun Song, Chaoyue Zhao

TL;DR
This paper introduces a nonparametric constrained policy optimization method for demand response pricing, ensuring stability and improved optimality without restrictive policy assumptions, validated through experiments.
Contribution
It proposes a novel nonparametric RL approach for demand response pricing that guarantees stability and optimality, overcoming limitations of existing parametric methods.
Findings
Outperforms state-of-the-art RL algorithms in demand response cases
Ensures stable and optimal pricing policies
Provides a closed-form optimal policy update expression
Abstract
Demand response (DR) has been demonstrated to be an effective method for reducing peak load and mitigating uncertainties on both the supply and demand sides of the electricity market. One critical question for DR research is how to appropriately adjust electricity prices in order to shift electrical load from peak to off-peak hours. In recent years, reinforcement learning (RL) has been used to address the price-based DR problem because it is a model-free technique that does not necessitate the identification of models for end-use customers. However, the majority of RL methods cannot guarantee the stability and optimality of the learned pricing policy, which is undesirable in safety-critical power systems and may result in high customer bills. In this paper, we propose an innovative nonparametric constrained policy optimization approach that improves optimality while ensuring stability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Energy Efficiency and Management · Electric Power System Optimization
