Risk-Sensitive Learning and Pricing for Demand Response
Kia Khezeli, Eilyan Bitar

TL;DR
This paper develops a dynamic pricing strategy for demand response in electricity markets, balancing learning demand parameters and maximizing risk-sensitive payoff, with proven regret bounds and convergence guarantees.
Contribution
It introduces a novel pricing policy that adapts over time to unknown demand models, achieving near-optimal performance with regret bounds and convergence properties.
Findings
Expected payoff loss is at most O(√T log T) compared to an oracle.
Pricing sequence converges to the oracle optimal prices in mean square.
Policy effectively balances exploration and exploitation in demand response.
Abstract
We consider the setting in which an electric power utility seeks to curtail its peak electricity demand by offering a fixed group of customers a uniform price for reductions in consumption relative to their predetermined baselines. The underlying demand curve, which describes the aggregate reduction in consumption in response to the offered price, is assumed to be affine and subject to unobservable random shocks. Assuming that both the parameters of the demand curve and the distribution of the random shocks are initially unknown to the utility, we investigate the extent to which the utility might dynamically adjust its offered prices to maximize its cumulative risk-sensitive payoff over a finite number of days. In order to do so effectively, the utility must design its pricing policy to balance the tradeoff between the need to learn the unknown demand model (exploration) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
