Signalling for Electricity Demand Response: When is Truth Telling Optimal?
Rene Aid, Anupama Kowli, Ankur A. Kulkarni

TL;DR
This paper models demand response programs using Bayesian persuasion to analyze when truthful information sharing by TSOs is optimal, revealing conditions under which pricing influences honesty.
Contribution
It introduces a Bayesian persuasion framework to analyze demand response, highlighting how electricity pricing affects the TSO's incentive to disclose truthful information.
Findings
Truth-telling is optimal when electricity is priced at marginal cost.
Overpricing leads to no information being disclosed.
Pricing strategies influence the honesty of demand response signals.
Abstract
Utilities and transmission system operators (TSO) around the world implement demand response programs for reducing electricity consumption by sending information on the state of balance between supply demand to end-use consumers. We construct a Bayesian persuasion model to analyse such demand response programs. Using a simple model consisting of two time steps for contract signing and invoking, we analyse the relation between the pricing of electricity and the incentives of the TSO to garble information about the true state of the generation. We show that if the electricity is priced at its marginal cost of production, the TSO has no incentive to lie and always tells the truth. On the other hand, we provide conditions where overpricing of electricity leads the TSO to provide no information to the consumer.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Electric Power System Optimization · Energy Efficiency and Management
Signalling for Electricity Demand Response:
When is Truth Telling Optimal?
René A1̈\mbox{\rmI}d
Anupama Kowli
Ankur A. Kulkarni
University Paris Dauphine — PSL Research University — LEDa UMR 8007-260, Paris, France, [email protected]
Electrical Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, 400076, Maharashtra, India. [email protected]
Systems and Control Engineering, Indian Institute of Technology Bombay, Powai, Mumbai, 400076, Maharashtra, India. [email protected]
Abstract
Utilities and transmission system operators (TSO) around the world implement demand response programs for reducing electricity consumption by sending information on the state of balance between supply demand to end-use consumers. We construct a Bayesian persuasion model to analyse such demand response programs. Using a simple model consisting of two time steps for contract signing and invoking, we analyse the relation between the pricing of electricity and the incentives of the TSO to garble information about the true state of the generation. We show that if the electricity is priced at its marginal cost of production, the TSO has no incentive to lie and always tells the truth. On the other hand, we provide conditions where overpricing of electricity leads the TSO to provide no information to the consumer.
keywords:
Bayesian persuasion, demand response, pricing, signalling
1 Introduction
The last decade has seen significant growth and deployment of renewable energy resources in the power sector [13]. Since renewable power generation is inherently uncertain, variable and hard to control, there is a need to explore sources of flexibility such as energy storage, flexible loads and controllable generators that can help mitigate the effects of renewable power. In particular, last decade has seen several demand response programs where load curtailments help balance out supply deficits. These curtailments can be mandated, or voluntary as a response to price/tariff signals, control signals or other means of signals. For instance, utilities may have direct load control contracts with end users or may impose peak pricing tariffs on them to induce load curtailments and/or shifting. Such demand response schemes are known to lower electricity prices and electricity production costs [9]. In this paper, we are interested in analyzing the outcomes of demand response programs where participation is voluntary and consumers react to signals received from the coordinator for DR.
The setting considered in this paper is relevant in situations where demand response is induced using alternative signals other than pricing. For instance, the MonEcoWatt program of the French TSO, RTE 111https://www.monecowatt.fr, only in French signals the state of the electric systems to induce a response. This is achieved by sending phone text messages when the system is stressed to voluntarily enrolled consumers who receive no economic compensation for their efforts, if any. The signal takes the form of a color: green when no need of action is required, yellow when the load is high, orange when reduction actions would be welcomed and red when they are mostly necessary to avoid load shedding. The French TSO has measured that orange alerts reduced the peak-load consumption by 3% in the region of Britain. There are similar other programs such as Shave The Peak222https://www.greenenergyconsumers.org/shavethepeak program in MA, USA and the Power Alert333https://www.eskom.co.za/eas/power-alert/ program in South Africa, where signals are sent out to enrolled users to indicate the grid stress. The signal can be binary – stress or no stress – or coloured where the color indicates the level of stress. There is not much academic literature on understanding the impacts of such signalling on the actions of the end users.
It is plausible that the impacts of such signalling would also depend on the pricing regime governing the consumer’s behaviour. Signalling and pricing could create conflicting incentives, or create overlapping effects. Again, there does not appear to be any literature covering this matter. We seek to fill this gap with a formal model of signalling in such settings.
1.1 Main contributions
We are concerned with a setting in which a TSO and a consumer are bound by a pre-agreed pricing contract that is in force for the duration being considered here. This contract is designed taking into account the statistics of the generation known at the time of signing the contract. However, owing to the large time duration in the contract, scenarios could arise wherein the contract is valid but not optimal any more given changed circumstances. In addition, in real time specific values of the uncertainty get realized due to natural fluctuations and the TSO may want to take advantage of this information. While the pricing contract cannot be modified on a scenario-wise basis, we explore the possibility for the TSO to influence the consumer’s action by sending signals to persuade the consumer to act accordingly. The design of such signals and their truthfulness is the subject of this paper.
We model a setting in which a TSO and a consumer, share, in addition to a binding pricing contract, a common prior distribution on the level of future generation. However, the realized level of generation is known only to the TSO. The TSO must decide how it may use this fresh information to influence the consumer. Decision-theoretically speaking, the consumer is the only player that can act in our setting: the amount of consumption chosen by the consumer determines the payoff of both players. Thus the TSO’s decisions are limited to only influencing this choice of the consumer by reshaping the latter’s information by sending suitable signals.
Formally we model our setting as that of Bayesian persuasion with the TSO as the sender and the consumer as the receiver of information about a state. In such a setting, the sender sends signals444We assume a setting of costless signalling, popularly known as cheap talk. and the receiver updates its belief about the unknown state using Bayes rule. The problem for the TSO is to design signals such that the TSO obtains the maximum payoff under the optimal action chosen by the consumer under the posterior distribution.
Our quest is to illuminate the role of truthtelling on the part of the TSO. More specifically, when do the signals sent by the TSO constitute truth, and nothing but the truth? Our main finding is that there is a close link between the nature of the contract and truthtelling – if the contract is such that the price equals the marginal cost of generation, truthtelling is optimal. In other words, if the contract is “priced to perfection” in that it has accurately estimated the probability distribution of the future generation, there are no additional benefits to be drawn from reshaping the truth. Similar results hold even in the case where there are multiple consumers.
Fascinatingly “mispriced” contracts may also induce truthtelling – this happens to be the case when the mispricing is in a direction where telling the truth is beneficial to the TSO. When the mispricing is in the opposite direction, garbling the truth is optimal.
1.2 Related works
There is a large academic literature on demand response in electricity from the lens of mechanism design [17, 20, 19, 22, 5, 10]. These works focus on pricing or other mechanism for influencing behaviour, or for eliciting information from consumers regarding their types (e.g., [19] focuses on reporting baselines). Our approach is in a sense dual to this – in our case the TSO provides information (possibly garbled) to the consumer.
Consequently, in our analysis, we ignore two features – adverse selection and moral hazard – of incentives theory. Adverse selection is relevant to settings where there are multiple types of consumers and the TSO is ignorant of the type of the consumer but knows the distribution of types in the population. In such cases, the pricing rule may depend on the optimal reduction effort as shown in [10] and [6]. We circumvent the adverse selection problem by restricting our analysis to only one type of consumer. The moral hazard problem pertains to the settings where the TSO cannot distinguish when the load curtailment is the result of the action of the consumer and when it is purely the outcome of a random event. In such a situation, the pricing rule has to take into account this feature (see [2]). We simplify our analysis by restricting our setting to cases where the consumption is a deterministic function of the consumer’s action.
There at least four key differences between the instruments of long-term pricing and real-time signalling – the timescales at which these are implemented, the knowledge they assume of the uncertainty, the costs involved in implementing them, and the compliance they induce. Incentives such as pricing rules and payment terms, once decided are held fixed for long periods of time. As such they are usually updated on a time frame of a few months or even years. Signals, on the other hand, can be sent to the consumer on a much more frequent basis. Indeed, the TSO may opt to send a fresh signal whenever it has new information, such as updated weather forecasts, any maintenance related down times and so on. The pricing contract is designed before the realization of uncertainty whereas signals are adapted to realized uncertainty. In addition, there is also a difference in the costs involved. Contracts entail legal, regulatory and marketing costs, whereas in our model, signals can be sent virtually free of cost via notifications on mobile phones or SMSes as the MonEcoWatt program. Finally, contracts are of a legally binding nature and therefore ensure compliance. On the other hand signals can be ignored by the consumer.
In a sender-receiver interaction [21], the setting where the sender shapes the probability distribution of the receiver by sending suitable signals is called Bayesian persuasion. In this setting, the sender and receiver have a prior distribution on a payoff-relevant state, whose realized value is known only to the sender. The sender sends signals, upon receiving which, the receiver computes a posterior distribution on the state using Bayes rule. The receiver then takes an action that maximizes the expected utility computed with this posterior distribution. In the simplest signalling setting, the sender has no action of its own, and the receiver’s action determines the payoff both players. The goal of the sender is to design a signalling policy anticipating this response of the sender.
The persuasion setting falls broadly in the area of information design [14]. The use of randomization by the sender leads to the receiver getting ‘mixed signals’; the result being that the true type of the sender gets obfuscated, and a benefit accrues to the sender. This deliberate, strategic ambiguity is a hallmark of such settings, see [15], [11] and [12]. Having said that, there are also models where randomization is not employed, and yet qualitatively similar phenomena occur (see e.g., [8]). Qualitatively similar behaviour has also been seen in the screening setting [23] where the receiver commits first, (see e.g., [27] and [28]), and where players move simultaneously [7].
Considering that the signalling generally results in the sender obfuscating the truth, it instructive to ask when is truth telling optimal for the sender? Distinct classes of equilibria called separating, pooling and semi-separating equilibria, have been studied [25] in this context. In a separating equilibrium, the receiver is able to discern the exact type from the sender’s signal, whereas in a pooling equilibrium, all sender types employ the same signal. An equilibrium that is neither separating nor pooling is called semi-separating. Recent work [8] characterizes these equilibria in a non-Bayesian setting through a quantity called the informativeness of the sender, in a specific setting where the receiver is interested in knowing the sender’s true type. Our contribution in this paper lies in relating truth-telling to the pricing of a pre-existing contract, and, in particular, to estimation of future uncertainties. To the best of our knowledge this connection has not been established before, more so in the context of demand response.
2 Model
2.1 Model: single consumer
We consider an electricity market where the TSO acts as single buyer of all electricity generation from conventional producers and renewable energy producers. We assume the generation is served by enough producers so that the market can be considered competitive. We assume that the TSO has a perfect forecast of the renewable generation denoted , but this level is not known to the consumers. Indeed, even though forecasting renewable energy generation at a daily-time scale is yet an imprecise exercise, the knowlegde of the consumers are significantly lower compared to TSO’s. In this section we consider that consumers can be aggregated in a single representative consumer whose current consumption is denoted .
The consumer can take an action to modify its consumption. To focus on the effects of the information known to the TSO, we assume that there is no noise in the action of the consumer whereby the consumption after action is given as . Thus, by observing the consumption, the TSO knows the action of the consumer and is not facing an observation problem as in the moral hazard demand-response model in [2]. The consumer’s decision has to be taken prior to the knowledge of the true level of renewable generation . The consumer and the TSO are bound by a pricing contract where the per unit price charged for consumption is a function of the consumption and the generation . We assume this function takes the following form
[TABLE]
whereby the unit price is a function difference between the consumption and the generation. Note that, though the generation is not known to the consumer, the above pricing rule is fixed and known. We define the utility of the consumer by
[TABLE]
where is the indirect utility of consuming electricity , is the cost of making undesired modification of consumption . We take the instanciation
[TABLE]
where
The TSO is assumed to know the utility function of the representative consumer.
We assume that both the TSO and the consumer share the same prior on denoted by , and under this prior has bounded support and finite variance. This prior can be thought as the known statistical distribution of wind during the year. The TSO observes the generation and sends a message to the consumer.
The signalling policy of the TSO specifies a probability distribution on the messages for each value of . The policy is denoted , where is the message sent and the observed renewable energy generation. We assume that the signal and the renewable generation share the same bounded support. Telling the truth, all the truth and nothing but the truth consists in setting iff .
Remark 2.1. Cheap talk: We assume that signals can be sent free of cost, commonly known as the setting of cheap talk. In other words, the cost of acquiring information, computing the signal and that of transmitting it is negligible. Physically, our assumption can be justified by observing that for the TSO all these costs are invariant with the actual signal being sent. Weather probes would typical incur a fixed cost and a constant operating cost irrespective of the weather itself. Computation and transmission costs have fallen dramatically over the last two decades and can be assumed to be negligible.
Remark 2.2. Committment: We assume that the TSO commits to a signalling policy. Indeed, if the distribution of the renewable energy is stationary and if the pricing rule does not change, the TSO has no incentive to deviate from an optimal signalling policy.
Remark 2.3. Pricing rule: We assumed that the pricing rule was defined prior to the design of information provision. One could object that the regulator might also act as player in the game between the TSO and the consumers which would make the pricing rule endogenous, provided a suitable objective function of the regulator was given. Yet, this setting might not completely reflect some electricity markets design where retailers might themselves develop demand-response contracts. Thus, the interaction between the pricing of electricity contracts to consumers and the information provision by TSOs is a problem beyond this model. For that reason, we postpone to further research work the analysis of these situations.
For a given signalling policy and for a given signal , the consumer’s problem is to solve:
[TABLE]
Assuming this problem admits a unique solution, we denote the best response of the consumer by .
Knowing the best-responses of the consumer to a given signalling policy , the TSO chooses the signalling policy to solve,
[TABLE]
where the expectation is taken under for the realisation of the pair and is the cost that the TSO has to bear due to the mismatch between the renewable generation and the consumption. This may be interpreted as the cost of procuring conventional generation in the event of a shortfall in renewable generation. Note that can be interpreted as the activation by the TSO of contractual demand-response. We assume a linear quadratic production cost function, i.e.,
[TABLE]
so that the marginal cost curve is affine in production. Although in real life, the marginal cost of electricity production exhibits a steeper curvature, the assumption we make is common in the economic literature of electricity market regulation as it allows closed-form expressions for comparative static studies (see examples in [1] and [6]). Besides, linear-quadratic preferences has also been investigated in the Bayesian persuasion economic literature as an important case (see [26]).
The pricing rule is defined as the marginal cost of production of electricity, namely
[TABLE]
The significance and implications of this assumption will be discussed in the following section. We will later consider deviations from this pricing rule. But note that taking the pricing rule as the marginal cost of production induces an effort reduction of the consumer that precisely corresponds to social welfare maximisation in the case of full information on the renewable generation as shown in Boiteux’s marginal cost pricing of electricity theory ([3, 4]).
We now reflect on how the consumer approaches this problem. To solve his problem (2), the consumer computes the posterior probability that the true generation is , upon receiving the message :
[TABLE]
assuming that the denominator above is nonzero. Thus the expectation in (2) is taken with respect to the posterior distribution above defined by
3 Results
3.1 When is truth telling is optimal?
We now present the first main contribution of this paper. Recall that we have assumed that the TSO and the consumer have agreed on a pricing contract.
Recall also that we had assumed that the unit price was equal to the marginal cost of generation. This assumption implies that the cost of generation is known perfectly and this cost has been priced in in the pricing formula. However, the pricing formula states that the price is a function of the difference between the generation and the consumption Since is not known to the consumer, a natural question is whether the TSO should convey some information to about to the consumer. And if so, what should be the information it should convey?
Our first main result shows that when the pricing is “perfect” in a certain sense, the optimal signal for the TSO is to disclose without any ambiguity. In other words, truth telling is optimal for the TSO.
We define the following constants
[TABLE]
Proposition 3.1
It holds that
The best-response function of the consumer for a given signalling policy is given by
[TABLE]
As a result, \mathbb{E}\big{[}\hat{a}^{\pi}\big{]}=a_{0}-\lambda\mathbb{E}\big{[}q\big{]} does not depend on . 2. 2.
It is optimal for the TSO to tell the truth.
**Proof. ** (i) For a given signal , consider the optimisation problem of the consumer given by (2). We have J(a,\tilde{q})=\mathbb{E}^{\mathbb{P}}\big{[}U(q,a)\big{]}. Taking derivative inside the expectation and writing first-order condition provides
[TABLE]
Thus the optimal response of the consumer is given by and which gives the result. Besides, by the law of iterated expectations, we get \mathbb{E}\big{[}\hat{a}^{\pi}\big{]}=a_{0}-\lambda\mathbb{E}\big{[}q\big{]}.
(ii) The criterion of the TSO can be written as
[TABLE]
with . The two first terms are independent of . Besides, because of (i), it holds that and thus, . Because , the only term that depends on in the criteria of the TSO is
[TABLE]
Thus, minimizing is equivalent to maximizing . Now, for any random variable correlated with , the following inequalities hold from Jensen’s inequality:
[TABLE]
When is independent of (i.e., no information is revealed), which is the minimum, whereas if (full information revelation), which the maximum. Thus, the TSO’s criteria is minimized when he tells the truth.
Remark 3.4. Increasing consumption: Even if what can be considered as the baseline consumption level is positive, it might happen that . When is small, the consumer is responsive to the information on the availability of cheap energy and takes these opportunities to increase his consumption. Although our model design is not intented to capture this phenomenon, it is a behaviour looked for by TSOs. The Low Carbon London research project is an example of a large scale demand-response research program where incentives to increase consumption were proposed to consumers [24]).
Remark 3.5. Trustful consumer: Assume the consumer does not behave as a Bayesian rational decision-maker but instead fully trust the sender’s signal similar to the setting considered in [16]. In that situation, the optimal response of the consumer is because he would strictly believe that the signal is the truth and makes his optimisation accordingly. Hence, his actions are independent of and . There are no reasons why would be equal to . Redoing the computation of the result (ii) in the Proof above for the criteria of the TSO shows that is a constant plus the term
[TABLE]
Thus, the TSO should maximize the distance of the signal from . Clearly, taking to be a constant equal to the end point the support of farthest from maximizes the TSO’s criterion.
Proposition 3.1 states that the marginal cost pricing rule ensures that the TSO has no incentive to garble the production forecast and that the consumer’s optimal response under optimal signalling is the social optimum. The most extreme case of garbling information would be if the TSO gives an uninformative signal. In other words, the posterior probability is equal to the prior. In that case the TSO’s payoff is minimum. Thus, sending any signal is better than sending no signal at all and sending a truthful signal is the optimal policy.
3.2 Deviation from marginal cost pricing
We investigate now if this property still applies if we deviate from the marginal production cost pricing rule. We assume now that the pricing function is given by
[TABLE]
Thus, the TSO charges a price either with a higher or lower slope compared to . In that situation, the only change in the best-response of the consumer is that is replaced by . Thus, we have
[TABLE]
The criteria of the TSO now becomes
[TABLE]
Besides, we have \mathbb{E}^{\pi\otimes\mathbb{P}}\big{[}q\hat{q}\big{]}=\mathbb{E}^{\pi\otimes\mathbb{P}}\big{[}(\hat{q})^{2}\big{]}. Thus, the TSO’s criteria is a constant term plus
[TABLE]
And, thus depending on the sign of , maximizing consists either on telling the truth or garbling information. Solving for the threshold above which the TSO has incentive to provide no information yields the following proposition.
Proposition 3.2
It holds that
The TSO has no incentive to deviate from telling the truth as long as
[TABLE] 2. 2.
When , it is optimal to provide no information.
**Remark 3.6. ** It is remarkable that the only factors that enter in the determination of the threshold are the proportional part of the pricing rule and the cost of reduction effort of the consumer. The function is decreasing and satisfies and . Hence, whatever the value of , there is a non-empty range of values for , namely where the TSO has an incentive to deceive the consumer. When reduction efforts are costless to the consumer, the threshold equals , meaning that the TSO starts to take advantage of the signal only when she is paid 50% more than the marginal cost. For instance, if the TSO always predicts high renewable energy generation, the consumer’s actions is where is anticipated to be high. Thus, the consumer anticipating low prices makes little reduction effort. The revenue term of the TSO criteria, namely , outweighs the production cost term .
3.3 Multiple consumers
We extend the former results in the case where the TSO faces a set of consumers. We assume she uses the same signal for all agents and send the same messages. The criterion of the TSO now becomes
[TABLE]
where the total renewable generation is assumed to have a bounded support. Again, the TSO criterion can be rewritten as
[TABLE]
with Thus, the TSO only has to care about the average response of the consumers to her signal . On their side, consumers act as Bayesian Nash players sharing all the same prior on as the TSO. For each consumer, we set
[TABLE]
Each consumer wishes to maximise
[TABLE]
For a given signalling policy , a Nash equilibrium of the consumers is vector of functions of the message such that for and for all
[TABLE]
We observe that the game played by consumers is an exact potential game (see [18]). Define the potential function
[TABLE]
It holds that
[TABLE]
Moreover, is strictly concave in for each . Hence the Nash equilibria of the game with payoffs are equivalent to maximizers of the potential functions . As is quadratic and negative definite, the equilibrium is unique and the Nash equilibrium is linear in . We have
[TABLE]
Since the TSO’s objective depends only on , we are in the same setting as in the single case agent with a TSO facing one agent with average response . Hence, the condition of truth-telling of Proposition 3.2 still applies. Yet, the threshold condition for optimality of truth telling changes. Telling the truth is optimal as long as
[TABLE]
Because , for sufficiently large, the constraint is violated and the TSO has incentive to provide no information. Uniqueness of the threshold is obtained by the monotonicity and continuity of the function . But, unless some hypothesis are made on the ’s, it is unlikely to extract a condition as simple as (11). Nevertheless, if we denote by with , the threshold for players with a profile of reduction effort costs , it holds that
[TABLE]
Relation (12) states that a population of consumers is less vulnerable to misinformation rather than a single representative consumer with the average characteristics of the population.
Acknowledgments
We warmly thank one of the referees for pointing properties (9) and the fact that the consumers’ game is a potential game. The research in this paper was supported by the grant 6301-2 from the Indo-French Centre for the Promotion of Advanced Research. Ankur’s work was also supported by the grant CRG/2019/002975 of the Science and Engineering Research Board, Department of Science and Technology, India. René’s work was supported by the Chair EDF-CA CIB of Finance and Sustainable Development, the Finance for Energy Markets Research Initiative and the ANR project EcoREES ANR-19-CE05-0042.
Appendix A Proof
We establish inequality (12). Note that
[TABLE]
which is the threshold in the case of one representative consumer in Proposition 3.2 with cost of reduction . Denote
[TABLE]
Both and are monotonic non-decreasing functions. We have
[TABLE]
We have and ; thus, the thresholds are such that . Besides, by Jensen’s inequality, it holds that
[TABLE]
and thus . Hence, for larger than , the difference is positive and thus, the zero of is larger than the zero of .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] René A 1̈\mbox{\rm I}d, Gilles Chemla, Arnaud Porchet and Nizar Touzi. Hedging and Vertical Integration in Electricity Markets. Management Science , 57(8):1438–1452, 2011.
- 2[2] René A 1̈\mbox{\rm I}d, Dylan Possama 1̈\mbox{\rm I}, and Nizar Touzi. Optimal electricity demand response contracting with responsiveness incentives. Mathematics of Operations Research , 47(3):2112–2137, 2022.
- 3[3] Marcel Boiteux. La Tarification Au Coût Marginal et Les Demandes Aléatoires. In Cahiers Du Séminaire d’Économétrie , 1:56–69. 1951.
- 4[4] Marcel Boiteux. Peak-Load Pricing. In The Journal of Business , 33(2):157–179. 1960.
- 5[5] Jun Cao, Bo Yang, Cailian Chen, and Xinping Guan. Optimal demand response using mechanism design in the smart grid. In Proceedings of the 31st Chinese control conference , pages 2520–2525. IEEE, 2012.
- 6[6] Claude Crampes and Thomas-Olivier Léautier. Demand response in adjustment markets for electricity. Journal of Regulatory Economics , 48(2):169–193, 2015.
- 7[7] Vincent P. Crawford and Joel Sobel. Strategic information transmission. Econometrica , pages 1431–1451, 1982.
- 8[8] Reema Deori and Ankur A. Kulkarni. Information revelation through signalling. to appear in Systems and Control Letters , 2022.
