A Minimal Incentive-based Demand Response Program With Self Reported   Baseline Mechanism

Deepan Muthirayan; Enrique Baeyens; Pratyush Chakraborty; Kameshwar; Poolla; Pramod P. Khargonekar

arXiv:1901.02923·eess.SY·December 20, 2024

A Minimal Incentive-based Demand Response Program With Self Reported Baseline Mechanism

Deepan Muthirayan, Enrique Baeyens, Pratyush Chakraborty, Kameshwar, Poolla, Pramod P. Khargonekar

PDF

TL;DR

This paper introduces a simple, incentive-compatible demand response program where consumers self-report baselines, using random selection and penalties to control baseline inflation and reduce costs effectively.

Contribution

It proposes a novel incentive-based DR mechanism with self-reported baselines, controlling inflation and costs through randomization and penalties, outperforming existing methods.

Findings

01

Self-reported baseline mechanism outperforms traditional methods.

02

Random selection and penalties effectively control baseline inflation.

03

Mechanism reduces system operator costs while ensuring load reduction.

Abstract

In this paper, we propose a novel incentive based Demand Response (DR) program with a self reported baseline mechanism. The System Operator (SO) managing the DR program recruits consumers or aggregators of DR resources. The recruited consumers are required to only report their baseline, which is the minimal information necessary for any DR program. During a DR event, a set of consumers, from this pool of recruited consumers, are randomly selected. The consumers are selected such that the required load reduction is delivered. The selected consumers, who reduce their load, are rewarded for their services and other recruited consumers, who deviate from their reported baseline, are penalized. The randomization in selection and penalty ensure that the baseline inflation is controlled. We also justify that the selection probability can be simultaneously used to control SO's cost. This allows…

Tables2

Table 1. TABLE I: Notation

$q$	Energy consumption of consumer
$θ$	Exogenous random variable
$u$	Utility of consumer expressed in monetary units
$π_{0}$	Retail price of energy
$π_{2}$	Reward/kWh awarded to consumer $k$
$f$	Baseline report of consumer
$π^{*}$	Threshold Market Clearing Price (TMC)
$p$	Probability of consumer being signaled
$R$	Reward function for load reduction
$Φ$	Penalty function for deviation from baseline
$Π$	Inverse supply function
$Q_{0}$	Peak load

Table 2. TABLE II: Baseline Inflation, δ f ∗ 𝛿 superscript 𝑓 \delta f^{*}

$d$	0.1	0.2	0.3	0.4
$δ f^{*}$ (theory)	$0.0011$	$0.0017$	$0.0022$	$0.0028$
$δ f^{*}$ (simul.)	$0.0012$	$0.0017$	$0.0023$	$0.0028$

Equations190

μ (q, θ) = \frac{\partial u ( q , θ )}{\partial q} .

μ (q, θ) = \frac{\partial u ( q , θ )}{\partial q} .

\forall q : μ (q, θ) > 0, \frac{\partial μ ( q , θ )}{\partial q} < 0.

\forall q : μ (q, θ) > 0, \frac{\partial μ ( q , θ )}{\partial q} < 0.

R(\pi_{2},f,q)=\left\{\begin{array}[]{ll}\pi_{2}(f-q),&\text{if consumer is called,}\\ 0,&\text{otherwise.}\end{array}\right.

R(\pi_{2},f,q)=\left\{\begin{array}[]{ll}\pi_{2}(f-q),&\text{if consumer is called,}\\ 0,&\text{otherwise.}\end{array}\right.

\Phi(f,q)=\left\{\begin{array}[]{ll}0,&\text{if consumer is called,}\\ \phi(f-q),&\text{otherwise.}\end{array}\right.

\Phi(f,q)=\left\{\begin{array}[]{ll}0,&\text{if consumer is called,}\\ \phi(f-q),&\text{otherwise.}\end{array}\right.

ϕ (0) = ϕ^{'} (0) = 0, \forall x : ϕ (x) = ϕ (- x), ϕ^{''} (x) > 0,

ϕ (0) = ϕ^{'} (0) = 0, \forall x : ϕ (x) = ϕ (- x), ϕ^{''} (x) > 0,

H (f) = \mathds E_{θ} [q min {π_{0} q - u (q, θ) + Φ (f, q) - R (π_{2}, f, q)}] .

H (f) = \mathds E_{θ} [q min {π_{0} q - u (q, θ) + Φ (f, q) - R (π_{2}, f, q)}] .

CP: f min H (f) .

CP: f min H (f) .

J^{a} (q, θ) = π_{0} q - u (q, θ),

J^{a} (q, θ) = π_{0} q - u (q, θ),

q^{a} (θ) = ar g q min J^{a} (q, θ),

q^{a} (θ) = ar g q min J^{a} (q, θ),

π_{0} - \frac{\partial u ( q , θ )}{\partial q} = 0.

π_{0} - \frac{\partial u ( q , θ )}{\partial q} = 0.

q^{a} (θ) = μ^{- 1} (π_{0}, θ),

q^{a} (θ) = μ^{- 1} (π_{0}, θ),

J^{b} (f, q, θ) = π_{0} q - u (q, θ) + ϕ (f - q) .

J^{b} (f, q, θ) = π_{0} q - u (q, θ) + ϕ (f - q) .

q^{b} (f, θ) = ar g q min J^{b} (f, q, θ),

q^{b} (f, θ) = ar g q min J^{b} (f, q, θ),

π_{0} - \frac{\partial u ( q , θ )}{\partial q} - ϕ^{'} (f - q) = 0.

π_{0} - \frac{\partial u ( q , θ )}{\partial q} - ϕ^{'} (f - q) = 0.

q^{b} (f, θ) = μ^{- 1} (π_{0} - ϕ^{'} (f - q^{b} (f, θ)), θ),

q^{b} (f, θ) = μ^{- 1} (π_{0} - ϕ^{'} (f - q^{b} (f, θ)), θ),

J^{c} (q, f, θ) = π_{0} q - u (q, θ) - π_{2} (f - q) .

J^{c} (q, f, θ) = π_{0} q - u (q, θ) - π_{2} (f - q) .

q^{c} (f, θ) = ar g q min J^{c} (f, q, θ) .

q^{c} (f, θ) = ar g q min J^{c} (f, q, θ) .

π_{0} - \frac{\partial u ( q , θ )}{\partial q} + π_{2} = 0.

π_{0} - \frac{\partial u ( q , θ )}{\partial q} + π_{2} = 0.

q^{c} (θ) = μ^{- 1} (π_{0} + π_{2}, θ) .

q^{c} (θ) = μ^{- 1} (π_{0} + π_{2}, θ) .

H (f) = p \mathds E_{θ} J^{c} (f, q^{c}, θ) + (1 - p) \mathds E_{θ} J^{b} (f, q^{b}, θ),

H (f) = p \mathds E_{θ} J^{c} (f, q^{c}, θ) + (1 - p) \mathds E_{θ} J^{b} (f, q^{b}, θ),

M (f) = p \mathds E_{θ} \frac{\partial u ( q ^{c} , θ )}{\partial q} + (1 - p) \mathds E_{θ} \frac{\partial u ( q ^{b} , θ )}{\partial q} .

M (f) = p \mathds E_{θ} \frac{\partial u ( q ^{c} , θ )}{\partial q} + (1 - p) \mathds E_{θ} \frac{\partial u ( q ^{b} , θ )}{\partial q} .

\frac{π ^{*} ( f - \mathds E _{θ} q ^{c} ( θ ))}{\mathds E _{θ} q ^{a} ( θ ) - \mathds E _{θ} q ^{c} ( θ )} = π^{*} + π^{*} \frac{\mathds E _{θ} δ f ( θ )}{\mathds E _{θ} q ^{a} ( θ ) - \mathds E _{θ} q ^{c} ( θ )},

\frac{π ^{*} ( f - \mathds E _{θ} q ^{c} ( θ ))}{\mathds E _{θ} q ^{a} ( θ ) - \mathds E _{θ} q ^{c} ( θ )} = π^{*} + π^{*} \frac{\mathds E _{θ} δ f ( θ )}{\mathds E _{θ} q ^{a} ( θ ) - \mathds E _{θ} q ^{c} ( θ )},

\frac{d H ( f )}{df} = p \mathds E_{θ} \frac{d J ^{c} ( f , q ^{c} , θ )}{df} = - p π_{2} .

\frac{d H ( f )}{df} = p \mathds E_{θ} \frac{d J ^{c} ( f , q ^{c} , θ )}{df} = - p π_{2} .

p \to 0 lim E_{θ} δ \tilde{f}^{*} (θ) = 0.

p \to 0 lim E_{θ} δ \tilde{f}^{*} (θ) = 0.

p \to 0 lim E_{θ} δ f^{*} (θ) = 0.

p \to 0 lim E_{θ} δ f^{*} (θ) = 0.

\mathds E_{θ} δ f^{*} (p) = f^{*} - \mathds E q^{a} (θ) = (d + λ) \frac{p π _{2}}{1 - p} .

\mathds E_{θ} δ f^{*} (p) = f^{*} - \mathds E q^{a} (θ) = (d + λ) \frac{p π _{2}}{1 - p} .

\mathds E_{θ} δ f^{*} = f^{*} - \mathds E_{θ} q^{a} (θ) \geq \frac{d p π _{2}}{1 - p} .

\mathds E_{θ} δ f^{*} = f^{*} - \mathds E_{θ} q^{a} (θ) \geq \frac{d p π _{2}}{1 - p} .

\phi(f-q)=\left\{\begin{array}[]{ll}\frac{(|f-q|-\epsilon)^{2}}{2\lambda},&\textrm{if}\ |f-q|\geq\epsilon,\\ 0,&\textrm{otherwise.}\end{array}\right.

\phi(f-q)=\left\{\begin{array}[]{ll}\frac{(|f-q|-\epsilon)^{2}}{2\lambda},&\textrm{if}\ |f-q|\geq\epsilon,\\ 0,&\textrm{otherwise.}\end{array}\right.

\forall (q, θ) : \frac{\partial u ^{2} ( q , θ )}{\partial q ^{2}} = - 1/ d .

\forall (q, θ) : \frac{\partial u ^{2} ( q , θ )}{\partial q ^{2}} = - 1/ d .

\mathds E_{θ} δ f^{*} (p) = f^{*} - \mathds E q^{a} (θ) \leq (d + λ) \frac{p π _{2}}{( 1 - p )} + ϵ,

\mathds E_{θ} δ f^{*} (p) = f^{*} - \mathds E q^{a} (θ) \leq (d + λ) \frac{p π _{2}}{( 1 - p )} + ϵ,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

A Minimal Incentive-based Demand Response Program With Self Reported Baseline Mechanism

Deepan Muthirayana, Enrique Baeyensb, Pratyush Chakrabortyc,

Kameshwar Poollad and Pramod P. Khargonekara This research is supported by the National Science Foundation under grants EAGER-1549945, CPS-1646612, CNS-1723856 and by the National Research Foundation of Singapore under a grant to the Berkeley Alliance for Research in Singaporea Department of Electrical Engineering and Computer Science, University of California, Irvine, CA, USAb Instituto de las Tecnologías Avanzadas de la Producción, Universidad de Valladolid, Valladolid, Spain c Department of Physics and Astronomy, Northwestern University, IL, USA d Department of Electrical Engineering and Computer Science, University of California, Berkeley, CA, USA

Abstract

In this paper, we propose a novel incentive based Demand Response (DR) program with a self reported baseline mechanism. The System Operator (SO) managing the DR program recruits consumers or aggregators of DR resources. The recruited consumers are required to only report their baseline, which is the minimal information necessary for any DR program. During a DR event, a set of consumers, from this pool of recruited consumers, are randomly selected. The consumers are selected such that the required load reduction is delivered. The selected consumers, who reduce their load, are rewarded for their services and other recruited consumers, who deviate from their reported baseline, are penalized. The randomization in selection and penalty ensure that the baseline inflation is controlled. We also justify that the selection probability can be simultaneously used to control SO’s cost. This allows the SO to design the mechanism such that its cost is almost optimal when there are no recruitment costs or atleast significantly reduced otherwise. Finally, we also show that the proposed method of self-reported baseline outperforms other baseline estimation methods commonly used in practice.

Index Terms:

Demand Response, Baseline Estimation, Baseline Inflation.

I Introduction

Demand Response (DR) programs [1] are potentially powerful tools to modulate the demand for electricity in a wide variety of situations. For example, at certain times such as mid-afternoons on hot summer days, the supply of additional electric power is scarce and expensive. At these times, it is more cost-effective to reduce demand than to increase supply to maintain power balance. Another scenario is a grid with high renewable penetration. Here, DR promises to be a better alternative compared to other expensive and polluting reserves to balance the variability in renewable generation. Realizing its potential, the 2005 Energy Policy Act provided the Congressional mandate to promote DR in organized wholesale electricity markets. The FERC order 745 [2] met this mandate by prescribing that demand response resource owners should be allowed to offer their demand reduction as if it were a supply resource rather than a bid to reduce demand so that the market operates fairly.

Dynamic pricing based DR programs [3, 4] can ideally achieve market efficiency, but they require more complex metering and communication infrastructure to achieve this which raises their implementation costs [5, 6]. Furthermore, consumers may not be responsive to dynamic pricing [7]. Alternatively, consumers could be signaled to reduce consumption and paid for their load reductions. Such schemes are referred to as Incentive-based DR programs or Demand Reduction programs. There are three key components of any incentive-based DR program: (a) a baseline against which demand reduction is measured, (b) a payment scheme for agents who reduce their consumption from the baseline, and (c) various contractual clauses such as limits on the frequency of DR events or penalties for nonconforming agents.

Thus, incentive-based DR programs require an established baseline against which consumer’s load reduction is measured. The baseline is an estimate of the consumption when the consumer is not participating in the DR program. For example, the California Independent System Operator (CAISO) uses the average of the consumption on the ten most recent non-event days as the baseline estimate [8]. The CAISO method also uses a morning adjustment factor to account for any variability in consumption pattern during the day of the DR event from the past. Current methods to establish the baseline raise several concerns. One major concern is that the consumers have an incentive to artificially inflate their baseline to increase their profits [9, 10, 11, 12]. Cases have been reported where the participants artificially inflated their baseline for increasing payments [13]. Fairness can also be a concern. Consider, for instance, an agent who happens to be on vacation during a DR event and receives a payment for load reduction without suffering any hardship. This can be perceived as unfair by other agents who deliberately curtail their consumption and suffer some disutility.

I-A Our Contribution

We propose a setting where the System Operator (SO) recruits DR providers as an alternate resource to balance supply and demand during high price periods. The providers could be either individual consumers or aggregators of DR services. We also assume that the SO has access to market outcomes, which is a reasonable assumption. The objective of the SO is to minimize cost when energy purchase from the wholesale energy market is expensive. This usually happens during peak load scenarios, when the market price exceeds a threshold market clearing (TMC) price. The TMC price is the price above which it is profitable for the SO to call the recruited DR providers or consumers to provide load reduction.

The main aspects of the DR mechanism we propose are: (i) self-reported baseline (ii) randomized selection of consumers, and (iii) penalty for uninstructed deviations. In this mechanism, the consumers are required to self-report their baselines and are paid at a pre-determined reward for every unit of reduction they provide. A large group of consumers is recruited so that the necessary load reduction is delivered reliably. When a DR event occurs, consumers are selected randomly from this pool of recruited consumers to provide the required service. The load reduction is measured by the difference between the self-reported baseline and the measured consumption. The consumers signaled to reduce are paid in proportion to the measured reduction and the prescribed reward. The consumers who are not called are penalized for uninstructed deviation from the baseline. This penalty and randomized selection controls baseline inflation. The proposed DR program requires only baseline information from the individual consumers, which makes it minimal in terms of the information it elicits from the consumers.

In this paper, we characterize baseline inflation for a quadratic utility function with uncertain consumption and a quadratic penalty function with and without a deadband. The deadband in the penalty function is required to achieve individual rationality. Using this characterization, we show that the proposed mechanism controls baseline inflation. We also justify that by choosing an appropriate calling probability, which depends on the recruitment cost, the SO can signficantly reduce its costs. Finally, we show that the self-reported baseline establishes a better estimate of the mean baseline when compared to conventional methods such as the CAISO’s m/m method [8]. Since the excess payments made to the consumers are proportional to the baseline used in a DR program, this establishes that the self-reported baseline approach is more cost effective than the CAISO approach.

Two concerns can arise with the self-reported baseline idea. One is the fatigue in reporting a baseline and the other is the lack of knowledge of one’s own baseline. Notwithstanding, self-reported baseline is still a viable method. This is because, firstly, the proposed mechanism is for peak load scenarios which are rare events. Secondly, we expect a energy management system to manage the load consumption pattern of a consumer in the future. Given the consumer’s preferences, this energy management system should have the capability to estimate the baseline and report it to the operator or the load serving entity. In addition to all of the above aspects, the self-reported baseline DR mechanism can also avoid bias and inflation in its estimate of the baseline.

I-B Related Work

There exists substantial literature on baseline estimation methods [14, 15, 16, 17, 18, 19, 20, 18, 21, 22, 23]. These can be broadly classified into three classes: (a) averaging, (b) regression, and (c) control group methods.

Averaging methods determine baselines by averaging the consumption on past days that are similar (e.g., in weather conditions) to the event day. A detailed comparison of different averaging methods is offered in [14, 15, 17]. Averaging methods are simple but they suffer from estimation biases [17, 19, 18], and require a significant amount of data, especially for residential DR programs [18].

Regression methods estimate a load prediction model based on historical data which is then used to predict the baseline [21, 16]. They can potentially overcome biases incurred by averaging methods [24, 18]. But they often require considerable historical data for acceptable accuracy, and the models may not capture the complex behavior of individual consumers.

Control group methods have been suggested to have better accuracy than averaging or regression methods and do not require large amounts of historical data [22]. However these methods require the SO to recruit an additional set of consumers and also install additional metering infrastructure for these consumers. In addition, prior data based analysis, to identify the most appropriate control group, might be required depending on the control group method deployed. This raises their costs of implementation [22]. We also show later that the adverse incentives to inflate still persists in this method. Compared to all the above methods, the proposed self-reported baseline avoids all of these issues, i.e. (i) bias and inflation, (ii) need for historical data, and (iii) high implementation cost.

In order to avoid baseline estimation, in a previous work [25], we addressed the DR problem as a mechanism design problem. The setting considered in [23] has an aggregator and an Utility or SO. The Utility determines the required load reduction $D$ kWh that is to be delivered by the aggregator based on the system requirements. The aggregator recruits consumers to deliver the required reduction. The mechanism that we proposed for the aggregator requires the consumers to report both their marginal utility and their baseline consumption. The aggregator uses the marginal utility reports to select consumers such that its overall cost is minimized while the load reduction target $D$ is met. A drawback of this mechanism in terms of implementation is that the consumers need not have knowledge of their true marginal utilities.

The new approach proposed here also avoids baseline estimation by requiring the consumers to report their baseline consumption, but the individual marginal utility need not be disclosed. The mechanism is minimal in terms of the information it elicits from the consumers because it does not require either historical data or any additional infrastructure. The authors in [26] use a similar problem formulation to ours and propose an incentive based DR mechanism, but do not address how the reward price is set, how the consumers are selected so that the DR service is delivered reliably and the cost aspect of the mechanism. In addition they also ignore any randomness in the consumption of the consumers. Here, we consider all of the above aspects and the randomness in consumption. We also provide comparison with other baseline estimation methods.

While some parallels can be drawn with dynamic pricing based DR mechanisms [27, 28, 29], the setting we consider here is different. These mechanisms essentially influence consumers by using time varying prices to alter their energy consumption so that system objectives are met. On the contrary, the central problem we consider is to recruit DR resources that can deliver a certain amount of load reduction at certain times of a month which coincides with peak load conditions. This requires the estimation of consumer baseline because measuring load reduction requires a baseline. Hence baseline estimation becomes a primary concern in our setting whereas such a requirement does not arise in the dynamic pricing DR setting.

The remainder of this paper is organized as follows. In Section II, we introduce the consumer model and the incentive-based self-reported DR program. In Section III, we solve for the optimal consumer forecast and characterize baseline inflation for a quadratic utility function and a quadratic penalty with and without deadband. In section IV we discuss SO’s cost. In Section V, we compare self-reported baseline with other conventional baseline estimation methods. Finally, we conclude in Section VI.

II Problem Setup

In this section, we describe the market model, the consumer model and the incentive-based DR program. A summary of the notations is given in Table I.

II-A Market Model

The market model is represented by the wholesale market’s inverse supply function $\Pi(Q)$ which provides the energy price as a function of the net energy transacted in the wholesale market, see Fig. 1. The market inverse supply function is assumed to be convex with respect to $Q$ and monotone increasing, i.e. with positive derivative $\Pi^{\prime}(Q)>0$ . The threshold market clearing price $\pi^{*}$ (TMC price), as defined earlier, is the market price above which it is profitable for the SO to call the consumers. Given the inverse supply curve of the market, this price can be computed a priori. In scenarios where the inverse supply function is not available a priori, the SO can estimate it using data from past twelve months. This is typical of many system operators such as the CAISO which publishes threshold market clearing prices for the next target month using data from past twelve months. The assumption we make is that this estimate is reflective of the true TMC price.

Note that the calculation of the TMC price from the inverse supply function may not be straightforward for a network model. This would require a detailed analysis of how the congestion constraints influence the Locational Marginal Prices (LMPs) of the nodes and is model specific. The main results of this paper will still hold provided $\pi^{*}$ , i.e. the threshold market clearing price for a node in the network, is determined via the network model. It is beyond the scope of this paper to discuss in detail the determination of this price under such more complicated settings. The main goal of this paper is to illustrate the idea of self-reported baseline and its effectiveness.

II-B Consumer Model

Consider a residential consumer whose consumption is denoted by $q$ . Let $\theta$ be a random variable that is drawn from a continuous distribution. The utility of consumption of a consumer depends on this random variable. We assume that the distribution of $\theta$ includes every possible source of uncertainty. For example, $\theta$ could represent the consumer’s state where the consumer could either be at home or not. It could also model the randomness induced due to external weather conditions like temperature. Let the private utility function which is expressed in monetary units be $u(q,\theta)$ , which is assumed to be a strictly concave monotone increasing in $q$ . We also assume that the random variable $\theta$ is realized at the time when consumption is accomplished. Define the marginal utility $\mu(q,\theta)$ as follows:

[TABLE]

Note that since $u(q,\theta)$ is monotone increasing and strictly concave in $q$ , we have:

[TABLE]

II-C Incentive-Based Demand Response Program

The SO signals a DR event when the market price exceeds the TMC price. The novel DR program that we propose comprises a self-reported baseline mechanism. The mechanism has two stages which are as follows.

Stage 1 (Reporting)

In this stage, the consumer self reports its baseline $f$ and the SO announces the following quantities:

the probability $p$ of calling a consumer, 2. 2.

the reward function $R(\pi_{2},f,q)$ for reducing consumption $(f-q)$ , 3. 3.

reward per unit reduction $\pi_{2}$ which is equal to the TMC price $\pi^{*}$ , 4. 4.

the penalty function $\Phi(f,q)$ for consumers who deviate from their reported baseline when they are not called for DR service.

This penalty function $\Phi$ is critical to ensure that the consumers do not inflate their baseline report. At the same time, the penalty should not discourage participation by preventing lack of profitability for the participants. Based on the reward per unit reduction $\pi_{2}$ and the penalty $\Phi(f,q)$ , each consumer submits the baseline report $f$ .

Stage 2 (DR Event)

In the second stage, a DR event is triggered when the SO expects the market price to shoot above the TMC price. The SO then selects randomly from the pool of recruited consumers and the selected consumers are signaled to reduce consumption. The SO observes the aggregated consumption $Q$ of those selected consumers. By the mechanism, the consumers who are signaled and reduce consumption are paid $\pi_{2}$ per unit of reduction. However, those recruited consumers that are not signaled are penalized for deviating from their reported baseline as prescribed by the penalty function.

II-C1 Consumer Recruitment and Selection

The objective of SO is to minimize its cost during DR events. During a DR event, the load is at its peak $Q_{0}$ , and is desirable to achieve a load reduction of $\Delta Q^{*}$ , which is the optimal load reduction (Refer Fig. 1). The SO recruits $n$ sets of consumers. The recruitment is such that each set of consumers reduces load by $\Delta Q^{*}$ for the specified reward/kWh $\pi^{*}$ . The consumers are tested before they are recruited. Here, the assumption is that the aggregate load reduction can be more reliably established than the individual load reduction which requires a reliable baseline estimate. Since the probability of selection or calling of each individual consumer is restricted to probability $p$ , the number $n$ of such sets of consumers recruited satisfies $np=1$ . When a DR event occurs, one set is randomly chosen and its members are signaled to reduce consumption. This recruitment and selection process ensures that one set is always chosen. Hence, the required level of reduction $\Delta Q^{*}$ is delivered during all DR events while satisfying the calling probability of each recruited consumer.

It is inconceivable that each set of consumers will exactly deliver $\Delta Q^{*}$ amount of reduction at the prescribed reward. Hence, in the proposed mechanism, the SO is allowed to adjust the selected consumers within the DR event window. If the price remains higher than the TMC price within the DR event then the SO can call more consumers till the price falls to the desired level. Note that this does require the SO to recruit some set of consumers who can respond on short notice. Such type of consumers can be recruited under the flexible resource category.

Here, we provide a very simple example to illustrate how the consumers are grouped and selected. Consider the case where the consumers are identical and have a capacity to deliver $0.5$ kWh of reduction when paid at $\pi^{*}=\$ 0.05/\text{kWh} $. Let the probability of calling a consumer be$ p=0.1 $and the optimal load reduction$ \Delta Q^{}=10\ \text{kWh} $. Then the SO would recruit$ n=1/p=10 $groups each with a capacity to deliver$ 10\ \text{kWh} $of reduction when paid at$ \pi^{}=$0.05/\text{kWh} $. This implies that each of these groups would contain$ 20 $such consumers and each of these groups will be called or selected by the probability$ p=0.1$ when a DR event occurs.

Remark 1.

As stated earlier, for an incentive-based demand response program, determining the right baseline is very important as baseline can not be measured. In our mechanism, consumers self-report their baseline. No other information from consumers is needed other than baseline report. Hence, our mechanism is minimal in the information it elicits from the consumers.

II-C2 Reward and Penalty Function

The reward function in the mechanism is set as

[TABLE]

Thus, the SO pays the consumers according to the measured reduction $f-q$ , where $f$ is the consumer’s baseline report and $q$ is the measured consumption during the DR event. The reward per unit reduction is $\pi_{2}$ . The consumer’s penalty function is specified as follows,

[TABLE]

where the penalty function $\phi$ in (3) is chosen to be convex, symmetric, and nonnegative with minimum value zero at the origin, i.e. it satisfies the following conditions:

[TABLE]

where $\phi^{\prime}$ and $\phi^{\prime\prime}$ denote the first and second derivative of the penalty function $\phi$ .

II-D Consumer’s Optimization Problem

The minimum expected cost incurred by a consumer is a function of the baseline report $f$ and is given by

[TABLE]

The consumer’s problem is formulated as follows:

[TABLE]

Hence, the consumer’s problem is a two stage stochastic decision problem. In the first stage the consumer decides the optimal baseline report $f$ , and in the second stage decides the optimal consumption $q$ .

Definition 1.

Let $f^{*}$ be defined as the baseline report that minimizes the cost that is incurred by the consumer, i.e. $f^{*}=\arg\min~{}H(f)$ .

III Optimal baseline report and inflation

In this section, we derive an optimality condition for the consumer baseline report that minimizes the expected cost of the consumer. The optimality condition has a nice economic interpretation because it establishes that the baseline report that minimizes the expected cost is such that the marginal utility of the consumers equals the retail price of the electricity.

The consumer’s optimization problem CP given by (6) is a two-step stochastic decision problem. We characterize consumers’s consumption decisions corresponding to the second stage problem and then obtain the optimality condition for the consumer’s baseline report by solving the first stage problem.

III-A The Consumer’s Second Stage Problem

The consumer has several choices. It can decide to participate or not to participate in the DR program. If it decides to participate, then it can be signaled to reduce its consumption or not signaled. This gives rise to three possible scenarios for the second stage: a) consumer is not participating in the DR program, b) consumer is participating in the program but is not signaled to reduce consumption, c) consumer is participating in the program and is signaled to reduce consumption. We obtain the optimal consumption for each of the three cases assuming that the baseline report $f$ is given. The consumption when the consumer is not participating corresponds to the true baseline. Hence, we use this value as the baseline to characterize inflation in the DR program.

Consumer is not participating in the DR program

In this case, $R=0$ and $\Phi=0$ . Let $J^{a}(q,\theta)$ denote the realized cost function for this case. It is then given by

[TABLE]

where $\pi_{0}$ is the retail price of electricity. The optimal consumption is given by

[TABLE]

which is a function of $\theta$ because its value is realized when the consumption decision is made. Note that $q^{a}(\theta)$ is the solution of the first order optimality condition,

[TABLE]

Hence, $q^{a}(\theta)$ is given by

[TABLE]

where $\mu^{-1}$ denotes the inverse function of the marginal utility, see (1), that always exists for every $\theta$ . Moreover, since the consumer’s utility is monotone increasing and concave in $q$ , the consumption $q^{a}(\theta)$ is always nonnegative.

Consumer is participating in the program but is not signaled

to reduce consumption

The reward and penalty functions are given by (2) and (3). Let $J^{b}(f,q)$ denote the realized cost function which is given by

[TABLE]

As before, the value of $\theta$ is realized when the consumption decision is made. In this scenario the realized cost is also a function of the baseline report $f$ in addition to the consumption decision and the value of $\theta$ . The optimal consumption is given by

[TABLE]

and so it satisfies the first order optimality condition,

[TABLE]

Hence, the optimal consumption satisfies the following implicit equation,

[TABLE]

and $q^{b}(f,\theta)$ is also a function of $f$ because the deviation from $f$ incurs a penalty.

Consumer is participating in the program and is signaled to

reduce consumption

Again, the reward and penalty functions are given by equations (2) and (3), respectively. Let $J^{c}(q,\theta)$ denote the realized cost function which is given by

[TABLE]

The optimal consumption is given by

[TABLE]

So $q^{c}(f,\theta)$ is the solution of

[TABLE]

Hence, the optimal consumption $q^{c}$ is independent of $f$ and is given by

[TABLE]

The relation between the consumptions for the three different cases $q^{a}(\theta)$ , $q^{b}(\theta,f)$ , $q^{c}(\theta)$ and the consumer’s baseline report $f$ are stated in the following lemma.

Lemma 1.

The optimal consumptions for the three cases $q^{a}(\theta)$ , $q^{b}(\theta,f)$ , $q^{c}(\theta)$ satisfy the conditions (i) $q^{c}<q^{a}$ and (ii) $q^{a}<q^{b}<f$ or $f<q^{b}<q^{a}$ for every $\theta$ .

Proof.

Refer Appendix. ∎

As a result of Lemma 1, a rational consumer that is participating in the DR program and is signaled always provides a load reduction with respect to its true baseline consumption $q^{a}$ . However, according to this lemma, a consumer that is participating and not signaled for reduction may inflate its consumption near to its inflated baseline report to avoid the penalty and gain from the inflated baseline when called for reduction. This behaviour needs to be controlled.

III-B The Optimal Baseline Report

Let $p$ denote the probability that the consumer is signaled to reduce when a DR event occurs. The expected cost that is incurred by the consumer (5) can be expressed in terms of the probability $p$ as follows:

[TABLE]

and it follows that the optimal baseline report $f^{*}$ minimizes this $H(f)$ . In the following lemma, we show that the consumer’s expected cost $H(f)$ is a convex function of its argument $f$ .

Lemma 2.

The consumer’s expected cost $H(f)$ is a (strictly) convex function of its argument $f$ if and only if the penalty $\phi$ is (strictly) convex.

Proof.

Refer Appendix. ∎

Since the penalty function $\phi$ was chosen to be convex, the consumer’s expected cost $H(f)$ is also convex.

Definition 2 (Consumer’s Expected Marginal Utility).

The consumer’s expected marginal utility under the incentive-based self-reported DR program is given by

[TABLE]

The consumer’s expected marginal utility is a function of the baseline report $f$ , because the consumption $q$ is a function of $f$ . For example, if the consumer is participating in the DR program and is signaled, then its consumption is $q^{b}(f,\theta)$ which solves the implicit equation (12) and does depend on $f$ . The following theorem establishes the optimality condition for the optimal baseline report $f^{*}$ in terms of $M(f)$ ,

Theorem 1.

The optimal baseline report $f^{*}$ satisfies $\pi_{0}=M(f^{*})$ and is a global minimizer of the cost function $H(f)$ . Moreover, the minimizer is unique when $\phi$ is strictly convex.

Proof.

Refer Appendix. ∎

The optimality condition obtained in Theorem 1 has a nice interpretation from the classical consumer theory in economics [30]. The optimal baseline report $f^{*}$ is such that the consumer’s marginal utility equals the retail price of the electricity. Given $f^{*}$ , the expected reward per unit of energy (in kWh) paid for the expected load reduction provided by a consumer is given by

[TABLE]

where $\delta f(\theta)=f-q^{a}(\theta)$ is defined to be the inflation of the baseline report. From the second term, we infer that the expected inflation of the baseline report should be small to avoid a large excess payment.

III-C Control of Baseline Inflation

Here, we show that the penalty function in combination with randomized calling allows the SO to control the inflation of the optimal baseline report $\delta f^{*}(\theta)=f^{*}-q^{a}(\theta)$ . First we establish that penalty is necessary and then show that with a penalty, the probability of calling $p$ provides us a lever to control baseline inflation.

III-C1 Optimal Baseline Report without Penalty

In this case, the optimality condition for the optimal baseline report $f^{*}$ is given by

[TABLE]

Since the sensitivity of the consumer’s cost $H(f)$ is negative with respect to $f$ , it indicates that the consumer will report a very high baseline.

III-C2 Optimal Baseline Report with Penalty

The introduction of a penalty function allows us to control the inflation in the baseline report by adjusting the probability of calling. This result is shown in the following lemma.

Theorem 2.

Let the penalty function $\phi$ be a quadratic function such that $\forall x:\phi^{\prime\prime}(x)=1/\lambda$ . Then the measurable inflation in the optimal baseline report $\delta\tilde{f}^{*}(\theta)=f^{*}-q^{b}(f^{*},\theta)$ satisfies

[TABLE]

And when $\frac{\partial u^{2}(q,\theta)}{\partial q^{2}}=-1/d$ ,

[TABLE]

Proof.

Refer Appendix. ∎

For specific consumer utility and penalty functions, an explicit expression for the expected baseline report inflation can be obtained. The following theorem provides this expression for the case where the consumer’s utility and the penalty function are both quadratic.

Theorem 3.

Let the consumer’s utility $u$ and the penalty function $\phi$ be quadratic functions such that

i)

$\forall(q,\theta):\frac{\partial u^{2}(q,\theta)}{\partial q^{2}}=-1/d$ , 2. ii)

$\forall x:\phi^{\prime\prime}(x)=1/\lambda$ ,

where $d$ and $\lambda$ are positive scalars, then the expected inflation of the baseline report is given by

[TABLE]

Proof.

Refer Appendix. ∎

The law of diminishing marginal utility establishes that the marginal utility declines with increase in consumption [30]. In Theorem 3, $1/d$ is the rate of diminishment of the consumer’s marginal utility, and it is a private feature of the consumer that cannot be modified by the system operator. Unlike $d$ , $\lambda$ is a parameter of the DR program, because it defines the quadratic penalty function, i.e. $\phi(x)=x^{2}/(2\lambda)$ . Hence, the SO can choose $\lambda$ in the design of the DR program. Since $\lambda>0$ , a lower bound for the expected inflation of baseline report is obtained by setting $\lambda=0$ ,

[TABLE]

Moreover, by choosing the parameter of the penalty function $\lambda$ to be small enough, the expected inflation of the baseline report can be made arbitrarily close to its lower bound. Note that this lower bound is a function of $p$ and is decreasing with $p$ . Consequently, by choosing $\lambda$ and $p$ to be small the baseline inflation can be controlled.

Here, we provide a simple numerical example to validate the above results. In this example, $u=cq-(0.5/d)q^{2}$ , where $c=\$ .5/\text{kWh} $and$ d\in{0.1,0.2,0.3,0.4} $in$ ($/\text{kWh}^{2})^{-1} $. The retail price$ \pi_{0}=$0.12/\text{kWh} $and the TMC price$ \pi^{*}=$0.05/\text{kWh} $and are typical values (Refer [[31](#bib.bib31)]). These set of parameter values correspond to a typical price sensitivity value of$ \sim-0.3 $[[32](#bib.bib32), [33](#bib.bib33)]. The penalty coefficient$ \lambda=0.1\ ($/\text{kWh}^{2})^{-1} $. The probability$ p $is chosen to be$ p=0.1$. Table II summarizes the simulation results and how it compares with the theoretical results for this example.

III-D Ensuring Individual Rationality with a Deadband

The DR program is not guaranteed to be individually rational from the point of view of a single consumer because of the presence of uncertainty $\theta$ . When a consumer is not called, it consumes $q^{b}(f,\theta)$ which varies with $\theta$ and is different from $f$ , as it was shown in Lemma 1. As a result the consumer incurs a penalty and the mechanism is not guaranteed to be individually rational. This is not an issue in the absence of uncertainty. Individual rationality of the program can be ensured by introducing a deadband in the penalty as illustrated in Figure 2. A penalty with deadband can be expressed as,

[TABLE]

It is evident from such a design that there exists a deadband width $\epsilon$ such that the mechanism is individually rational. But a deadband worsens the inflation in baseline. In the theorem below, we provide an upper bound for the inflation in baseline when the penalty function has a deadband. The upper bound explicitly proves that the baseline inflation can worsen with the introduction of a deadband. But this trade off has to be made to guarantee individual rationality.

Theorem 4.

Let the consumer’s utility $u$ be a quadratic function such that

[TABLE]

The penalty function $\phi$ is defined in (22), where $d$ and $\lambda$ are positive scalars. Let $\max\{q_{\max}-\mathds{E}_{\theta}q^{a}(\theta),\mathds{E}_{\theta}q^{a}(\theta)-q_{\min}\}\leq\epsilon$ , where $q^{a}(\theta)\in[q_{min},q_{max}]$ , then the expected inflation of the baseline report is bounded by

[TABLE]

and the mechanism is individually rational.

Proof.

Refer Appendix. ∎

IV SO’s cost

The SO’s overall cost includes four terms: the cost to purchase power from the wholesale market, the payment for DR services, the retail energy payments and the recruitment cost. We ignore the recruitment cost for the initial analysis here. This allows us to mathematically derive an order approximate expression, with respect to $p$ , for the resultant cost. Using this we show that $p$ can be used as a lever to control SO’s cost as well. This allows the SO to achieve an almost optimal cost in this case by choosing a very small value for $p$ .

We then discuss the case where the recruitment cost is non-trivial. Here, we show that $p$ is restricted as a lever for controlling SO’s cost. This is because the recruitment costs becomes unbounded as $p\rightarrow 0$ . However, we show, for a typical DR scenario, that the SO’s cost is decreasing with $p$ up to a certain threshold value. This threshold value is small enough that the cost can be significantly reduced by choosing this threshold as the selection probability. This suggests that the SO can still reduce its cost to significantly lower levels for typical DR scenarios.

IV-A Without Recruitment Cost

Let $Q$ denote the net energy purchased from wholesale market, $\Pi(Q)$ the wholesale market’s price, $\Delta\tilde{Q}$ the measured net load reduction provided by the called DR resources, and $\pi_{0}$ the retail energy price. Then, the SO’s overall cost, ignoring the recruitment cost, is given by

[TABLE]

Let $Q_{0}$ denote the overall load had the DR resources not been called to reduce load and $\Delta Q$ the true net reduction provided by the called DR resources, then $Q_{0}=Q+\Delta Q$ and the SO’s cost can also be written as follows,

[TABLE]

where $\Delta\tilde{Q}-\Delta Q$ corresponds to the inflation in net load reduction estimate, which arises from the inflation in baseline estimates of the recruited DR providers.

From Theorem 3, it follows that the inflation $\Delta\tilde{Q}-\Delta Q$ is $O(p)$ where $p$ is the probability of calling a consumer, which is a design variable of the DR mechanism. Hence, in this case, $\min_{\Delta Q,p=0}J_{SO}=J^{*}_{SO}$ , i.e., the SO’s optimal cost can be achieved by driving the probability to zero. And so the optimal reduction $\Delta Q^{*}=\arg\min_{\Delta Q,p=0}J_{SO}$ . From the convexity of $J_{SO}$ , when $p$ is zero, it follows that $\Delta Q^{*}$ satisfies the first order condition,

[TABLE]

The market price corresponding to $Q_{0}-\Delta Q^{*}$ is exactly the TMC price $\pi^{*}$ because $Q_{0}-\Delta Q^{*}$ is the optimal reduction. Consequently, $\Delta Q^{*}$ satisfies

[TABLE]

The SO recruits $n=1/p$ sets of consumers such that each set can provide $\Delta Q^{*}$ of load reduction when called for a DR event. Hence, the cost for the SO (25) when a particular set is called during a DR event is given by

[TABLE]

Using definition of $J_{SO}^{*}$ ,

[TABLE]

Substituting for baseline inflation from Theorem 3,

[TABLE]

where $\bar{N}$ is the number of consumers in the set and $\bar{d}$ is the average rate of diminishment of the marginal utility across the recruited consumers in the set, which is an unknown. The SO chooses the reward rate as $\pi_{2}=\pi^{*}$ , and therefore

[TABLE]

Note that with the inclusion of deadband to ensure individual rationality, the SO’s cost becomes,

[TABLE]

Thus, for this case, the SO’s cost is $O(p)$ and $O(\epsilon)$ optimal and the SO’s cost $J_{SO}$ approaches $J_{SO}^{*}$ when both $p$ and $\epsilon$ approach zero. This result suggests that the SO can achieve an almost optimal cost in this case by choosing a very small value for $p$ .

IV-B With Recruitment Cost

Denote the recruitment cost per customer by $\pi_{\text{rec}}$ . Let $N_{T}$ be the total number of consumers recruited. Then $N_{T}$ is given by

[TABLE]

where $n=1/p$ is the number of groups and $\bar{N}_{i}$ is the number of consumers in group $i$ . Including the total recruitment cost, which scales with $N_{T}$ , SO’s overall cost is given by

[TABLE]

Note that, in this case, the optimal cost for SO $J^{*}_{SO}\neq\min_{\Delta Q,p=0}J_{SO}$ . The reason is that the last term grows unboundedly as $p\rightarrow 0$ . This also suggests that, in this case, an almost optimal cost cannot be achieved by choosing $p$ to be very small. This is illustrated in the example below.

We provide a simple example here to illustrate how the SO’s cost varies with $p$ when the recruited consumers provide $\Delta Q^{*}$ reduction and when the recruitment cost is non-trivial. In the example we consider here, $c=5\times 10^{2}\ \$ /\text{MWh} $,$ \pi_{0}=$120/\text{MWh} $,$ Q_{0}=8000 $MWh. We consider two different values for$ d $, i.e.,$ d=0.1,d=0.01 $. The values of$ d $are derived from demand reduction provided by typical customers assuming the payment to be$ $100/\ \text{MWh} $. The two$ d $values correspond to large industrial customers and commercial places like retail stores etc. respectively [[34](#bib.bib34)]. We assume that the supply ranges from$ 5000 $MWh to$ 8000 $MWh. Using the representative supply curve from [[35](#bib.bib35)] we approximate the inverse supply curve for this range by$ \Pi(Q)=aQ+bQ^{2} $where$ a=-0.0415 $in$ $/\text{MWh} $and$ b=8.3\times 10^{-6} $in$ $/\text{MWh}^{2} $. For this supply curve and$ Q_{0}=8000 $MWh,$ \pi^{}\sim$100/\text{MWh} $and$ \Delta Q^{}\sim 1200 $MWh. The reward payment of$ $100/\text{MWh} $and the aggregate load reduction of$ 1200$ MWh are typical of DR programs spanning the region covered by a SO [36].

The first two plots of Figure 3 provides the variation of SO’s cost with respect to $p$ when the recruitment cost is $\pi_{\text{rec}}=\$ 2/\text{Customer} $for different values of$ d $and the bottom row plot of Figure [3](#S4.F3) provides the variation of SO’s cost when the recruitment cost$ \pi_{\text{rec}}=$10/\text{Customer} $. The former recruitment cost, i.e.$ \pi_{rec}=2 $, is based on typical service costs charged per customer on a monthly basis to recover the metering implementation and maintenance cost [[37](#bib.bib37)]. In our case, we consider the worst-case scenario where the SO bears this cost instead of passing it on to the DR participants. Note that the approximation of the SO’s cost by ignoring recruitment cost, as in the previous section, is a reasonable approximation of the SO’s cost up to a certain threshold probability. This threshold probability is as low as$ 0.1 $and$ 0.2 $for the cases$ d=0.1 $and$ d=0.01 $respectively. These$ d$ values and other parameter values are typical values as stated before. Hence we expect that, in a typical scenario such as this, a SO can still reduce its cost significantly by setting the calling probability equal to this threshold value.

V Self-Reported Baseline vs. Other Baseline Estimation Methods

Here, we shall use the CAISO $m/m$ method [8] for our comparative study. We emphasize here that a similar analysis applies to other estimation methods like control group methods. In CAISO’s m/m method, the SO computes the average consumption of the most recent $m$ similar but non-event days and uses this average-based estimate as the baseline. Hence, the baseline estimate is a moving average of the consumption profile of the consumers. Typically this average-based estimate from past consumption data is corrected by an adjustment factor to account for any variation in the consumption pattern from the past. This adjustment factor is common to all baseline estimation methods and is highly recommended. As we shall see this factor is the primary cause for the existence of adverse incentives to inflate baseline. Hence, the analysis to follow equally applies to all current baseline methods that use an adjustment factor, which includes the control group methods.

As discussed before, the individual optimal consumption decision depends on whether the consumer is signaled or not for reduction on a particular day. Also the baseline estimate used for the payments depends on the consumption in the days prior to the DR event. So the payments made during future DR events can influence the consumer to inflate their consumption during a non-event day. On a particular day, the consumer’s benefit depends on whether the consumer is signaled or not. As explained in Section III-A, if the consumer is participating in the DR program and is signaled to reduce, its total cost is $J^{c}(q,f,\theta)$ given by (13) where $f$ is the baseline estimate obtained by the CAISO $m/m$ method. However, if the consumer is not signaled, its cost is $J^{a}(q,\theta)$ (see equation 7). Unlike the incentive-based DR program with self-reported baseline, the CAISO program does not impose a penalty, and the consumer’s cost is the same as if it were not participating in the DR program, when it is not called.

Let $\mathcal{T}_{N}$ denote the set of most recent $m$ similar but non-event days, and let $f^{c}$ denote the baseline calculated in CAISO’s $m/m$ model. Then,

[TABLE]

where $\{q_{\tau}:\tau\in\mathcal{T}_{N}\}$ is the set of consumption for the near past $m$ similar non-event days. The baseline estimation $f^{c}$ is multiplied by an adjustment factor $C_{f}$ to account for any variation in the consumption pattern. Hence, the CAISO baseline with adjustment factor is given by

[TABLE]

where $f^{c}$ was defined in (34). Let $q$ denote the consumption on the current DR event day, and $q^{-}$ the consumption on the day before. Let $\mathcal{T}_{E}$ be the set of days before the days in the set $\mathcal{T}_{N}$ , and define $f^{-}=\frac{1}{m}\sum_{\tau\in\mathcal{T}_{E}}q^{-}_{\tau}$ as the average consumption of the days in the set $\mathcal{T}_{E}$ . The correction factor for the current DR day is then computed as

[TABLE]

Typically, the consumers are signaled a day ahead of the DR event. So, the reward during the DR event on the current day can influence the consumer to inflate its day-ahead consumption $q^{-}$ . The day-ahead consumption is obtained by minimizing the joint cost of the current DR day and the day before with respect to $q^{-}$ , as these are the only two terms in the overall cost of the consumer that $q^{-}$ can influence. The joint cost for the two days is given by $J^{a}(q^{-},\theta)+J^{c}(q,\theta)$ , where

[TABLE]

Here, for illustration purposes, we have assumed identical utility functions and retail price for both the days. The analysis can be trivially extended to the general case where they are not identical. The term $\pi_{2}(\bar{f}^{c}-q)$ is the payment that the consumer receives for reducing consumption and the value of $\theta$ is realized when the consumption decision is made.

By definition it follows that the optimal consumption $q^{-*}$ on the day before the current DR event day is given by

[TABLE]

From the first order optimality condition it follows that $q^{-*}$ should satisfy

[TABLE]

On the DR event day, $f^{c}$ and $f^{-}$ are constants. This implies,

[TABLE]

Hence, the optimal consumption on the day before the current DR event day is

[TABLE]

Using this result, we provide a lower bound for the expected value of baseline inflation in the CAISO m/m method with adjustment factor, when the utility function is quadratic, in the following lemma.

Lemma 3.

Let the consumer’s utility $u$ be a quadratic function such that $\forall(q,\theta):\frac{\partial u^{2}(q,\theta)}{\partial q^{2}}=-1/d$ where $d$ is a positive scalar, then the baseline report $\bar{f}^{c}$ satisfies

[TABLE]

Proof.

Refer Appendix. ∎

In the proposed DR mechanism, the expected baseline inflation with quadratic utility and penalty function was obtained in Theorem 4. From the discussion in the previous two sections, it follows that $\lambda$ and $p$ can be used as levers to control baseline. Hence by choosing $\lambda$ and $p$ to be sufficiently small and provided $\epsilon$ is not comparable to $d\pi_{2}$ , which is the case when $\pi_{2}\sim O(\pi_{0})$ , we get that

[TABLE]

Thus, in the self-reported approach, we can ensure that the inflation in baseline per consumer is significantly smaller compared to conventional baseline estimation methods, such as CAISO’s m/m method, that uses an adjustment factor.

VI Conclusion

We proposed a mechanism for incentive-based DR programs where the only information that is elicited from each consumer is a self-report of its baseline consumption. The mechanism entails a calling probability for each consumer and a penalty when the consumer is not called. The mechanism provides the required service reliably by selecting a certain set of consumers during every DR event. We showed that the probability of calling and the penalty can be used to control the baseline inflation. We also justified that the mechanism’s cost can be significantly reduced by deploying DR resources. Finally, we showed that the self-reported baseline estimates a better baseline estimate than conventional methods such as the CAISO’s m/m method.

-A Proof of Lemma 1

In order to prove the second statement, we note that $J^{b}(f,q,\theta)$ is the sum of two convex functions $U_{1}(q)=\pi_{0}q-u(q,\theta)$ and $U_{2}(f,q)=\phi(f-q)$ . The minimizer of $U_{1}$ is $q^{a}(\theta)$ and the minimizer of $U_{2}$ is $q=f$ . Then the minimizer of $J^{b}=U_{1}+U_{2}$ necessarily lies between the minimizers of $U_{1}$ and $U_{2}$ which implies that $q^{b}(f,\theta)$ lies between $q^{a}(\theta)$ and $f$ . The first statement follows from (15), (9) and the properties of the consumer’s utility that is monotone increasing.

-B Proof of Lemma 2

We start by showing that $0\leq\alpha(f,\theta)<1$ , where $\alpha$ is the cost sensitivity defined as $\alpha(f,\theta)=\frac{dq^{b}(f,\theta)}{df}$ . The optimal consumption $q^{b}(f,\theta)$ satisfies (11). Holding $\theta$ fixed and differentiating (11) further we get,

[TABLE]

Convexity of $\phi$ and strict convexity of $-u$ implies the existence of $\frac{dq^{b}}{df}$ and is given by

[TABLE]

and satisfies $0\leq\alpha<1$ . Next, we differentiate $H(f)$ twice to show that $H^{\prime\prime}(f)>0$ . Differentiating $H(f)$ we get

[TABLE]

Differentiating once again, we get

[TABLE]

Before we showed that $\left(1-\alpha(f,\theta)\right)>0$ . Then, it follows that $H(f)$ is (strictly) convex if and only if $\phi$ is (strictly) convex.

-C Proof of Theorem 1

The optimal forecast $f^{*}$ satisfies the first order condition:

[TABLE]

The sensitivity of optimal cost $J^{b}(f,q^{b},\theta)$ with respect to $f$ is given by

[TABLE]

where $\alpha(f,\theta)=\frac{dq^{b}(f,\theta)}{df}$ . Then, taking into account that $q^{b}(f,\theta)$ satisfies (11), we get

[TABLE]

The sensitivity of optimal cost $J^{c}(f,q^{c},\theta)$ with respect to $f$ is given by

[TABLE]

where $\beta(\theta)=\frac{dq^{c}(\theta)}{df}$ . As before, $q^{c}(\theta)$ satisfies (14) and we get

[TABLE]

From equations (42) and (43), we obtain

[TABLE]

The optimality condition follows from equations (11) and (14) because

[TABLE]

The right hand side in (45) is the expected marginal utility which implies that $\pi_{0}=M(f^{*})$ . Since $\phi$ was selected to be convex, from Lemma 2, $f^{*}$ is a global minimizer of $H(f)$ . Moreover, if $\phi$ is a strictly convex function, again from Lemma 2, $f^{*}$ is unique.

-D Proof of Theorem 2

From equation (44), we have

[TABLE]

This implies,

[TABLE]

From the optimality condition for $q^{b}(f^{*},\theta)$ (11) and when $\frac{\partial u^{2}(q,\theta)}{\partial q^{2}}=-1/d$ ,

[TABLE]

Taking expectations on both sides we get

[TABLE]

That is,

[TABLE]

-E Proof of Theorem 3

The consumptions $q^{a}(\theta)$ and $q^{c}(\theta)$ have the expressions:

[TABLE]

Also from (45), we get

[TABLE]

This implies,

[TABLE]

Since the utility function $u$ is quadratic in $q$ , the marginal utility $\mu=u^{\prime}$ is affine in $q$ . Moreover, since $\mu^{\prime}$ is independent of the random variable $\theta$ , it holds

[TABLE]

and substituting $q^{b}(\theta)$ in the previous expression, we obtain

[TABLE]

The consumer’s utility $u$ and the penalty $\phi$ are quadratic functions, then their derivatives $u^{\prime}$ and $\phi^{\prime}$ are affine and their inverse functions are also affine. Moreover, the derivative of the inverse functions satisfy:

[TABLE]

Then, the expressions of $\mu^{-1}\left(\pi_{0}-\frac{p\pi_{2}}{1-p},\theta\right)$ and $\phi^{\prime-1}\left(\frac{p\pi_{2}}{1-p}\right)$ become

[TABLE]

and

[TABLE]

Using (44), we get

[TABLE]

By taking expectations,

[TABLE]

and by substitution in (46), we obtain

[TABLE]

-F Proof of Theorem 4

Define a family of penalty functions with deadband as follows:

[TABLE]

for $0\leq\Delta<\epsilon$ . Note that $\phi_{\Delta}$ is continuous with continuous derivatives up to second order for $0<\Delta<\epsilon$ , and it approaches the penalty function $\phi$ given by (22) as $\Delta$ approaches zero, i.e. $\phi=\lim_{\Delta\rightarrow 0^{+}}\phi_{\Delta}$ .

The derivative of $\phi_{\Delta}$ is:

[TABLE]

for $x\geq 0$ and $\phi_{\Delta}^{\prime}(x)=-\phi_{\Delta}^{\prime}(-x)$ for $x\leq 0$ , which is invertible for any $x\neq 0$ .

From the fact that this penalty function is double differentiable, the optimality conditions established before hold for this specific case as well. We do a case based analysis.

Case $f^{*}\geq\mathds{E}_{\theta}q^{a}(\theta)+\epsilon$ : From Lemma 1 we have that $f^{*}\geq q^{b}(f^{*},\theta)\ \forall\theta$ . Then using (44), the convexity of $\phi^{\prime}$ for $x\geq 0$ , that $f^{*}\geq q^{b}(f^{*},\theta)\ \forall\theta$ and Jensen’s inequality, we get

[TABLE]

It is always possible to choose $\Delta$ such that

[TABLE]

For this value of $\Delta$ , $\phi_{\Delta}^{\prime}(x)$ is always restricted to $x>\epsilon$ . Since $\phi^{\prime}_{\Delta}$ is affine and invertible, we get

[TABLE]

Since $\mathbb{E}(f^{*}-q^{b}(f^{*},\theta))\geq 0$ , the fact that $\phi^{\prime}$ is increasing for $x\geq 0$ and from the previous equation it follows that

[TABLE]

and substituting the value of $\mathbb{E}_{\theta}q^{b}(\theta)$ given by (47), we obtain

[TABLE]

Hence, the result for the penalty function with deadband $\phi$ defined in (22) is obtained by taking limit when $\Delta$ approaches zero,

[TABLE]

Case $f^{*}<\mathds{E}_{\theta}q^{a}(\theta)+\epsilon$ : By this case it follows that

[TABLE]

Individual Rationality: For an $\epsilon$ such that $\max\{q_{\max}-\mathds{E}_{\theta}q^{a}(\theta),\mathds{E}_{\theta}q^{a}(\theta)-q_{\min}\}\leq\epsilon$ , the report $f=\mathds{E}_{\theta}q^{a}(\theta)$ is individually rational. Thus the optimal baseline report $f^{*}$ should be individually rational. Hence proved.

-G Proof of Lemma 3

We start by showing that $f^{c}>f^{-}$ . Recall that $f^{-}$ is the average of consumption on the days prior to the previous $m$ non-event days. The consumption on these days only appear in the denominator of the CAISO’s baseline estimate for future DR events. Hence, the incentive for the consumer is to reduce the consumption on these days so as to inflate the baseline. On the other hand, $f^{c}$ is the average of the consumption on the previous $m$ non-event days. And the consumption on these days only appear in the numerator of the baseline estimate for any future DR events, through the term $f^{c}$ . Hence, the incentive for the consumer is to increase the consumption on these days. Since everything else is the same for the day prior to the non-event day and the non-event day except for this incentive to reduce and increase, respectively, we conclude that $f^{c}>f^{-}$ . This implies:

[TABLE]

Hence,

[TABLE]

Taking expectation with respect to $\theta$ we obtain

[TABLE]

and this completes the proof.

Bibliography37

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. H. Albadi and E. El-Saadany, “A summary of demand response in electricity markets,” Electric power systems research , vol. 78, no. 11, pp. 1989–1996, 2008.
2[2] F. E. R. Commission, “Demand response compensation in organized wholesale energy markets,” Final Rule Report , 2011.
3[3] C. D. W. Paul L. Joskow, “Dynamic pricing of electricity,” The American Economic Review , vol. 102, no. 3, pp. 381–385, 2012. [Online]. Available: http://www.jstor.org/stable/23245561
4[4] P. Chakraborty, E. Baeyens, and P. P. Khargonekar, “Distributed control of flexible demand using proportional allocation mechanism in a smart grid: Game theoretic interaction and price of anarchy,” Sustainable Energy, Grids and Networks , vol. 12, pp. 30–39, 2017.
5[5] J. L. Mathieu, T. Haring, J. O. Ledyard, and G. Andersson, “Residential demand response program design: Engineering and economic perspectives,” in European Energy Market (EEM), 2013 10th International Conference on the . IEEE, 2013, pp. 1–8.
6[6] S. Borenstein, M. Jaske, and A. Ros, “Dynamic pricing, advanced metering, and demand response in electricity markets,” Journal of the American Chemical Society , vol. 128, no. 12, pp. 4136–45, 2002.
7[7] A. Faruqui and J. Palmer, “Dynamic pricing and its discontents,” Regulation , vol. 34, no. 3, pp. 16 – 22, 2011.
8[8] CAISO, Demand Response User Guide. Version 4.3 , California ISO, May 2017.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

A Minimal Incentive-based Demand Response Program With Self Reported Baseline Mechanism

Abstract

Index Terms:

I Introduction

I-A Our Contribution

I-B Related Work

II Problem Setup

II-A Market Model

II-B Consumer Model

II-C Incentive-Based Demand Response Program

Stage 1 (Reporting)

Stage 2 (DR Event)

II-C1 Consumer Recruitment and Selection

Remark 1**.**

II-C2 Reward and Penalty Function

II-D Consumer’s Optimization Problem

Definition 1**.**

III Optimal baseline report and inflation

III-A The Consumer’s Second Stage Problem

Consumer is not participating in the DR program

Consumer is participating in the program but is not signaled

Consumer is participating in the program and is signaled to

Lemma 1**.**

Proof.

III-B The Optimal Baseline Report

Lemma 2**.**

Proof.

Definition 2** (Consumer’s Expected Marginal Utility).**

Theorem 1**.**

Proof.

III-C Control of Baseline Inflation

III-C1 Optimal Baseline Report without Penalty

III-C2 Optimal Baseline Report with Penalty

Theorem 2**.**

Proof.

Theorem 3**.**

Proof.

III-D Ensuring Individual Rationality with a Deadband

Theorem 4**.**

Proof.

IV SO’s cost

IV-A Without Recruitment Cost

IV-B With Recruitment Cost

V Self-Reported Baseline vs. Other Baseline Estimation Methods

Lemma 3**.**

Proof.

VI Conclusion

-A Proof of Lemma 1

-B Proof of Lemma 2

-C Proof of Theorem 1

-D Proof of Theorem 2

-E Proof of Theorem 3

-F Proof of Theorem 4

-G Proof of Lemma 3

Remark 1.

Definition 1.

Lemma 1.

Lemma 2.

Definition 2 (Consumer’s Expected Marginal Utility).

Theorem 1.

Theorem 2.

Theorem 3.

Theorem 4.

Lemma 3.