Designing reactive power control rules for smart inverters using support vector machines
Mana Jalali, Vassilis Kekatos, Nikolaos Gatsis, Deepjyoti Deka

TL;DR
This paper introduces a machine learning approach using support vector machines to design nonlinear inverter control rules for voltage regulation, aiming to improve upon preset local rules with reduced communication overhead.
Contribution
It formulates inverter control rule design as a multi-task learning problem, enabling customized, nonlinear control rules that enhance voltage regulation in distribution grids.
Findings
Nonlinear control rules outperform preset local rules.
The approach reduces communication overhead.
Trade-offs between voltage regulation and rule sparsity are demonstrated.
Abstract
Smart inverters have been advocated as a fast-responding mechanism for voltage regulation in distribution grids. Nevertheless, optimal inverter coordination can be computationally demanding, and preset local control rules are known to be subpar. Leveraging tools from machine learning, the design of customized inverter control rules is posed here as a multi-task learning problem. Each inverter control rule is modeled as a possibly nonlinear function of local and/or remote control inputs. Given the electric coupling, the function outputs interact to yield the feeder voltage profile. Using an approximate grid model, inverter rules are designed jointly to minimize a voltage deviation objective based on anticipated load and solar generation scenarios. Each control rule is described by a set of coefficients, one for each training scenario. To reduce the communication overhead between the grid…
| C4) | C5) | C6) | C7) | |
|---|---|---|---|---|
| Running time [min] |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Designing Reactive Power Control Rules
for Smart Inverters using Support Vector Machines
Mana Jalali, , Vassilis Kekatos, , Nikolaos Gatsis, , and Deepjyoti Deka Manuscript received March 1, 2019; revised June 4, 2019, and August 15, 2019; accepted September 16, 2019. Date of publication DATE; date of current version DATE. Paper no. TSG.00318.2019. This work was supported in part by the NSF-CAREER grant 1751085.M. Jalali and V. Kekatos are with the Bradley Dept. of ECE, Virginia Tech, Blacksburg, VA 24061, USA. N. Gatsis is with the ECE Dept., University of Texas at San Antonio, San Antonio, TX 78249, USA. D. Deka is with the Theoretical Division at Los Alamos National Laboratory, NM 87545, USA. Emails: {manaj2,kekatos}@vt.edu; [email protected]; [email protected] versions of one or more of the figures is this paper are available online at http://ieeexplore.ieee.org.Digital Object Identifier XXXXXX
Abstract
Smart inverters have been advocated as a fast-responding mechanism for voltage regulation in distribution grids. Nevertheless, optimal inverter coordination can be computationally demanding, and preset local control rules are known to be subpar. Leveraging tools from machine learning, the design of customized inverter control rules is posed here as a multi-task learning problem. Each inverter control rule is modeled as a possibly nonlinear function of local and/or remote control inputs. Given the electric coupling, the function outputs interact to yield the feeder voltage profile. Using an approximate grid model, inverter rules are designed jointly to minimize a voltage deviation objective based on anticipated load and solar generation scenarios. Each control rule is described by a set of coefficients, one for each training scenario. To reduce the communication overhead between the grid operator and the inverters, we devise a voltage regulation objective that is shown to promote parsimonious descriptions for inverter control rules. Numerical tests using real-world data on a benchmark feeder demonstrate the advantages of the novel nonlinear rules and explore the trade-off between voltage regulation and sparsity in rule descriptions.
Index Terms:
Support vector machines; multi-kernel learning; voltage regulation; linearized distribution flow model.
I Introduction
Several electric utilities in the US currently experience issues while integrating residential- and commercial-scale solar generation. A solar farm connected at the end of a long rural feeder can incur voltage excursions along the feeder, while frequent power flow reversals strain the apparent power capabilities of substation transformers [1]. Solar generation from residential photovoltaics (PVs) can fluctuate by up to 15% of their rating within one-minute intervals [2]. Utility-owned voltage control equipment, such as load-tap-changing transformers, capacitor banks, and step-voltage regulators, involves discrete control actions, and its lifespan is related to the number of switching operations [3]. Regulating voltage under increasing renewable generation may require more frequent switching and further installations, thus critically challenging reactive power control in distribution grids.
On the other hand, PVs are interfaced by inverters featuring advanced communication, metering, and control functionalities. Using inverters for reactive power control has been advocated as a fast-responding solution [1]. The amended IEEE 1547 standard allows inverters to be operating at non-unit power factors [4]. Nonetheless, coordinating in real-time hundreds of inverters distributed over a feeder is a formidable task. In a typical setup, the values of instantaneous loads and solar generation are communicated to a utility controller; the controller minimizes ohmic losses subject to voltage regulation constraints; and the computed setpoints are sent back to inverters. The problem of finding the optimal reactive injection setpoints for inverters is an instance of the optimal power flow (OPF) task, which is non-convex in general. Different convex relaxations have been proposed; see [5] for a survey. The uncertainty in loads and solar generation over the next control period is usually accounted for through stochastic and robust formulations [6], [7]. To reduce complexity, approximate grid models have also been employed [8], [9]; though heavy two-way utility-inverter communication is still needed.
Alternatively, decentralized solvers where inverters decide their setpoints upon communicating with neighboring inverters have been devised [10], [11], [12]. On the other extreme, localized schemes suggest having inverters implementing Volt-VAR and/or Watt-VAR curves given only local measurements [1]. Although such rules have been analytically shown to be stable and fast-converging, their equilibria unfortunately do not coincide with the sought OPF minimizers [13], [8], [14], [15]. In fact, there exist cases where local rules perform worse than the no-reactive support option [16].
The previous literature review indicates that centralized schemes incur high computational complexity; decentralized solvers require multiple communication exchanges among inverters; and local schemes have no performance guarantees. As a middle-ground solution, inverter setpoints can be designed in a quasi-static fashion via control rules. A rule expresses each setpoint as an affine function of given inputs, such as generation, load, or voltage. Albeit the related weights are optimized periodically in a centralized fashion, control rules are applied in real time. Controlling inverters via affine rules has been accomplished using chance-constrained [17]; robust [16], [18]; and closed-loop formulations [19]. Optimal rules however are not necessarily linear: If an apparent power constraint becomes active, reactive injections can become nonlinear functions of solar generation. To capture this nonlinearity, recent approaches engage learning models which are trained to optimize: Given pairs of grid conditions (load and solar generation) and their optimal inverter dispatches computed, the aforesaid approaches learn dispatch rules using linear or kernel-based regression [20], [21].
This work combines machine learning tools with physical grid models, and advocate a kernel-based approach for designing inverter control rules. The contribution is on two fronts: First, the design of inverter control rules is posed as a multi-task learning problem. Each inverter rule is modeled as a nonlinear function of control inputs. Rules are coupled through the electric grid to yield a system voltage profile. Using an approximate grid model, inverter rules are learned jointly so that they minimize a voltage regulation cost using anticipated load and solar generation scenarios. Each rule is described by a set of coefficients, one for each scenario. As a second contribution, we engineer the voltage regulation objective, so that the optimal rules are described by a few scenario coefficients. Such parsimonious representation of inverter rules saves communications. Numerical tests on a benchmark feeder showcase the advantages of nonlinear rules and explore the trade-off between voltage regulation and sparse rules.
Regarding notation, lower- (upper-) case boldface letters denote column vectors (matrices), while calligraphic symbols are reserved for sets. Symbol ⊤ stands for transposition and denotes the -norm of .
II Reactive Power Control
This section formulates the task of voltage regulation using inverters. Consider a distribution grid having buses served by the substation indexed by . Let denote the voltage magnitude, and the complex power injection at bus . The active injection is decomposed into , where is the solar generation and the inelastic load at bus . Reactive injections can be similarly expressed as . Collect injections in -length vectors:
[TABLE]
The reactive power injected by inverter is constrained as
[TABLE]
where is the apparent power limit for inverter ; see [1].
Given loads and solar generation , voltage regulation aims at optimally setting such that voltage deviations are kept minimal. To formally describe this task, one has to deal with the nonlinear power flow equations relating voltages to power injections. Trading modeling accuracy for computational tractability, we resort to the linearized model [22]
[TABLE]
where and matrices depend on the feeder. Model (3) can be derived by linearizing the power flow equations around the flat voltage profile. In fact, the linearization can be performed at any system state , yet matrices and would then depend on the state ; see [19]. From (1) and (3), the vector of voltage deviations from its nominal value can be approximated as
[TABLE]
where and is a vector of all ones.
The goal here is to design the inverter injections so that bus voltage magnitudes remain within regulation limits. The ANSI-C.84.1 standard dictates that service (load) voltages should remain within per unit (pu). However, our grid model of (4) stops at the level of distribution transformers. A distribution (pole or pad-mounted) transformer may be serving several residential customers. Each customer is typically connected to the distribution transformer through a triplex cable, which incurs a voltage drop between the transformer and the service voltage: Suppose a customer is connected to a 50 kVA, 7200-240/120 V center-tapped transformer via a 1/0 AA 100-ft triplex cable. The customer runs a constant-current load of 10 kVA at the nominal voltage of 120 V with 0.9 lagging power factor. If load currents are equally distributed among the three supplies (two 120 V and one 240 V), the service voltage drops by pu. If the load is distributed among supplies non-uniformly, the service voltage can drop by even pu. Due to this, the current practice is to maintain voltages at distribution transformers within pu, to ensure that service voltages remain within pu; see exercises of [23].
Given loads, solar generation, and grid parameters, the goal is to decide to regulate voltage while satisfying the apparent power constraints of (2). The setpoints for reactive power injections from inverters can be found as the minimizer
[TABLE]
The set captures the constraints in (2) for all ; and is a voltage regulation objective. A typical choice for is the sum of squared voltage deviations [13], [16], [19]
[TABLE]
Alternatively, the utility may want to maintain voltages within the range of for say . Then, a pertinent objective is [14]
[TABLE]
where is the -th canonical vector of length , and the operator is defined as
[TABLE]
Function returns zero when all voltages are within limits. Otherwise, it increases linearly with voltage excursions; see [14] for distributed solvers of (5) with .
It is worth noticing that depends only on the network and the linearization point, whereas the set and vector depend on the variable loads and solar generation, collectively denoted as .
Ideally, the reactive control process entails three steps:
- S1)
Each bus communicates its to the operator. 2. S2)
The operator solves (5) knowing the current . 3. S3)
The operator sends the optimal setpoints to inverters.
Under variable solar generation, the process has to be repeated on a per-minute basis. Observe that S1) establishes inverter-to-utility communication links, and S3) requires another utility-to-inverter links. Running this process for multiple feeders can become a computationally and communication-wise challenging task.
To adaptively adjust inverter setpoints based on , affine control rules in the form of have been suggested in [16], [17], [18]. Based on these rules, the reactive injection of inverter is expressed as an affine function over a subvector of . The premise is to design the rule in a quasi-stationary fashion, but apply it in real-time. We extend linear to nonlinear control rules enjoying varying cyber requirements after briefly reviewing the toolbox of kernel-based learning.
III Preliminaries on Kernel-based Learning
Given pairs of features belonging to a measurable space and target values , kernel-based learning aims at finding a function or mapping . From all possible options of arbitrarily complex functions, one needs to select a specific family where belongs. Kernel-based learning postulates that lies in the function space [24]
[TABLE]
This is the space of functions that can be expressed as linear combinations of a given kernel (basis) function evaluated at arbitrary points . When is a symmetric positive definite function, then becomes a reproducing kernel Hilbert space (RKHS) whose members have finite norm ; see [25]. Some options for the kernel function are provided under Examples 1–2 in Section IV-A.
Learning from data can be formulated as the regularization task [24], [26]
[TABLE]
where is an intercept term. When it comes to regression, typical choices for the data-fitting loss include the least-squares (LS) fit , or the -insensitive loss . The second term in (10) ensures and facilitates generalization over unseen data [25]. Parameter balances fitting versus generalization, and is tuned via cross-validation: i) problem (10) is solved for a specific using of the data; ii) the learned function is validated on the unused of the data; iii) the process is repeated times to calculate the average fitting error for this ; and iv) the attaining the best fit is selected; see [24] for details.
The advantage of confining to lie in the RKHS is that the functional optimization of (10) can be equivalently posed as an minimization problem over a finite-dimensional vector: The celebrated Representer’s Theorem asserts that the solution to (10) admits the form [24]
[TABLE]
In other words, the minimizer of (10) is described only by rather than infinitely many ’s. Based on (11), evaluating at the given data provides , where ; matrix is the kernel matrix with entries ; and .
From properties of the RKHS’s, it holds that ; see [25]. For regression under an LS loss, the functional minimization in (10) becomes the vector optimization
[TABLE]
where is the square root of and .
It is worth stressing that (11) applies not only to the given data , but any . Evaluating requires knowing the minimizing (12), and being able to evaluate the kernel for . We next use kernel-based learning to develop nonlinear inverter control rules.
IV Kernel-based Control Policies
The reactive injection by inverter is modeled by the rule
[TABLE]
whose ingredients are explained next.
Control inputs: Vector is the input to control rule for inverter . This vector may include load, solar generation, and/or line flow measurements collected locally or remotely. For a purely local rule, this input can be selected as
[TABLE]
where the first entry relates to the apparent power constraint and has been defined in (2). The voltage could also be appended in ; however the stability of the resultant control loop is hard to analyze even when is linear; see e.g., [27], [14], [15], [19], [20].
Selecting the controller structure, i.e., the content for each , can affect critically the performance of this control scheme. Ideally, each inverter rule can be fed all uncertain quantities, that is the three numbers in the right-hand side of (14) across all buses. In that case, the input vectors become all equal and of size . However, this incurs the communication burden of broadcasting values in real time. Hybrid setups with ’s carrying a combination of local and remote data can be envisioned. To eliminate the effect of this trade-off between communications and performance, this work assumes that the content of ’s is prespecified. The task of input selection could be possibly pursued along the lines of sparse linear or polynomial regression [19], [20], [28]; and automatic relevance determination [29, Sec. 6.4].
Control function: Selecting the form of is the second design task. To leverage kernel-based learning, the inverter rule is postulated to lie in the RKHS
[TABLE]
determined by the kernel function .
Linear rules can be designed by selecting the linear kernel . Nonlinear rules can be designed by selecting for example a polynomial kernel or a Gaussian kernel with design parameters and ; see [24].
Intercept : Although it could be incorporated into by augmenting with a constant entry of , it is kept separate to avoid its penalization through [24].
IV-A Learning rules from scenario data
The rules of (13) can be learned from scenario data indexed by with . Scenario consists of the control inputs for , and the associated vector defined in (4). Evaluating rule of (13) under scenario yields the inverter response . Let us collect the outputs from all inverters into vector . Note that the goal is not to fit by , but to minimize the voltage deviations . The control functions and the intercepts accomplishing this goal can be found via the functional minimization
[TABLE]
where is a voltage regulation objective [cf. (6)–(7)].
Remark 1**.**
The proposed approach is related to [20]–[21], where inverter rules are also trained using machine learning. However, the aforementioned works proceed in two steps: They first solve a sequence of OPF problems similar to (5) to find the optimal inverter setpoints under different scenarios. Secondly, they learn the mapping between controller inputs and optimal setpoints decided by the OPF problems. During this process, they also select which inputs are more effective to be communicated to inverters. The mapping is learned via linear or kernel-based regression. On the other hand, the approach proposed here consolidates the OPF and the learning steps into a single step: The advantage is that the OPF decisions of (16) are taken under the explicit practical limitation that can only be a function of , since inverter will not have access to the complete grid conditions. To get some intuition, suppose ones designs linear control rules of known input structure using the single-step approach of (16) with and the two-step approach of [20]–[21]. The single-step approach yields rules , and the two-step approach yields rules . Let us evaluate and on the training scenarios. Rules are not necessarily feasible per scenario , whereas rules are. Moreover, rules do not necessarily coincide with the minimizers of (5). For the sake of comparison, let us assume that rules turn out to be feasible per scenario, and hence feasible for (16). Being the minimizers of (16), rules attain equal or smaller voltage deviation cost compared to over the training data. Numerical tests in Section VI corroborate the advantage of over for and during the operational phase as well.
Different from (10), the optimization in (16) entails learning multiple functions (one per inverter). Since inverter injections affect voltages feeder-wise, inverter rules are naturally coupled through in (16). Similar multi-function setups can be found in collaborative filtering or multi-task learning [25], [30].
Fortunately, Representer’s Theorem can be applied successively over in (16). Therefore, each rule is written as
[TABLE]
Once the coefficients have been found, rule can be evaluated for any . Similar to (11), evaluating rule over the scenario data gives
[TABLE]
where for , and . The RKHS norms can be written as
[TABLE]
In this way, the functional minimization in (16) is cast as a vector minimization over and . The exact form of this minimization and its properties for different are discussed later in Section V. For now, let us clarify how the kernel functions effect different rule forms.
Example 1: Affine rules. The linear kernel yields affine rules. The sought functions can be written as
[TABLE]
Given scenario data and for and , we would like to find through (16). Collect the input data for inverter in the matrix . According to Representer’s Theorem, the optimal can be expressed as for some . Evaluating the control rule for any input yields
[TABLE]
Evaluating the rule at the input data yields (18) with . The squared function norm is .
Example 2: Non-linear rules. For non-linear rules, transform the input to vector via a non-linear mapping . The entries of could be for example all the first- and second-order monomials formed by the entries of . The dimension of can be finite (e.g., polynomial kernels) or infinite (Gaussian kernels) [29]. Then, the control function
[TABLE]
with is non-linear in . The developments of Example 1 carry over to Example 2 by using and replacing by . Depending on the mapping , the vectors may be of finite or infinite length [24]. The critical point is that does not depend on ’s directly, but only on their inner products for any and . These products can be easily calculated through the kernel function as ; see [24].
Since the constraints in (16) are enforced for the scenario data, the learned rules do not necessarily satisfy these constraints for all with . This limitation appears also in scenario-based and chance-constrained designs [17]. Once a control rule is learned, in real-time , it can be heuristically projected within as
[TABLE]
IV-B Implementing reactive control rules
Our control scheme involves four steps; see also Fig. 1:
- T1)
The utility collects scenario data for all and . 2. T2)
The utility designs rules by solving (16); see Section V. 3. T3)
Each inverter receives data from the utility, which describe . 4. T4)
Over the next minutes and at real time , each inverter will be collecting and applying the rule
[TABLE]
The aforesaid process is explicated next. Regarding T1), scenario data should be as representative as possible for the grid conditions anticipated over the following -min control period. One option would be to use load and solar generation forecasts. A second option would be to use historical data from the previous day and same time, if they representative of today’s conditions. A third alternative would be to use the most recent grid conditions known to the utility. For example, if smart meter data are collected every min anyway, they can be used in lieu of forecasts for the next control period.
The numerical tests of Section VI adopt the third option and use the minute-based grid conditions observed over the last -minutes as scenarios to train the inverter rules for the upcoming -minute interval. Obviously, the number of training scenarios does not have to coincide with the length of the control period measured in minutes. These two parameters relate to loading conditions; feeder details; availability and quality of scenario data; communication and computational resources. Selecting their optimal values goes beyond the scope of this work.
During T4), inverter has already received and during T3). Each may consist of local data and a few active flow readings collected from major lines or transformers. If the entries of are all local, the rule can be applied with no communication. Otherwise, the non-local entries of have to be sent to inverter . If non-local inputs are shared among inverters, broadcasting protocols can reduce the communication overhead.
Remark 2**.**
Suppose each inverter knows the training data for . Function can be described in two ways: Either through (17) using the data described under T3); or through (20)–(21) via . For the second way, vector has entries in the linear case and entries in the nonlinear case. For the linear case, if , representing through (20) by is more parsimonious. Representation (17) becomes advantageous only when under the nonlinear case.
V Support Vector Reactive Power Control
This section converts (16) to a vector minimization and explores different options for . From (18), the output of inverter across all scenarios is . Then, the apparent power constraints in (16) can be written as
[TABLE]
where . Moreover, the vector of voltage deviations can be expressed as
[TABLE]
where is the -th column of . Substituting (19) and (23)–(V), the optimization in (16) can be posed as a second-order cone program (SOCP) over and .
Nonetheless, solving (16) with yields optimal ’s with several non-zero entries. This means that to describe rule by (22), the utility needs to communicate the entire vector during T3). If scenarios are not known by the inverter, they have to be communicated along with as well. The number of scenarios may be large when learning rules under complex feeder setups. A related approach for minimizing a convex combination of and power losses has been suggested in the conference precursor of this work [31], but inherits the same difficulty of non-sparse ’s.
Inspired by support vector machines (SVM), we engineer to obtain inverter rules described by possibly fewer scenarios: Promoting sparse ’s alleviates the communication overhead during step T3). To this end, we put forth the cost
[TABLE]
for some . If scenario yields a vector of voltage deviations with -norm smaller than , this scenario incurs no cost. If , the voltage regulation penalty grows with . The cost in (25) can be expressed as an SOCP over the slack variable
[TABLE]
Applying the same epigraph trick for the function norms, problem (16) can be solved as the SOCP
[TABLE]
where and . The variables can be eliminated using the substitutions of (V). Solving (26) takes operations with interior point-based solvers [32]. However, the advantage of inverter control rules is that (26) is not solved in real time. If standard interior point-based solvers are not scalable to larger grids, one may resort to (distributed) first-order algorithms; warm-start initializations; and cutting-plane methods.
The coefficients ’s minimizing (26) enjoy two types of sparsity, across inverters and across scenarios. To explain the first type of sparsity, express the second summand in the cost of (26) as . Having these non-squared -norms in the objective promotes block sparsity across , in the sense that for larger , some vectors may be set to zero. This effect is a direct consequence of block-sparse solutions encountered in group Lasso (G-Lasso)-formulations; see [33], [26], [30]. All inverters receive a reactive power setpoint , but if the optimal becomes zero, inverter will not be changing its reactive injection in real-time. One may drop the intercept from the control rule of (13) and the optimization of (26), and modify the feature vector as
[TABLE]
Thus, obtaining from (26) enables inverter selection.
The next proposition studies the second type of sparsity; see the appendix for a proof.
Proposition 1**.**
Consider (16) with and its minimizer in (17). If for scenario at the optimum, then for every inverter with .
Proposition 1 explains how promotes block sparsity across : If scenario does not experience severe voltage violations, the corresponding coefficients will be zero for all inverters that have not reached their apparent power limit. Block sparsity across time identifies non-critical scenarios. Phrased in the SVM context, the so-termed ‘support vectors’ here correspond to scenarios with significant voltage deviations. Larger values of effect fewer critical scenarios.
These two forms of sparsity offer communication savings since the related do not need to be communicated to inverters. This enables training the rules for larger number of scenarios at the same communication overhead. Note that for fixed , the sparsity of ’s depends on the training data ’s as well. If a particular sparsity goal is to be met, the utility has to solve (26) repeatedly for various values of and . Such computations can be significantly sped up by initializing an optimization algorithm for one value of to the minimizer obtained using the previous value of [24, Sec. 18.4]; however, such techniques will not be pursued here.
Different from , cost is not expected to yield as sparse ’s. The next claim (proved in the appendix) explains that even if a single bus experiences voltage deviation larger than for scenario , then for all . In other words, a voltage violation at a single bus for scenario renders this scenario critical for all inverter rules.
Proposition 2**.**
Consider (16) with and its minimizer in (17). If for scenario at the optimum, then for all .
VI Numerical Tests
The novel inverter rules were tested on the IEEE 123-bus feeder [34], converted to a single-phase grid as described in [35]. Residential load and solar data were extracted from the Pecan Street dataset as delineated next [2]. Minute-sampled active load and solar generation data were collected for June 1, 2013 between 8:00–16:00. We downloaded data from the first 123 Pecan Street nodes, after excluding nodes with empty data records. Regarding solar generation, unless stated otherwise, of the buses had solar generation by excluding nodes with bus indexes that are multiples of .
Load data were scaled on a per bus basis so that their daily peak values matched of the benchmark load. Since the Pecan Street data included only active power, we drew lagging power factors uniformly at random within for each bus and kept them fixed across time. The scaling factors for active loads were also used for scaling solar data. To allow for reactive power compensation even at peak solar irradiance, inverters were over-sized by providing an apparent power capacity of for all ; see [1].
Our numerical tests included six control schemes:
C1) The optimal reactive injections computed by (5) on a per-minute basis;
C2) The optimal reactive injections computed by (5) on a per-minute basis assuming a -minute communication delay;
C3) The fixed Watt-VAR control rules of [1, (12)–(14)];
C4) The rules of (16) for linear kernels and ;
C5) The rules of (16) for Gaussian kernels and ;
C6) The rules of (16) for linear kernels and ; and
C7) The rules of (16) for Gaussian kernels and .
The input to inverter consisted of local data as in (14). Each entry of was centered by its daily mean and normalized by its daily standard deviation. To avoid rank deficiency, we added to all kernel matrices.
Schemes C1),C2) were solved using SDPT3 and YALMIP with MATLAB [36, 37]. Schemes C4)–C7) were solved by invoking the MOSEK solver directly through MATLAB [38]. Tests were run on a 2.4 GHz Intel Core i5 laptop with 8 GB RAM. The average running time for solving (16) with is given in Table I. It should be emphasized that although the control rules were designed using the LDF grid model, the voltage deviations experienced by all control rules were tested using the full AC model.
During training, we used scenarios to learn the SVM-based control rules of C4)–C7). These scenarios comprised the load and solar data observed during the last minutes. During validation, the inverter control rules were tested over the following minutes. Parameters and were selected via -fold cross-validation. The ranges of and were empirically chosen to yield an average communication overhead similar to the one needed by the affine rule of (20) as discussed under Remark 2: An affine rule is described by data per inverter. If only of the entries of are nonzero, then communicating entails sending data as well. The sparsity of ’s depends on input data along with the values of or . These parameters were set so that ’s had nonzero entries on the average across time and buses.
We next explored the trade-off between voltage deviation and the sparsity of ’s for C4)–C7). The expectations from this test were two: i) voltage deviations are expected to increase for sparser ’s; ii) schemes C4) and C5) should exhibit improved sparsity over C6) and C7). To validate these hypotheses, we recorded the voltage deviations for values of and for C4)–C7). The average absolute voltage deviation and the average percentage of non-zero coefficients were calculated over the day and across buses, and are shown in Figure 2. From Figure 2, the value of yielding a sparsity of roughly is . Figure 2 reveals three important points. First, voltage deviations increase as ’s become sparser as expected. Second, for a given sparsity in ’s, the rules obtained by exhibit smaller voltage deviations compared to the rules obtained by . Because of this, we focus on the performance of C4)–C5) for the rest of this section. Third, the Gaussian kernel-based rules attained lower voltage deviations than the related linear kernel-based rules.
We next tested the effect of on inverter selection and voltages. Larger values of are expected to set more ’s to zero. To eliminate the inverters with , the parameter was appended in as delineated in (27). For a fixed value of , for scheme C4), the values of were obtained using cross-validation across the day. The control rules were designed again using different values of . As expected, by increasing the value of , the number of all-zero ’s and the corresponding voltage deviations were increased. Figure 3 depicts the absolute voltage deviation averaged over time for each inverter. Notice that the values of and were kept fixed, although the training data ’s varied across the day. Due to this, the reported sparsity in Figure 2 is the average sparsity across time and inverters. Moreover, the number of inverters in Figure 3 is the average number of activated inverters across the day. Even though the values of and can be adjusted on a -min basis to meet specific sparsity requirements, we chose to keep them fixed to simplify the exposition. In fact, the rest of this section reports the worst-case instead of average voltage deviations across time and for each bus.
We next compared the proposed SVM-based control rules against the alternative schemes of C1)–C3). To this end, voltage deviations were calculated between 8:00–16:00 for schemes C1)–C5). Figures 4 and 5 demonstrate the average and the maximum voltage deviations over the test period. It can be observed from both figures that the Gaussian SVM-based rule performs better than C1)–C3) due to its ability to capture non-linear behaviors. Although C3) needs no communication, it violates the ANSI-C.84.1 standard voltage constraints. Furthermore, despite the high communication needed, scheme C2) shows no superiority in performance over C5) and corroborates the need for real-time response to system inputs.
In all previous tests, the rules were fed with locally recorded data. To evaluate the advantage of adding remote control inputs, we appended the values of active power flows on the lines feeding buses , , and , to all input vectors . The daily maximum and the average voltage deviations attained by C1)–C5) are depicted in Figures 6 and 7, respectively. As expected, the results suggest that adding remote inputs to the rules improves the grid voltage profile at the expense of increased inter-network communication.
As mentioned in Section IV-B, the length of the control period (in minutes) over which rules remain constant does not have to agree with the number of scenarios used for training the rules. To evaluate how the control rules perform for longer control periods, Figure 8 compares the voltage deviations obtained by training rules using scenarios, but keeping them unaltered over , , and minutes. As expected, voltage regulation deteriorates as rules remain unchanged for longer periods.
All previous tests assumed solar penetration of . We also tested the performance of C1)–C5) under penetrations of and . To simulate penetration, solar generation and smart inverters were installed only in buses with even indexes. Likewise, to simulate penetration, we considered buses whose indexes were multiples of . Figures 9 and 10 depict the attained maximum absolute voltage deviations, which apparently decrease with decreasing solar penetration. For lower penetrations, the Gaussian-based rule preserves its superior voltage profile over the other schemes.
Schemes C4) and C5) were also tested under less communication by scaling down the sparsity in ’s by a factor of : Voltage deviations were evaluated for corresponding to non-zero entries for ’s on the average. Figure 11 demonstrates the maximum absolute voltage deviation for C1)–C5). Even with fewer coefficients communicated, the voltage constraints of ANSI-C.84.1 were still satisfied.
The last set of numerical tests compares the developed single-step approach with the two-step approach of [20]–[21]; see also Remark 1. We used both approaches to design local linear control rules for the IEEE 13-bus feeder [23], under the voltage deviation cost with . The top and center panels of Fig. 12 show respectively the maximum and average voltage deviation per bus computed across time. The bottom panel shows the voltage deviation cost , time-averaged per control period. The bottom panel also shows the voltage deviation cost with attained upon training both rules using instead of . Similar results were obtained for other values of and . According to these tests, the single-step approach achieved: 1) lower maximum per-bus voltage deviations; 2) lower average per-bus voltage deviations; and 3) smaller voltage deviation costs during the operational phase.
VII Conclusions
A novel approach for designing inverter control rules has been put forth. It relies on both data-based learning and physical grid modeling. Inverter rules are not learned independently using input/output pairs of the OPF problem. Instead, they are learned jointly by posing the related OPF problem as a multi-function learning task. Because of the way voltage deviations couple inverter outputs, the conventional support vector machine approach fails to yield sparse rule descriptions. We have engineered a voltage deviation cost to identify ‘support scenarios,’ that is a few scenarios with non-zero coefficients for most of inverter rules. The devised control rules were tested using on a benchmark feeder using the exact AC model. The novel scheme attained superior voltage regulation performance compared to preset local rules, and oftentimes comparable performance to an optimal inverter dispatch delayed by minutes. The numerical tests have further corroborated the benefits of nonlinear rules with non-local inputs, and explored the trade-off between voltage regulation performance and sparsity. Finally, this work motivates several questions. On the implementation side, testing the novel formulations on multiphase grids along with capacitor banks, voltage regulators, and ZIP loads, is of practical interest. On the analytical side, chance-constrained formulations; studying the stability of nonlinear rules with voltages as inputs; using kernels to learn functions with constraints; and selecting non-local control inputs; are some open and interesting questions.
Proof:
Consider first the linear rules of (20), for which for all . Problem (16) with can be reformulated as
[TABLE]
Express voltage deviations at in terms of ’s and
[TABLE]
Let us next introduce the Lagrange multipliers [32]:
- •
and corresponding to the linear inequalities in (28c) for all ;
- •
related to constraint (28d) for all ; and
- •
related to constraint (28e) for all .
Collect multipliers in , and vectors and . After some algebra, the Lagrangian of (28) can be written as
[TABLE]
Minimizing over the primal variables provides
[TABLE]
From (30), the dual of (28) becomes the SOCP problem
[TABLE]
It is not hard to check that (28) and (31) are strictly feasible, so strong duality holds and both problems are solvable. The optimal primal and dual variables satisfy complementary slackness SOCPs; see [32, Sec. 4.1]. For constraints (28d) and (31d), these conditions identify three cases:
- c1)
If , then ; 2. c2)
If , then ; or 3. c3)
If and , then .
Recall that from (30b). Moreover, it is not hard to see that at the optimum of (28). Then, case c1) cannot occur. The other two cases entail that for some . Substituting from (30c), and evaluating rule at the tested scenarios gives
[TABLE]
Here we identify and the coefficients in (22) as
[TABLE]
Focus now on the complementary slackness for (28e) and (31e). The equivalent to condition c1) reads now as:
- c1’)
If , then .
Suppose the optimal primal variables satisfy . Then follows from (28), and c1’) gives . The -th entry of in (32) is
[TABLE]
Complementary slackness for (28c) implies that if at the optimal, thus proving the claim for linear rules. The result in (33) holds for nonlinear rules too. The analysis carries over upon matching the length of with the length of , and substituting by . ∎
Proof:
Rewrite (16) for as
[TABLE]
The Lagrangian multipliers of (34) are similar to shose of (28), except for being replaced by () and collected in and . Minimizing the Lagrangian of (34) over the primal variables yields
[TABLE]
Similar to Prop. 1, the -th entry of becomes
[TABLE]
If the optimal primal variables satisfy , then and accordingly, complementary slackness for (34e) implies that or . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. Turitsyn, P. Sulc, S. Backhaus, and M. Chertkov, “Options for control of reactive power by distributed photovoltaic generators,” Proc. IEEE , vol. 99, no. 6, pp. 1063–1073, Jun. 2011.
- 2[2] (2018) Pecan Street Inc. Dataport. [Online]. Available: https://dataport.cloud/
- 3[3] Y. Agalgaonkar, B. Pal, and R. Jabr, “Stochastic distribution system operation considering voltage regulation risks in the presence of PV generation,” IEEE Trans. Sustain. Energy , vol. 6, no. 4, pp. 1315–1324, Oct. 2015.
- 4[4] IEEE 1547 Standard for Interconnecting Distributed Resources with Electric Power Systems , IEEE Std., 2018. [Online]. Available: http://grouper.ieee.org/groups/scc 21/1547/1547_index.html
- 5[5] S. Low, “Convex relaxation of optimal power flow — Part II: Exactness,” IEEE Trans. Control Netw. Syst. , vol. 1, no. 2, pp. 177–189, Jun. 2014.
- 6[6] G. Wang, V. Kekatos, A.-J. Conejo, and G. B. Giannakis, “Ergodic energy management leveraging resource variability in distribution grids,” IEEE Trans. Power Syst. , vol. 31, no. 6, pp. 4765–4775, Nov. 2016.
- 7[7] Y. Zhang, N. Gatsis, and G. B. Giannakis, “Robust energy management for microgrids with high-penetration renewables,” IEEE Trans. Sustain. Energy , vol. 4, no. 4, pp. 944–953, Oct. 2013.
- 8[8] M. Farivar, R. Neal, C. Clarke, and S. Low, “Optimal inverter VAR control in distribution systems with high PV penetration,” in Proc. IEEE Power & Energy Society General Meeting , San Diego, CA, Jul. 2012.
