Decentralized DC MicroGrid Monitoring and Optimization via Primary Control Perturbations
Marko Angjelichinoski, Anna Scaglione, Petar Popovski, Cedomir, Stefanovic

TL;DR
This paper presents a decentralized method for DC MicroGrid monitoring and optimization that uses local voltage measurements and primary control perturbations, eliminating the need for external communication.
Contribution
It introduces a novel decentralized estimation approach integrated within primary control loops for autonomous MicroGrid operation.
Findings
Accurate estimation of generation capacities, load demands, and line conductances.
Effective decentralized solution for economic dispatch.
Validation through simulations demonstrating practical applicability.
Abstract
We treat the emerging power systems with direct current (DC) MicroGrids, characterized with high penetration of power electronic converters. We rely on the power electronics to propose a decentralized solution for autonomous learning of and adaptation to the operating conditions of the DC Mirogrids; the goal is to eliminate the need to rely on an external communication system for such purpose. The solution works within the primary droop control loops and uses only local bus voltage measurements. Each controller is able to estimate (i) the generation capacities of power sources, (ii) the load demands, and (iii) the conductances of the distribution lines. To define a well-conditioned estimation problem, we employ decentralized strategy where the primary droop controllers temporarily switch between operating points in a coordinated manner, following amplitude-modulated training sequences.…
| Parameter | Value |
|---|---|
| Simulation platform | MATLAB |
| Reference voltage (volts) | |
| Lower and upper voltage margins , (volts) | , |
| Distribution network topology | cut-ring |
| Max. gen. capacity per DER (kW) | |
| Max. const. conductance demand per bus (kW) | |
| Max. const. current demand per bus (kW) | |
| Max. const. power demand per bus (kW) | |
| Average conductance per line (S) | |
| Sampling frequency (kHz) | |
| Sampling noise standard dev. (volts/sample) | |
| Transient time duration (ms) | |
| Nominal droop control params. , (volts) | , |
| Max. voltage drop in -phase (volts) | |
| Total number of slots in the training epoch | |
| Total number of slots in sub-phase | |
| Total number of slots per block in sub-phase | |
| Other -phase params. , , | , , |
| OED epoch duration (s) | |
| Backup gen./storage cost (units/W) | |
| DC bus signaling threshold |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Decentralized DC MicroGrid Monitoring and Optimization via Primary Control Perturbations
Marko Angjelichinoski, Anna Scaglione, Petar Popovski, Čedomir Stefanović M. Angjelichinoski, P. Popovski and Č. Stefanović are with the Department of Electronic Systems, Aalborg University, Denmark (e-mail: @es.aau.dk). A. Scaglione is with the School of Electrical, Computer and Energy Engineering, Arizona State University, AZ, USA (e-mail: [email protected]).The work presented in this paper was supported in part by EU, under grant agreement no. 607774 “ADVANTAGE”.
Abstract
We treat the emerging power systems with direct current (DC) MicroGrids, characterized with high penetration of power electronic converters. We rely on the power electronics to propose a decentralized solution for autonomous learning of and adaptation to the operating conditions of the DC Mirogrids; the goal is to eliminate the need to rely on an external communication system for such purpose. The solution works within the primary droop control loops and uses only local bus voltage measurements. Each controller is able to estimate (i) the generation capacities of power sources, (ii) the load demands, and (iii) the conductances of the distribution lines. To define a well-conditioned estimation problem, we employ decentralized strategy where the primary droop controllers temporarily switch between operating points in a coordinated manner, following amplitude-modulated training sequences. We study the use of the estimator in a decentralized solution of the Optimal Economic Dispatch problem. The evaluations confirm the usefulness of the proposed solution for autonomous MicroGrid operation.
Index Terms:
direct current MicroGrids, droop control, training, Maximum Likelihood, Optimal Economic Dispatch
I Introduction
Since their inception, MicroGrids (MGs) have evolved substantially, particularly in the domain of low voltages (LV), leading to variety of use cases and topologies [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]: from small clusters of distributed energy resources (DERs) serving houses or buildings, to large meshes of small MGs covering large areas, such as neighborhoods, industrial complexes and remote villages. As a result, the future smart grid (SG) is envisioned as a mesh of interconnected autonomous MG systems. It is also within the field of MGs where direct current (DC) power networks have experienced a renaissance due to the seamless integration with DC renewable generation, DC energy storage systems and DC smart loads [2, 3, 4]. Hence, LV DC MGs are considered as a solution for residential and industrial use cases.
A distinctive characteristic of DC MGs is the use of programmable DC/DC and AC/DC power electronic converters (PECs) to connect the DERs to the DC distribution system. PECs are digital signal processors (DSPs) that allow for software implementation of advanced control systems [2, 3]. Leveraging on the advanced features of PECs the control system design also shifted from simple strategies, suitable for small systems [12, 13, 14], to modular hierarchical architectures where several interacting control layers dynamically respond to state variations on different time scales and pursue various complementary objectives [3, 4, 15, 16, 17, 18, 19, 20, 21, 22, 23]. Specifically, the MG control plane is organized into dual-layer architecture, comprising primary and upper control layer [3, 15]. The primary control is decentralized and deals with high frequency dynamic compensation and state regulation [3]. The upper control layer deals with slow, global changes in the MG by providing updated primary control references and is implemented in distributed/centralized fashion [15, 16, 17, 18, 19, 20, 21, 22, 23]. An exemplary upper layer application is the Optimal Economic Dispatch (OED), which aims to compute the optimal dispatch policies that minimize the total generation cost while keeping the load balanced [17].
The standard design assumption is that the feedback of the upper control layer is closed via an external communication system, usually via off-the-shelf wireless technologies [3, 17, 21]. However, this approach was challenged recently due to several issues [2, 3]. First, the distributed power systems, particularly MGs, are dynamic and ad-hoc in nature, thus the installation of communication hardware may prove impractical and cost inefficient. Second, the external communication system reduces the resilience of the overall MG system, as it becomes a factor in the system reliability/availability. Finally, there is a growing concern about the cyber-security of power systems that exploit external communications, as the related security threats and attacks might severely compromise their stability and operation, leading to blackouts, equipment damage, data theft and investment losses [24, 25, 26, 27].
A straightforward solution would be to remove the upper layer completely and run the DC MG only with primary control without any further coordination. However, the approach is not suitable for advanced MG topologies, as it can not foster optimal and sustainable regulation. The DC bus signaling has been introduced as an enhancement of the above idea [12, 13, 14]. It uses the variations of the steady state bus voltage as an implicit coordination signal that tells the DERs how to behave in specific conditions. The idea is motivated by the fact that DC systems are inherently tolerant to steady state voltage variations, allowing for voltage ripples of up to [2, 3, 5]. Each PEC monitors the local voltage and if it the crosses predefined threshold, the PEC takes predefined actions. This approach has reliability, availability and security advantages over traditional networked design and requires only software modifications of the PECs. However, it is configuration-dependent, performing well in environments with predictable loads, but not in large, dynamic and general-purpose MGs. Moreover, the range of upper layer applications that can be supported is limited. Another alternative to wireless communications is to use conventional powerline communications (PLC) [28]. This way, some of the security concerns can be alleviated as now an attacker would need physical access to the MG. Nevertheless, PLCs are still essentially an external communication system coupled to the control of the MG, as they require installation of dedicated modems.
Motivated by the shortcomings of the above approaches, we propose a decentralized dual-layer control architecture for autonomous DC MGs in which each primary controller locally acquires the information required for the operation of the upper layer and determines the updated primary control references without the support of external communication enabler. To support the majority of applications, the upper control layer requires information about: i) the generation capacities of the dispatchable DERs, ii) the demands of the loads, and iii) the conductance matrix of the distribution network [17, 19]. This information can be inferred from local voltage observations, since the bus voltages are functionally related to the MG parameters through a non-linear model. To extract these parameters, the PECs deliberately move the MG through a sequence of sub-optimal states via coordinated and amplitude-modulated perturbations of the primary control parameters, referred to as training sequences. This way, the PECs obtain sequences of local bus voltage measurements from which the required information can be uniquely estimated, provided that the training sequences satisfy sufficiency criteria. To this end, we formulate a constrained Maximum Likelihood (ML) estimation problem that estimates the MG parameters jointly with the state of the DC MG. To solve the non-convex optimization problem, we develop an iterative algorithm and compare its performance against the Cramer-Rao Lower Bound (CRLB). We illustrate the practical potential of the method by applying it in decentralized OED (DOED) and we show how to minimize the operational cost by optimizing the design of the training sequences. The proposed solution does not rely on any additional communication hardware, as it exploits the signal processing capabilities of the PECs and its locally available voltage measurements, such that it can be implemented only in software.
The rest of the paper is organized as follows. Section II gives an overview of the main contributions. Section III introduces the system model. Section IV presents the training protocol and formulates the decentralized system identification problem. Section V is the pivotal section of the paper, presenting our take to the problem formulated in Section IV. Section VI introduces the periodic DOED protocol. Section VII presents the results and Section VIII concludes the paper.
Notation: Column vectors and matrices are denoted by lowercase and uppercase bold letters, e.g., and . is obtained from by removing the element at position . Similarly, is obtained from by removing the -th column . , , , , , and denote the transpose, the pseudo-inverse, the vectorization, the dimension, the rank, the trace and the -norm of the argument. denotes the Kroneker product while and denote the Hadamard (element-vise) product and division of vectors/matrices of adequate dimensions. The vectors , and , denote the all-one, all-zero and the principal coordinate vector, , denote the all-one and all-zero matrices, and is the identity matrix. denotes diagonal matrix with the entries of on the main diagonal. We frequently use the identity where the matrix .
II Overview of Contributions
The proposed solution is illustrated in Fig. 1. We consider a generic DC MG model with multiple buses, described in Sections III and IV. We assume that the MG does not have access to reliable external communication resources. The physical state of DC MGs is characterized by the steady state bus voltages. We introduce a parameter vector that collects all system variables whose values are determined by exogenous influences; this includes the generation capacities of the DERs, the load demands and the distribution network topology, i.e., the conductance matrix, see Section IV-A. Using the power balance equation, we represent the bus voltages thorough a non-linear and implicit model, parametrized by , see Section III-B. Evidently, varies with time; to respond to its variations on different time scales, the DC MG is governed by a hierarchical control system, comprising primary and upper control layer. The primary control is decentralized: several controllers regulate the bus voltages, using only local feedbacks without exchanging any information with peer controllers. They are very fast and capable of responding to high frequency variations in . Popular primary controller in DC MGs is the Voltage Source Converter (VSC) with voltage droop control, which is reminiscent to the widespread frequency droop control in AC systems, but defined over the DC voltage; it is therefore standard practice to refer to it simply as droop controller [2, 3]. The upper control layer, on the other hand, responds to less frequent changes in that affect the global behavior of the system; examples include changes of the load/generation profile, faults, attacks, etc. Its main role is to adapt the system to the new conditions by computing updated optimal control references for the primary controllers; all upper layer control applications require full/partial knowledge of to determine the control references that adequately reflect the new conditions [15, 16, 17, 18, 19, 20, 21, 22, 23].
Unlike conventional centralized networked control solutions, where the upper control layer is supported by an external communication enabler, we propose a decentralized control architecture that relies solely on the DSP capabilities of the PECs: namely, in our solution the upper control layer is implemented locally within each PEC, and uses only the locally available state measurements, as depicted in Fig. 1. The solution comprises two main functional blocks, i.e., the monitoring and optimization, executed sequentially.
Monitoring. This functional block exploits the fact that the steady state bus voltages are functionally related with through the power balance equation; hence, each controller can compute a local estimate of . The key challenge is that it is impossible to infer by using only local measurements of a single realization of the state, as the system is not observable and the estimation is ill-conditioned. To address this, the monitoring block comprises two procedures: (1) coordinated decentralized training [29, 30] via primary control perturbations, see Section IV, and (2) Joint System Identification and State Estimation (J-SISE), see Section V. During training, the controllers perturb the values of the local droop control parameters, for a limited period of time, following predetermined training sequences. This generates a sequence of different realizations of the state. The controllers collect the local measurements of the state sequence and modulate them into the perturbation signals, see Section V-C. In other words, the relation between the primary control perturbation signals and the induced state deviations is interpreted as the input-output relation of an implicit communication channel [31, 32, 33, 34, 35], through which the controllers exchange their local observations. Hence, the training sequences are used both for generating multiple states and communicating the local state observations. If the training sequences satisfy sufficiency criteria, see Section V-B, each controller is able to compute unique estimate using the steady state voltage measurements acquired during training and the J-SISE algorithm, see Section V-D. The J-SISE is formulated as non-convex, constrained ML optimization problem in classical estimation framework which we solve via iterative algorithm based on partially linearized constraints and evaluate its performance using the CRLB, see Sections V-E and VII-B.
Optimization. The local estimates are used as inputs to an energy management application which computes updated primary control references, see Fig. 1. Any application for which is sufficient can be applied. We focus on DOED with linear generation cost model, since a simple, decentralized closed form solution is available in this case [17, 35]. To this end, we design periodic protocol, detailed in Section VI, where the controllers first perform training and obtain via J-SISE, then re-dispatch. Finally, we show how to minimize the operational cost of the protocol by calibrating the training parameters, see Section VII-C.
We conclude by highlighting the benefits of the proposed solution. First and foremost, it promotes the principle of self-sustainability in SG as it reuses the DSP features of the available power electronics and obviates critical reliance on external communication system. Further, the optimization block is not limited only to OED, as the knowledge of allows each controller to solve locally a great deal of energy management optimizations (even if they do not have decentralized formulation) such as Optimal Power Flow (OPF), Unit Commitment (UC) and security-related applications, such as Fault Detection and Diagnosis (FDD) [21, 19]. This flexibility strengthens the autonomous operation of the DC MG. Finally, the developed framework can be adapted for arbitrary DC MG systems, as discussed in Section VII-B.
III System Model
The terminology and the notation system applied to the model is standardly used in power engineering literature [3, 19]. Section IV introduces compact, matrix notation of the power balance equation which is easier to manipulate later on; this can be also seen as a standalone contribution, as this is the first work that introduces such compact notation for droop-controlled DC MG.
III-A General Multiple-Bus DC MicroGrid
III-A1 Buses and Distribution Network
A DC MG is a collection of DERs and loads, connected to low voltage DC distribution system, see Fig. 2. The distribution system consist of buses, indexed in the set . Each bus in steady state is characterized by a bus voltage , and all DERs and loads connected to bus measure the same voltage . The distribution line connecting buses and has a line conductance denoted by [3]. The topology of the distribution system is specified via the symmetric conductance matrix with elements:
[TABLE]
III-A2 Distributed Energy Resources
We model each DER as separate bus, i.e., we assume that each bus hosts at most one DER; hence, the total number of DERs is and they are indexed in the set . This modeling choice simplifies the notation without losing generality; in fact, if DERs and are connected to the same physical point, i.e., the same bus, by definition . The th DER has current and power output . We assume that the DERs in the MG are small-scale power sources such as renewables (RESs) or distributed generators (DGs) based on traditional fossil fuel. Each DER has an instantaneous generation capacity , and the output power should satisfy .
III-A3 Loads
The th bus hosts a collection of loads, represented through an aggregate model as a mixture of three components (also known as ZIP load model [36]): 1) constant conductance , 2) constant current , and 3) constant power component , see Fig. 2. The quantities , and are the instantaneous power demands of the components at a rated voltage . For a given , the constant power component in steady state is approximated with an equivalent positive current source in parallel with negative conductance and the electrical parameters are [3]:
[TABLE]
III-A4 Primary Control
The DERs use PECs to interface the buses; the bus voltage and/or current , i.e., power are locally controlled through decentralized primary controller, which is a software program executed by the PEC [3]. Two primary control schemes, i.e., modes are commonly used, see Fig. 3: 1) a closed loop Voltage Source Converter (VSC), and 2) an open loop Current Source Converter (CSC). VSC regulates the bus voltage and current of the DER as the loads/generation in the system change in order to keep the bus voltage within predefined margins and foster fair power sharing. It contains fast inner and slow outer control loops. An inner control loop consists of a cascade of voltage and current loops with control bandwidth of the order of several tens of kHz, equal to the sampling frequency of the converter. Its role is to maintain the output bus voltage to specific reference value, dictated by the outer control loop. The outer control loop is closed via filtered current feedback, and is slower than the inner control loop by an order of magnitude. The current feedback generates the reference value for the inner voltage loop, via the following steady state control law:
[TABLE]
This is known as decentralized droop control for DC MGs [3, 15] with two controllable parameters: the reference voltage and the virtual conductance . Their values are set (i) to keep the bus voltage, as closely as possible to the rated voltage , within predefined margins for any , and (ii) to enable fair power sharing among DERs based on their instantaneous generation capacities [3]. Fig. 4 depicts a widespread droop control law that meets the above conditions, with droop control parameters set as follows:
[TABLE]
where is the droop slope (in volts*-2*). The configuration enables proportional power sharing among the DERs. When the DER operates close to its capacity, the maximal voltage drop is , . In steady state, the droop-controlled VSC units are modeled as voltage sources in series with virtual conductance, see Fig. 2.
The other primary control mode CSC does not have outer control loop and inner voltage loops, see Fig. 3. The reference for the inner current loop is generated via a separate algorithm that gets as an input fixed power reference [3]. Hence, a CSC acts as a constant power component, neither participating in voltage regulation nor power sharing. It is modeled as a negative current source and parallel conductance, as in (4) but with opposite sign. It is architecturally equivalent to a negative constant power load, see Fig. 2.
The subsets of DERs operating in VSC/CSC, denoted respectively by , are determined dynamically by the upper layer application, see Section VI for an example. To support this dynamic operation, each converter is assumed to have dual mode, and is capable to switch between VSC and CSC control mode seamlessly [3, 37], see also Fig. 3.
III-B Steady State Equations
A DC MG is governed by Ohm’s and Kirchhoff’s laws, resulting in a system of steady state power balance equations for buses:
[TABLE]
with given by:
[TABLE]
The binary variable in (8) is if DER is configured in VSC/CSC control mode, respectively. The system of equations is quadratic in the bus voltages, such that, in general, a closed form solution for is not possible. The non-linear nature of the power balance equations stems from the presence of constant power components [38], both constant power loads and CSCs. Hence, in the case when for all and , the system (7) becomes linear in the bus voltages. Another special case with closed-form solution is the Single-Bus DC MG which we have studied separately [34] due to its practical importance.
IV Problem Formulation and Training Epoch
The DC MG is not connected to an external communication system and the PECs only have the local voltage/current measurements to work with. To learn (i) the generation capacities of remote DERs, (ii) the power demands of the loads and, (iii) the conductances of the distribution lines, the controllers need to solve a decentralized system identification problem, formulated below.
Before we begin, we list the main assumptions:
- ()
The primary controllers are fully synchronized to a common time reference. 2. ()
No prior knowledge on the generation capacities, load demands or the conductance matrix is used. 3. ()
The rate of load/generation/topology variations is an order of magnitude smaller than the frequency of the primary controllers.
IV-A Parameter Vector
Let be a vector that collects the instantaneous generation capacities of all DERs in the MG. Similarly, the instantaneous load demands are collected in separate vectors: , and . The load demand vector is defined as . Further, we observe that is fully specified by its supra(infra)-diagonal elements, see (3). We organize these elements in a vector , , , with dimension . Using , we can write as the weighted Laplacian , where is the oriented incidence matrix [39].
The deterministic parameter vector is defined as:
[TABLE]
with dimension . From the discussion in Section III-B, the steady state bus voltage depends on , see eq. (6), (8). This suggests that an arbitrary controller can infer the parameter vector locally, using local measurements of the steady state bus voltage (see also [40] and references therein for similar approaches). However, it is impossible to determine uniquely in classical, non-Bayesian estimation framework, using only a single observation of the local steady state bus voltage. To address this issue, the following subsection introduces a technique based on decentralized training via primary control perturbations.
IV-B Training Protocol and Training Sequences
We introduce a dedicated training epoch of predefined duration, in which (i) all controllers switch to VSC mode using a droop control law of the form (6), (ii) perturb their local droop control parameters, causing deviations of the bus voltages, and (iii) measure the local bus voltage response, collecting sequences of steady state bus voltage measurements. The training epoch design uses the assumptions and . Specifically, the time axis during the training epoch is divided into time slots, see Fig. 5, and all controllers are synchronized to this structure. We index each slot with . The slot duration complies with the control bandwidth of the primary control loops, allowing the bus to reach a steady state after a transient time , yielding voltage samples per slot for each controller, see Fig. 5. The system constant , usually several milliseconds [3], is determined by the sampling frequency and the line capacitors. Following , can be assumed to remain constant during the training epoch.
We use to denote the unperturbed, i.e., nominal droop control parameters during the training epoch; we use the law (6) with equal reference voltages and droop slopes:
[TABLE]
In slot , all controllers simultaneously perturb the reference voltages and droop slopes, according to perturbation signals , , ; they are organized in training matrices , , defined as and , , . The columns / of /, correspond to the training sequence injected by controller .
IV-C Steady State Bus Voltages and Measurement Vectors
The steady state bus voltage corresponds to the nominal, unperturbed, droop parameters , . The steady state bus voltage response in the th slot is . The steady state bus voltage matrix is defined as , , . The following proposition characterizes in terms of , and :
Proposition 1**.**
The steady state of DC MG during the training epoch is characterized by the implicit power balance equation:
[TABLE]
where is defined as , and given by:
[TABLE]
The subsets and comprise all training matrices and that keep within .
Proof.
See Appendix A. ∎
The power balance equation (11) reflects the requirement to keep the system balanced and stable, i.e., in a valid (albeit suboptimal) operating point, in each slot during training. It also gives an implicit relation between and , since (11) cannot be solved in closed form for .
The th controller measures the -th column of during the training epoch. The noisy measurement obtained by controller in slot is an average of multiple voltage samples collected during the steady state period of the slot, and can be written as with denoting the additive noise. The bus-voltage measurements matrix , with , , , is given as:
[TABLE]
where represents the noise and is a zero-mean, white Gaussian random vector with standard deviation [41], such that the probability density function (pdf) of is:
[TABLE]
The decentralized system identification problem for DC MGs is about devising an efficient and unbiased estimator of the local parameter vector , denoted with , using only local bus voltage measurements , for any .
IV-D Relaxing Assumptions
We briefly discuss the implications that arise when assumptions are no longer valid; addressing these implications is out of the paper’s scope. We start with , as the strongest assumption. Maintaining precise synchronization among the controllers on the level of slot and training epoch can be easily achieved if the PECs are equipped with GPS modules. Alternatively, one can use common decentralized network synchronization approaches, typically used in sensor networks [42]. Since the method operates in a time scale in the order of milliseconds, it should be significantly easier to maintain (at least coarse) synchronization for long periods of time. Finally, if synchronization is not possible, and the controllers inject perturbation signals without any prior coordination, then the formulation of the problem should be modified accordingly to account for asynchronous training. For instance, the parameter vector should be extended to include binary variables that capture the activity patterns of the controllers and the start times of individual training sequences, as well as their end times in case of variable training sequence durations.
Assumption simply casts our problem in classical estimation framework. In practice, prior knowledge is always available to some extent; in fact, can be assumed to evolve over time following a stochastic process, paving the way for formulating the identification problem in sophisticated Bayesian filtering/prediction framework [43]. Nevertheless, the analysis of the non-Bayesian case naturally comes first.
We use assumption to postulate that remains fixed during training, which is not true in general. In practice, might change at any time due to load/generation variation or a system fault. To incorporate this notion we should reformulate the problem accordingly. One way is to first relax assumption and model the dynamic evolution of via stochastic process, where relaxing assumption arises naturally. We can avoid relaxing and still use the classical framework as presented in the paper, but with modified definition of the parameter vector. For instance, let us assume that has changed no more than times during training; then, the parameter vector should comprise different values for as defined in (9), in addition to the time instances when the changes have occurred. Such formulations in the literature are known as model change detection, see [44].
V Decentralized Generation, Demand and Topology Estimation
V-A Preliminaries and Notation
In the case when the controllers do not not have any knowledge of the steady state bus voltages at remote buses, the system is not observable; hence cannot be uniquely identified in classical, non-Bayesian sense (see Appendix B).
Motivated by the ideas in [31], we propose a decentralized solution that splits the slots into two consecutive training phases: (i) measurement phase, denoted as -phase, and (ii) communication phase, denoted as -phase. The slots in the -phase are used to disseminate the local steady state voltage measurements acquired in the -phase to remote controllers via amplitude modulation of the reference voltage perturbation signals. Each controller then uses a sequential-type of demodulator to process the local bus voltage measurements acquired in the -phase and acquire full knowledge of the portion of that corresponds to the -phase. If the training matrices in the -phase satisfy predefined conditions, elaborated in subsection V-B, then knowing only the -phase portion of is sufficient to uniquely estimate the parameter vector locally.
The temporal organization of the proposed training protocol is depicted in Fig. 6, see also Fig. 7. The -phase is further split into sub-phases (channel estimation sub-phase) and (modulation and demodulation sub-phase). The -phase contains the first slots, indexed in , the sub-phase takes the subsequent slots indexed in and the sub-phase comprises the remaining slots indexed in . The sub-phase is further split into blocks, one for each slot in the -phase, see Fig. 6; hence, the blocks are indexed in . Each block is formed by consecutive time slots, such that . We write where is the set indexing the slots in block . As elaborated in subsection V-C, in block , the controllers disseminate the measurements obtained in slot in the -phase, see Fig. 7. We introduce notation corresponding to (sub-)phase-wise and block-wise partition of the matrices , , , and . Take the measurement matrix as an example (analogous notation applies to , , and ); it can be partitioned as, see Fig. 7:
[TABLE]
The matrix , with , contains the steady state bus voltage measurements from the -phase; , as well as each of the matrices are defined analogously. denotes the th column of ; analogous notation applies to the other matrices.
V-B Sufficient Excitation
The purpose of the -phase is to enable each controller to learn , which is sufficient to generate locally a unique estimate of for any , if and only if the Jacobians of w.r.t. and , denoted with and , respectively, satisfy the rank conditions:
[TABLE]
for any . The sufficient excitation conditions provide practical guidelines for designing the training matrices and ; this is further discussed in subsection V-F.
We note that the vectorization of is linear in :
[TABLE]
In fact, it can be shown that it is always linear in and ; however, the linearity in is a direct corollary of the virtual resistance configuration (6) for proportional power sharing based on the instantaneous generation capacities. This result is useful for finding good initial estimates of which will be used to initialize the iterative algorithm.
V-C Training Phases and Sub-phases
In the -phase, the th controller obtains , the -th column of . Learning the remaining columns and obtaining local copy of , denoted with , is done in the -phase where controller disseminates to remote controllers by modulating the amplitudes of the reference voltage deviations and, in the same time, demodulates from the locally available measurements via sequential demodulator, see Fig. 7.
In the -phase, we adopt the following perturbation signals:
[TABLE]
where is the reference voltage perturbation and is the perturbation amplitude; hence, the droop slopes in the -phase are kept fixed to the nominal value and the communication channel is established via the reference voltage perturbation signals. The -phase training matrices and can then be written as follows:
[TABLE]
where and are the reference voltage perturbation and perturbation amplitude matrices, defined as and , , respectively. To facilitate the design of the demodulator, we make the following small signal assumption: the reference voltage deviation amplitudes in the -phase are relatively small w.r.t. the nominal reference voltage, i.e., , , . Using Taylor’s series expansion, the signal collected by controller in the -phase can be written as:
[TABLE]
The model above defines the input-output relation of a real, linear, synchronous communication channel with channel vector given by the gradient (evaluated at the nominal droop values) which contains the real coefficients of the equivalent linear channels that controller sees to the other controllers; in localized and strongly connected MGs, the entries in do not differ significantly (see also [33]), i.e., the channel (20) experiences strong all-to-all property.
We use the linear model to design sequential transceiver that operates as follows. First, in sub-phase , controller estimates ; for this purpose, we fix the perturbation amplitudes to be all known and equal constants:
[TABLE]
Then, in sub-phase the controllers disseminate the information acquired in the -phase via the following linear amplitude modulation (without any additional error protection):
[TABLE]
where and are known positive constants. Clearly, remains fixed in block , carrying the information about by embedding it into the amplitude of the perturbation signal . The controllers operate in full duplex transmission mode, simultaneously broadcasting and receiving one voltage measurement per block to/from all other controllers.111The scheme suits well channels with strong all-to-all property, i.e., channels where the gains in do not differ significantly; this is the case for small and localized MGs. As the system grows in size and scope, the all-to-all property ceases to be valid and one should consider applying more sophisticated digital modulation/demodulation and scheduling schemes, including error protection coding; see [32, 33] for alternatives.
To guarantee the uniqueness of the local copies , we restrict the columns of the reference voltage perturbation matrices and to be zero mean and orthogonal:
[TABLE]
where , , for every , . We note that the above assumptions are a bit restrictive. Given the perturbation signals (21) and (22) in sub-phases and , the sufficient conditions for uniqueness of for any are for any ; however, we use (23), (24) for convenience, namely, to obtain compact expression for without loosing generality. Replacing (21) and (22) in (20) and using assumptions (23), (24), we derive :
Proposition 2**.**
The local estimators of are given by:
[TABLE]
for any ; for notational brevity, we used , , and .
Proof.
See Appendix C. ∎
By the end of the training epoch, the th controller has a local copy of the -phase measurement matrix ; if the sufficient excitation conditions (15), (16) hold, then is sufficient to estimate . Formulating an ML estimation problem using requires knowledge of the pdf ; however, obtaining the closed from expression is tedious since (25) involves ratios of non-zero Gaussian random variables. Therefore, we derive Gaussian approximation for the pdf of based on first-order perturbation-theoretic approach (see supplementary material). Using Neumann series expansion, we get:
[TABLE]
The covariance matrix can be computed via the first-order approximation (see Appendix D) and is given by:
[TABLE]
The approximation is valid for sub-phase signals satisfying . In practice, this is expected to be satisfied as the probability that is negative or larger than is negligible. In light of this, one can easily verify that the Gaussian approximation converges to the true distribution of in the limit . Expression (27) also captures the effect of the -phase and the transmission schemes we adopted there on the uncertainty in the local copies ; specifically, the initial uncertainty in , represented with the first term in (27), increases due to (1) measurement noise in sub-phase (second term) and, (2) the uncertainty in the channel estimates induced in sub-phase (third term).
V-D Joint System Identification and State Estimation
By the end of the training epoch, the th controller has and the -phase measurement vectors and . The reference voltage training matrix is deterministic, so can still be useful when formulating the estimation problem. On the other hand, the training matrix in sub-phase is modulated with -phase measurements; since controller knows only the noisy copy , it is impossible to reconstruct perfectly which makes of no further use. The optimal ML that uses all available information should be defined over an augmented vector, comprising and . Including increases the dimensionality of the problem, but the numerical investigations indicate that it does not yield any practically significant performance gain. We therefore omit from the ML for clarity of exposition.
The relation between the steady state bus voltages and the parameter vector is defined implicitly in Proposition 1; therefore, we define a joint system identification and state estimation (J-SISE) problem via constrained ML estimation [43, 45]. We introduce the joint parameter/state vector:
[TABLE]
We define as the globally optimal solution to:
[TABLE]
formulated w.r.t. the true distribution of . The problem (29) is neither convex nor concave due to the quadratic nature of the constrains that contain bilinear terms in the decision variables. Since is sufficiently differentiable in , the constrained optimization problem (29) can be restated as an unconstrained one using the Lagrange method of multipliers [46]. Using the Gaussian approximation (26) and applying the Karush-Kuhn-Tucker (KKT) conditions yields a non-linear system of equations. Using the result of the following proposition, we propose Algorithm 1 based on partially linearized constraints to solve the system iteratively [45]. Specifically, denote in the -th iteration and let:
[TABLE]
be the linear approximation of around . The Jacobians , are evaluated in . We obtain the following result:
Proposition 3**.**
If the sufficient excitation conditions (15), (16) are satisfied in , the global solution to (29) after substituting the power balance constraint with (30) is given by:
[TABLE]
Proof.
See Appendix E. ∎
The algorithm starts with an initial guess . Then, we apply Proposition 3 iteratively; the solutions (31), (32) in each iteration serve as an input for the next iteration until convergence. In order to apply Algorithm 1, controller should know the covariance matrix up to a scaling factor, i.e., knowledge of the noise variance is not necessary. To ensure fast convergence, we propose the following initialization: once is locally available, a reasonable initial estimate of the state can be obtained via eq. (13):
[TABLE]
Then, we evaluate in and solve (17) for :
[TABLE]
where is the -th column of . It can be easily verified that satisfies the KKT conditions and is a stationary point of the objective in (29). Section VII shows that (34) is unbiased but not efficient estimator of . In this regard, Algorithm 1 serves to refine the initial estimate and further reduce its covariance.
V-E Performance
The Mean Squared Error matrix of the unbiased estimator of is defined as:
[TABLE]
and are defined analogously. In stead of deriving the MSE matrix directly, we use the CRLB inequality to bound it and derive an approximate lower bound using the Gaussian approximation (26). Referring to the optimization problem (29), a straightforward way to bound is to use the constrained CRLB [47]. Let denote the matrix whose columns form the orthonormal basis for the null space of the Jacobian . Then, can be bounded as follows [47]:
[TABLE]
where denote all-zero matrices of adequate dimensions. is is computed numerically, as it is unavailable in closed form. To bound the MSE matrices of and separately, we need to perform numerical block inversion of the right-hand side of (36); the following proposition gives alternative and simpler closed form expressions for these bounds:
Proposition 4**.**
The MSE matrices and can be bounded from below as follows:
[TABLE]
where denotes the Fisher Information Matrix of .
Proof.
See Appendix F. ∎
The expressions (37) and (38) can be verified to be asymptotically tight; it can be shown that if Algorithm 1 converges to the global optimum, the MSE matrix is of the same analytical form as (37) and (38), but evaluated at . Conversely, expressions (37) and (38) prove the asymptotic efficiency of Algorithm 1.
V-F Discussion
We take a closer look on few crucial aspects that set the applicability boundaries of the proposed method. We consider the sufficient excitation conditions, outlined in subsection V-B; they provide guidelines for designing the training sequences and they determine the overall duration of the training epoch. A straightforward way to guarantee (15), (16) is to ensure that and and/or . The minimal duration of the -phase is determined by the conditions for uniqueness of , such that the total duration of the training epoch (in slots) in a system with DERs is lower bounded as:
[TABLE]
The lower bound on can be attained by random training sequences. Alternatively, when using deterministic codes such as orthogonal Walsh-Hadamard sequences, meeting the rank conditions and, possibly additional conditions such as (23), (24) might require more time slots than .
The frequency of the training epoch should match the requirements of the upper layer application. If the application runs periodically, then the training epoch should be invoked in each period, preferably at the beginning, while in event-triggered applications, the training epoch should be invoked whenever the application is triggered. While the frequencies should be equal, the total duration (in seconds) is expected to constitute only a fraction of the average time between two consecutive application runs. Then, we have the following upper bound on the slot duration:
[TABLE]
where is obtained by fixed and .
Further, since the proposed method is developed in classical estimation framework, each controller requires perfect knowledge of the training matrices , , and . This means that the training matrices should be designed a priori, delivered to the controllers and kept fixed afterward (via hard-coding for instance). Relaxing this condition requires adequate modifications of the problem formulation, which is out of the paper’s scope. For instance, if no prior knowledge is available, we have no choice but to model the training matrices are deterministic unknowns and modify the definition of the parameter vector to include them.
The method can only identify buses that host at least one DER whose primary controller engages in decentralized training. In other words, buses that host only loads are unidentifiable. However, we can still apply the method in MGs with (potentially many) load buses; in this case, the method identifies the Kron-reduced conductance matrix, which is obtained by isolating the DER buses in the original network and applying block inversion on the original conductance matrix. Analyzing the structure of the Kron-reduced conductance matrix, the controllers might be able to deduce some information on the original conductance matrix, see [48].
VI Decentralized OED via Training
We illustrate the practical potential of the proposed system identification method by applying it in decentralized OED (DOED) as the most common upper layer application in power systems. In OED, each DER is assigned a monotonic and convex cost function that determines the cost of the output power of DER . The aim of the OED is to find the optimal local output powers, referred to as optimal dispatch policies that minimize the total cost such that the total load demand is balanced and the box constraints on the output powers are satisfied:
[TABLE]
where , and . Distributed MGs with small-scale DERs typically use linear cost functions [17]. Hence, we adopt where is the constant marginal cost of the th DER per unit of injected/stored power. Without loss of generality, the costs are ordered as , which divides the DERs in several ordered cost groups based on the marginal costs. The optimal solution to (41) is the following decentralized program:
[TABLE]
for any (see also [17, 35]). Specifically, the total load demand is first filled with the capacities of the DERs from the cheapest cost groups, until the third condition in (45) is met. Then, the DERs from the cost group that meets this condition share the remaining net load demand proportionally to their local capacities while the DERs from the remaining, most expensive cost groups do not inject power. The DERs that satisfy the first condition in (45) are operated at a constant power (at capacity) and their local controllers are configured in CSC mode (forming the subset ), whereas the DERs that satisfy the third condition have flexible power outputs and their local controllers are configured in VSC mode, tuned for proportional power sharing (forming ).
Knowing , specifically and , is sufficient for implementing the decentralized program (45). We design a OED protocol in which the controllers utilize decentralized training and Algorithm 1 to acquire the information necessary to execute (45). Fig. 8 illustrates the temporal organization of the protocol. The OED typically runs periodically, every minutes depending on the average rate of change of and/or [3, 17]. Therefore, we (i) divide the time axis into periodic OED epochs, each of of duration , and (ii) assume that changes independently at the beginning of and OED epoch and remains fixed throughout the epoch [17]. In each epoch, the DERs locally run the program (45) using up-to-date information about the generation capacities and load demands. To obtain this information, a fraction of the total duration of the OED epoch is allocated for decentralized training, see Fig. 8. The OED epoch is split into a training epoch of duration and an optimal operation epoch of total duration . In the training epoch, the DER controllers perform decentralized training and estimation as described in Sections IV and V. At the end of the training epoch, controller obtains , used at the beginning of the optimal operation epoch to determine the local dispatch policy , i.e., to determine which condition in (45) is satisfied locally. Hence, each DER individually decides its primary control configuration via (45) using and configures the local controller accordingly, forming the subsets and . We use to denote that (45) is solved using .
Implicit in the derivation of the decentralized program (45) is the assumption that the MG is balanced . However, the stochastic renewable generation might sometimes violate the balance condition. Moreover, due to estimation errors in , the resulting dispatch policies will in general differ from , attainable only when is known perfectly; hence, in general. This leads to slightly suboptimal MG operation, but it might also violate the balance condition even when satisfy it. This results in loss of voltage regulation as the bus voltage quickly (i) drops towards the lower margin when the net load demand is positive or (ii) rises towards the upper margin when the net-load demand is negative . Clearly, additional generation/storage capacity is necessary to balance the remaining demand. We employ a solution based on classical DC bus signaling, where a backup source/storage is activated if the bus voltage crosses certain thresholds [12, 13]. The marginal costs of the backups are denoted with per unit generated/stored power; these values are always larger than the largest marginal cost among the DERs in , i.e., . In normal operating conditions, the MG is balanced, the backups are not active, and the bus voltage is regulated by the DERs in , using the droop control law (6) with parameters:
[TABLE]
dimensioned to maintain the bus voltages in a tight region around the rated voltage , i.e., in the interval with being a small positive number. If the bus voltage drops below , it signals power deficit and the backup source is activated and configured in droop-controlled VSC mode, using (6) with parameters set as:
[TABLE]
maintaining the bus voltages in . Conversely, if the voltage rises above , it signals power surplus and the storage is activated and also configured in droop-controlled VSC mode, using (6) with parameters set as:
[TABLE]
maintaining the bus voltages in . Fig. 9 summarizes the complete operational dynamics of the proposed system on a single diagram. Note that installing backup generation/storage is standard practice when dimensioning standalone systems [3, 12, 13]. In grid-connected systems, the grid can be used as backup, effectively acting as ideal voltage source with infinite generation/storage capacity [3].
VII Evaluation
VII-A General Simulation Description and Design Parameters
Table I summarizes the numerical values of the simulation parameters that remain fixed in all simulation studies; the values of the remaining parameters are provided in the captions of the respective plots. We consider a line, i.e., cut-ring distribution network topology, where all buses are connected to two other buses except for buses and that are connected to a single bus each. As it is a regular practice for any power system, the MG is dimensioned to operate over a range of load demands. For simplicity, we use for any (“” stands for either “a”, “c” or “p”); similarly, for any , see Table I.
The measurement noise variance after averaging samples per slot, see Fig. 5, can be computed as:
[TABLE]
where is the noise variance of the PECs’ ADCs [41].
The number of slots in the -phase for fixed and , see Table I, is determined from the total number of slots which is also fixed:
[TABLE]
The perturbation signals are set as (see also Fig. 10):
[TABLE]
The binary sequences are formed by tossing a fair coin for any . This is done a priori, i.e., binary Bernoulli sequences of length are generated, confirmed to satisfy (15), (16) and stored. The droop slope perturbation laws (52) ensure that the bus voltages will not drop below or rise above as long as , see Fig. 10. The reference voltage training sequences in sub-phase and block in sub-phase have fixed length of slots and are set as:
[TABLE]
Hence, . We also fix and , where are set to keep the reference voltage deviation amplitudes in the -phase relatively small, ensuring that the model (20) is valid for any .
The performance of J-SISE w.r.t. the MSE and the performance of DOED w.r.t. the cost, are determined by the configuration of the training epoch, which in turn is determined by variety of factors such as slot duration, number of slots, nominal droop control parameters, training matrices and deviation amplitudes. With all specifications listed above and in Table I, most of these factors are kept fixed in our evaluations and the design parameters of the training epoch are the slot duration and the reference voltage deviation amplitude . Next, we evaluate the performance of the J-SISE in terms of the design parameters and show how to find their optimal values w.r.t. DOED.
VII-B J-SISE Performance
First, we investigate the performance, the scalability and the convergence properties of Algorithm 1 w.r.t. from the perspective of controller and compare it against CRLB. We fix the generation capacities of all DERs to have equal values, i.e., and we do the same with the load components and the line conductances for all . We use the Relative Root Mean Squared Error (RRMSE) metric, derived from the MSE matrix as follows:
[TABLE]
To evaluate the MSE matrix, we use statistical average of individual MSE matrices, obtained for different realizations of the noise matrix . “” in the above definition stands for either the full vector or its constituent vectors, i.e., , or ; in either case, the RRMSE is interpreted as the standard deviation of the estimation error per component of the vector that is used as argument. Note that, when applied to a constituent vector of , we plug the diagonal block of the MSE matrix corresponding to that particular constituent vector. To compute the corresponding lower bound on the RRMSE, we use the CRLB matrix in (54) instead of the MSE matrix.
We focus particularly on the RRMSE as function of , since RRMSE decreases linearly with in the log-domain, see eq. (49)). Fig. 11 depicts the performance of J-SISE for each of the constituent vectors of , i.e., , and against the corresponding lower bounds, for DERs. We have evaluated the lower bounds using both, the constrained CRLB (36) and expression (37) from Proposition 4, and they both yield numerically identical results. Empty markers correspond to the initial estimate that initializes Algorithm 1, obtained via (34), while filled markers correspond to after Algorithm 1 converges. As expected, J-SISE is efficient and attains the CRLB as increases, except for values very close to ; here, the RRMSE hits a turning point, after which it increases sharply as a result of the fact that when , the droop slope grows arbitrarily large and the virtual resistance . Hence, the controller starts to behave as an ideal voltage source with infinite capacity, pushing the bus voltages to a fixed value and making the MG insusceptible to reference voltage perturbations.
We further observe that the generation capacities, Fig. 11LABEL:sub@results1a, and the line conductances, Fig. 11LABEL:sub@results1c, can be identified with very high precision (less than of the true value). In contrast, the RRMSE of the load demands of individual components, Fig. 11LABEL:sub@results1b, is several orders of magnitude higher. We conclude that, identifying the individual components of the loads with satisfactory performance might require excessive (even prohibitive) training epoch durations to suppress the noise. However, in many upper layer applications, detailed knowledge on the individual load component demands is not necessary and knowing only the total bus demand is sufficient [17, 35]; in such case, an estimate of the total load demand vector , comprising the total demands at each bus, can be obtained from via . Fig. 11LABEL:sub@results1b shows that can be identified with a precision comparable to the one achieved for the generation capacities and line conductances.
The improvement of w.r.t. given with (34), is also evident, clearly showing that the initial estimate is not efficient. The numerical results (not shown here due to space limitations) show that the average of converges to zero vector asymptotically. We conclude that the initial estimate is indeed unbiased estimator of and can be still used in practice even though it is not efficient, particularly, when is of the same order as/smaller than or for small . In the first case, Algorithm 1 does not converge, see Fig. 11, and remains as the only reasonable choice. The second case can be more clearly observed in Fig. 12 that investigates the performance of the framework for increasing number of buses; we see that for small number of buses (e.g. ), the RRMSE of the initial estimate approaches the CRLB; in such case, the gain from applying Algorithm 1 is marginal, and is sufficient for all practical purposes.
From Fig. 12, we also observe that, the performance of J-SISE tends to deteriorate as the number of buses increases, which is expected due to the increase of . A straightforward way to improve the performance of Algorithm 1 and make the estimation error arbitrarily small for large , is to increase . However, note that (29) treats the vector as full vector, when in fact it may be sparse, containing many zero entries. This might prove to be problematic as the size of the MG scales, i.e., as the number of buses increases since larger distribution systems are significantly sparser [39], so estimating as if it is full vector might lead to performance degradation [49]. So, an appropriate way to improve the performance when is large (which is out of the scope of this work) is to modify (29) by adding sparsity constraint on and apply a common relaxation method [49].
Finally, we comment on the convergence speed of Algorithm 1; in all tested cases, that is for , Algorithm 1 converges already after iterations. This remarkable result can be mainly attributed to the fact that the initial estimates , , given with eq. (33), (34), respectively, form a stationary point of the optimization problem (29) (see subsection V-D). The additional fact that they are also (asymptotically) unbiased, implies that must lie in a neighborhood around , possibly being an inflection point from which it can easily converge to the global optimum only after several iterations.
VII-C Optimizing the Cost Trade-off in DOED
The results presented in the previous subsection do not consider (i) the effect that the estimation error has on the upper layer applications, and (ii) the effect that the power dissipation during training has on the overall performance of the MG. In other words, improving the performance of J-SISE, which is desirable from the perspective of the upper layer application, comes at the “price” of increased power dissipation during training, either by using large perturbation amplitudes or long slot durations, which in turn compromises the performance of the upper layer control application. This leads to a fundamental trade-off between the performance of J-SISE, which is determined by the configuration of the training epoch, and the performance of the application. Our goal is to (i) show how to characterize this trade-off via utility function that jointly captures the performances of J-SISE and the upper layer application, and (ii) provide guidelines on how to design optimal training epochs, namely, how to choose and such that the utility function is optimized.
As a case study, we take the DOED protocol, described in subsection VI, noting that the approach described below can be applied to any upper layer application. The performance of specific DOED policy vectors is assessed via the cost . The cost of the optimal policy is with denoting any extra cost entailed by activating backups in case the MG is unbalanced. is in fact the minimal cost, attainable only when and are perfectly known to each controller. However, when running the DOED protocol using the estimated parameter vector, see subsection VI, the cost of the resulting dispatch policy vector should also account for (i) the fact that , i.e., the DOED policy is, in general, suboptimal, (ii) the fact that is valid only in the optimal operation epoch within the OED epoch, and (iii) the power dissipation incurred in the training epoch. We denote this cost with and we write:
[TABLE]
where the matrix is defined as and is the output power of DER in slot . The first term corresponds to the cost of training, whereas the second gives the actual cost of . We define the Relative Cost Increase (RCI) , relative to the optimal cost :
[TABLE]
The RCI can be interpreted as a measure of the additional monetary charge that the the community served by the MG will be subjected to when operating autonomously using the proposed DOED protocol, without any access to external communication enabler. We observe that is a random variable whose pdf is parametrized w.r.t fixed . In practice, it is desirable to optimize the performance of the upper layer application over the range of , which the MG is foreseen to operate in. Therefore, we choose the average RCI, denoted by and computed as an average of over , to be the utility function for the DOED. The aim is to find the optimal training epoch configuration parameters, namely, and , that minimize the average RCI:
[TABLE]
Computing in closed form is far from trivial; therefore, we resort to Monte-Carlo simulation, run the DOED protocol for different values of and use the statistical average of the individual RCIs as an estimate of . In each trial, is generated independently from the uniform distribution, i.e., , , where , are given in Table I; note that we keep the line conductances fixed to as the topology changes very infrequently compared to the generation and the load.
Rewriting , where is the output power of DER corresponding to the nominal droop parameters, the time average of the power dissipation . We conclude that with (56) and linear OED cost function, it is difficult to asses the impact of power dissipation during training. Therefore, we introduce a quadratically-modified RCI (QRCI), denoted with :
[TABLE]
where the matrix is defined as and . In similar way as , we define the average QRCI, denoted with and restate the optimization problem (57) with as utility function.
The results are presented in Fig. 13. We observe that within the investigated domain, the average RCI, see Fig. 13a is a convex function of and . Specifically, for fixed , decrease as increases due to the effect of noise suppression, see (49). In this regime, the duration of the training epoch is still very short relative to , such that the first term in (55) is negligible and the RCI is dominated by the second term which decreases towards as the estimation error is reduced. However, hits a turning point when and, consequently, the duration of the training epoch become long enough such that the first term in (55) starts to dominate over the second; after this, it makes no sense to keep increasing as will also increase. Conversely, for fixed , decreases as increases until it hits the turning point after which it starts to increase quickly; evidently, this is happening when we get very close to . As discussed in the previous subsection, the performance of J-SISE starts to deteriorate when , pushing the second term in (55) away from its lower bound . Hence, within the domain of interest, the average RCI for an MG, specified in Table I and the caption of Fig. 13a, is minimized when volts and milliseconds. The minimized average RCI is ; in other words, the average increase of the cost is less than of the optimal cost . This increase, besides being completely tolerable by the OED [3], it is also comparable to the additional operating cost charges imposed by mobile operators when employing wireless cellular solution not including the cost of installing dedicated communication hardware [3, 17].
Similarly as the average RCI, the average QRCI, see Fig. 13b, is also a convex function of and within the investigated domain with behaviour governed by the same reasoning we used on the average RCI. However, the minimum this time moves closer to the down-left corner due to the second term in (58). Specifically, is minimized when volts and milliseconds with average RCI , i.e., still around of .
VIII Concluding Remarks
We introduced autonomous system identification solution, based on temporary primary control perturbations and iterative ML-based algorithm for DC MGs and without access to an external communication system. The method is implemented in a decentralized manner within the primary droop controllers of the PECs and enables the controllers to learn i) the generation capacities of power sources, ii) the load demands, and iii) distribution network topology using only local bus voltage measurements. The key enabling tool is the decentralized training where the primary controllers inject small, amplitude-modulated training sequences that complete the rank of the estimation problem and enable regaining full system observability. We evaluated the performance of the ML-based algorithm, showing that we can achieve high reliability in DC MGs of small to moderate size (). Then, we showcased the potential of the solution in fully decentralized OED where the controllers perform training periodically and reconfigure according to the locally estimated information. Last but not least, we illustrated an elaborate methodology for designing training epochs that optimize the operational cost of an autonomous DC MG.
Although we focused on DC MGs and we used several assumptions that simplified the developments, the same design principles introduced in this paper can be applied to any cyber-physical system with dual-layer control architecture that does not not have access to external communication resources, under broader circumstances. Such investigations are part of our on-going and future work.
Appendix A Proof of Proposition I
The power balance condition in each slot states that:
[TABLE]
Recall that during training all DERs are in droop-controlled VSC mode configured for proportional power sharing. Hence for any . In such case (59) can be rewritten as:
[TABLE]
Let be defined as ; we get:
[TABLE]
where , and , defined as , and , represent the -th rows of , and , respectively. Stacking vertically for each , we get the power balance matrix :
[TABLE]
yielding the compact form (12) which completes the derivation.
Appendix B is not identifiable when the system is not observable
We consider the following situation: controller knows only and knows and completely. In other words, the controllers do not exchange any local steady state voltage measurements as in the proposed solution, i.e., the -phase training matrices are completely deterministic and known. Hence, all other columns are not observable. Since the power balance equation concerning the observable voltages also includes and depends on (as a result of the fact that the buses are connected through ) and if classical, non-Bayesian framework is employed (without exploiting any prior knowledge), should be treated as unknown parameters in the same way as the generation capacities, load demands and line conductances. Therefore, the parameter vector should be redefined as:
[TABLE]
with
[TABLE]
The sufficient excitation conditions in this case should be restated in term of since only is observable; we get:
[TABLE]
where and are the Jacobians of w.r.t. and , respectively. It becomes immediately evident that is a fat matrix, i.e., with column rank at most ; hence, the first sufficient excitation condition is not satisfied and cannot be uniquely identified.
Equivalently, one can look at the same problem from the perspective of the constrained ML optimization. Namely, the joint parameter/state vector now is:
[TABLE]
The constrained ML optimization problem should be formulated over since only is observable:
[TABLE]
Clearly, the number of linearly independent equality constraints is at most , yielding an ill-conditioned optimization problem that does not converge to any meaningful solution.
Appendix C Proof of Proposition II
Controller derives the channel estimator using the measurement vector from sub-phase , i.e., . Replacing in the linear model
[TABLE]
we get:
[TABLE]
Using the above, is obtained by solving the linear least squares problem:
[TABLE]
Using , (69) can be rewritten as:
[TABLE]
where we used the commutative property of the product . Note that is the -th row of the -phase measurement matrix and contains the data transmitted by the controllers in block . Using the the channel estimate, controller obtains a local copy of , denoted with by solving the following linear least squares problem:
[TABLE]
Note that ; so we get:
[TABLE]
. Vectorizing the above, we obtain:
[TABLE]
which completes the derivation.
Appendix D Derivation of the Gaussian approximation of
Let where . Similarly, where for . Then, (25) can be written as:
[TABLE]
where follows from the Neumann expansion valid on the subset . From (85), we see that can be approximated with Gaussian random vector with mean:
[TABLE]
and covariance matrix:
[TABLE]
Appendix E Proof of Proposition III
The Lagrange method of multipliers casts the original constrained ML problem into an unconstrained as follows:
[TABLE]
where we used the Gaussian approximation for the pdf . is vector of multipliers. Applying the KKT conditions to (91) after replacing the power balance constraint with its first order approximation, we get the following system of equations:
[TABLE]
The above system is linear in and can be solved efficiently; the derivation of the solution follows similar steps as in (LABEL:31). Multiplying (92) with yields:
[TABLE]
which is substituted in (94) to yield:
[TABLE]
Solving for gives:
[TABLE]
Multiplying (97) with on both sides, gives:
[TABLE]
which, after replacing and solving for gives (31). Finally, replacing (97) in (92) and solving for produces (32), completing the proof.
Appendix F Proof of Proposition IV
Recall that the implicit function theorem governs the existence of an explicit solution of the system of power balance equations of the following form:
[TABLE]
Hence, again by the implicit function theorem, the solution of the -phase power balance equation exists and can be written in the following form:
[TABLE]
where the matrix is defined as . If is available in closed form, the -phase measurement matrix (i.e., its vectorization) can be written explicitly in terms of as:
[TABLE]
Using the above, we derive the CRLB. In particular, the MSE matrix of can be bounded from below as:
[TABLE]
where is the Fisher Information Matrix (FIM) defined as:
[TABLE]
Using the Gaussian approximation for the pdf of , the FIM can be approximated with the following Grammian:
[TABLE]
Applying the implicit function theorem, we obtain the following expression for the Jacobian :
[TABLE]
Substituting the above in (104) gives expression (37). To bound the MSE matrix of , we use (100), i.e., the fact that is a transformed version of and apply the corresponding CRLB formula, i.e.:
[TABLE]
completing the proof.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. H. Lasseter and P. Paigi, “Microgrid: a conceptual solution,” in 2004 IEEE 35th Annual Power Electronics Specialists Conference (IEEE Cat. No.04CH 37551) , vol. 6, June 2004, pp. 4285–4290 Vol.6.
- 2[2] L. E. Zubieta, “Are microgrids the future of energy?: Dc microgrids from concept to demonstration to deployment,” IEEE Electrification Magazine , vol. 4, no. 2, pp. 37–44, June 2016.
- 3[3] T. Dragicevic, X. Lu, J. C. Vasquez, and J. M. Guerrero, “Dc microgrids; part i: A review of control strategies and stabilization techniques,” IEEE Transactions on Power Electronics , vol. 31, no. 7, pp. 4876–4891, July 2016.
- 4[4] T. Dragičević, X. Lu, J. C. Vasquez, and J. M. Guerrero, “Dc microgrids; part ii: A review of power architectures, applications, and standardization issues,” IEEE Transactions on Power Electronics , vol. 31, no. 5, pp. 3528–3549, May 2016.
- 5[5] L. Strenge, H. Kirchhoff, G. L. Ndow, and F. Hellmann, “Stability of meshed dc microgrids using probabilistic analysis,” in 2017 IEEE Second International Conference on DC Microgrids (ICDCM) , June 2017, pp. 175–180.
- 6[6] C. Marnay, S. Lanzisera, M. Stadler, and J. Lai, “Building scale dc microgrids,” in 2012 IEEE Energytech , May 2012, pp. 1–5.
- 7[7] D. Zhang, J. Jiang, L. Y. Wang, and W. Zhang, “Robust and scalable management of power networks in dual-source trolleybus systems: A consensus control framework,” IEEE Transactions on Intelligent Transportation Systems , vol. 17, no. 4, pp. 1029–1038, April 2016.
- 8[8] M. A. Masrur, A. G. Skowronska, J. Hancock, S. W. Kolhoff, D. Z. Mc Grew, J. C. Vandiver, and J. Gatherer, “Military-based vehicle-to-grid and vehicle-to-vehicle microgrid; system architecture and implementation,” IEEE Transactions on Transportation Electrification , vol. 4, no. 1, pp. 157–171, March 2018.
