Abstractions of linear dynamic networks for input selection in local module identification
Harm H.M. Weerts, Jonas Linder, Martin Enqvist, Paul M.J. Van den, Hof

TL;DR
This paper presents a generalized method for abstracting linear dynamic networks, enabling effective input selection for local module identification by ensuring certain modules remain invariant under abstraction.
Contribution
It introduces a new abstraction method that generalizes existing algorithms and provides conditions for invariant modules, aiding targeted network identification.
Findings
The method generalizes previous algorithms like immersion and indirect inputs.
Conditions are derived for selected signals to keep specific modules invariant.
The approach facilitates accurate local module estimation through optimal input selection.
Abstract
In abstractions of linear dynamic networks, selected node signals are removed from the network, while keeping the remaining node signals invariant. The topology and link dynamics, or modules, of an abstracted network will generally be changed compared to the original network. Abstractions of dynamic networks can be used to select an appropriate set of node signals that are to be measured, on the basis of which a particular local module can be estimated. A method is introduced for network abstraction that generalizes previously introduced algorithms, as e.g. immersion and the method of indirect inputs. For this abstraction method it is shown under which conditions on the selected signals a particular module will remain invariant. This leads to sets of conditions on selected measured node variables that allow identification of the target module.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Abstractions of linear dynamic networks for input selection in local module
identification
Harm H.M. Weerts
Jonas Linder
Martin Enqvist
Paul M.J. Van den Hof
Control Systems Group, Department of Electrical Engineering, Eindhoven University of Technology, The Netherlands (email: [email protected], [email protected])
ABB Corporate Research, Västerås, Sweden (e-mail: [email protected])
Division of Automatic Control, Linköping University, Sweden (e-mail: [email protected])
Abstract
In abstractions of linear dynamic networks, selected node signals are removed from the network, while keeping the remaining node signals invariant. The topology and link dynamics, or modules, of an abstracted network will generally be changed compared to the original network. Abstractions of dynamic networks can be used to select an appropriate set of node signals that are to be measured, on the basis of which a particular local module can be estimated. A method is introduced for network abstraction that generalizes previously introduced algorithms, as e.g. immersion and the method of indirect inputs. For this abstraction method it is shown under which conditions on the selected signals a particular module will remain invariant. This leads to sets of conditions on selected measured node variables that allow identification of the target module.
keywords:
Dynamic networks, system identification, closed-loop identification, graph theory.
††thanks: Paper accepted for publication in Automatica. Submitted 2 January 2019, revised 12 September 2019, final version 16 March 2020. This project has received funding from the European Research Council (ERC), Advanced Research Grant SYSDYNET, under the European Union’s Horizon 2020 research and innovation programme (Grant Agreement No. 694504), and the Vinnova Industry Excellence Center LINK-SIC project number 2007-02224.
, , and
1 Introduction
In current and future engineering systems, as well as in many biological and biomedical systems, large scale interconnected dynamical systems become a prime modelling target for analysis and control. The resulting dynamic networks that represent these interconnected systems, are studied from many different perspectives. In this paper our main motivation is directed towards methods and tools for data-driven modelling of (parts of) a dynamic network, extending identification methods to be able to deal with linear dynamic networks.
In network identification literature, typically three main objectives can be distinguished. One objective is to perform identification of both the topology and dynamics, see for example Sanandaji et al. (2011); Chiuso and Pillonetto (2012); Materassi and Salapaka (2012); Zorzi and Chiuso (2017). In situations where there is prior knowledge of the topology, the objective can be to identify all dynamic modules, as is done for example in Gonçalves and Warnick (2008); Yuan et al. (2011); Weerts et al. (2018c), including addressing the aspect of network identifiability, (Weerts et al., 2018b; Hendrickx et al., 2019). A third possible objective is the identification of the dynamics of a single local element, or module, in the network, when the topology of the network is known. This has been addressed in Van den Hof et al. (2013), by extending classical closed-loop prediction error methods, known as the direct method, two-stage method and joint-io method, to the situation of dynamic networks. The reasoning has been extended in Dankers et al. (2015) to deal with signal measurements that are affected by sensor noise. Additional extensions concern identifiability aspects of local modules (Bazanella et al., 2017; Hendrickx et al., 2019; Gevers et al., 2018; Weerts et al., 2018a), and combination of the prediction error method with Bayesian estimation in Everitt et al. (2018); Ramaswamy et al. (2018).
In this paper we will elaborate on this local module identification problem, and focus on the selection of signals to be measured in the network, to allow a particular module to be identified. While in general it may be attractive to have all node signals in a network available from measurements, in practice it may be costly or even impossible to measure some of the nodes. Then, a relevant question is which set of measured node signals is sufficient for single module identification. This problem has been addressed in several different ways.
- •
In Dankers et al. (2016) the module of interest is identified in a multi-input-single-output setting with either a direct method or a two-stage method. Signals are selected by removing non-selected signals from the network through an elimination procedure called immersion, while requiring that the target module remains invariant. This allows a consistent identification of the target module dynamics under appropriate excitation and disturbance conditions. A systematic way of selecting the measured node signals is provided. The selection method is extended in Dankers et al. (2017) to handle situations of confounding variables being caused by disturbances that are spatially correlated over the different node signals. This is particularly of interest when applying the direct method, aiming at maximum likelihood results.
- •
The approach in Linder (2017); Linder and Enqvist (2017b, a) is also in a multi-input-single-output setting, but with an instrumental variable identification method. It follows the same philosophy as the one in Dankers et al. (2016), but applies a different elimination procedure, referred to as the indirect inputs method, and thus results in a different set of conditions on selected measured node signals.
- •
In Materassi and Salapaka (2015, 2019), selection of node signals is being done on the basis of graphical models, while Wiener filters are being used for dynamic model reconstruction, and no reference excitations are present. This approach can be characterized as an indirect approach where several dynamic objects are identified from data, after which the target module is reconstructed.
- •
The approach in Bazanella et al. (2017); Hendrickx et al. (2019) is also based on indirect identification for the situation that all nodes are excited by external excitations. Here an identifiability analysis is made in order to verify whether the module of interest can be recovered uniquely from a set of estimated transfer functions from reference signals to a set of selected node signals.
Since all algorithms provide only sufficient conditions for arriving at a set of to-be-measured node signals, it is attractive to obtain results that fully characterize the degrees of freedom that are available in selecting node signals that allow appropriate module estimates.
In this paper we are going to adopt the strategy of the first two approaches, eliminating node variables to arrive at a so-called abstracted network, while keeping the dynamics of the target module invariant. After abstraction, the target module can be estimated on the basis of the node variables that are retained in the abstraction. While both Dankers et al. (2016) and Linder and Enqvist (2017b) have employed a particular way of abstracting networks, e.g. through network immersion (Kron reduction) or through the so-called indirect inputs method, here we develop a more general notion of abstraction that generalizes the two earlier approaches, and provides a higher flexibility in the selection mechanism of choosing which nodes to measure/retain and which nodes to discard/abstract in a multi-input single-output identification setup for estimating the target module.
In order to develop this generalized abstraction algorithm, we use the fundamental property that the network representation is not a unique representation of the behavior of the network. When manipulating the network equations, different representations can be obtained leading to different identification setups for estimating the target module. This freedom will be exploited for developing generalized conditions for selecting the node signals that are used as inputs in an identification setup for estimating the target module.
The paper is organized as follows. After defining the basis network setup in Section 2, in Section 3 the non-uniqueness of the network representations is characterized. This is exploited in Section 4 to arrive at a generalized abstraction algorithm, which is illustrated by examples in Section 5. In Section 6 it is specified under which conditions the target module remains invariant, and consequences for the identification setup and the selection of node signals are presented in Section 7. The proofs of all results are collected in the Appendix.
2 Dynamic network definition
In this section a dynamic network model is formulated on the basis of the setup in Van den Hof et al. (2013). A dynamic network consists of scalar internal variables or nodes , , and external variables , . Each node is a basic building block of the network and is described as:
[TABLE]
where is the delay operator, i.e. ;
- •
are proper rational transfer functions that are referred to as modules in the network;
- •
There are no self-loops in the network, i.e. nodes are not directly connected to themselves ;111Since are rational transfer functions, this does not limit the dynamical description of .
- •
are generated by the external variables via
[TABLE]
where can directly be manipulated by the user, and are proper rational transfer functions;
- •
is process noise, where the vector process is modeled as a stationary stochastic process with rational spectral density, such that there exists an -dimensional white noise process , with covariance matrix such that
[TABLE]
with a proper rational transfer function.
By combining the node signals, the network expression
[TABLE]
is obtained, where the zeros are due to absence of self-loops. In matrix notation the dynamic network is represented as
[TABLE]
A set notation is introduced for notational convenience. Let the sets and each contain a number of node indices, then denotes the vector of node signals consisting of all . Similarly, is the matrix of transfer functions that contains all modules .
The transfer function that maps the external signals and into the node signals is denoted by:
[TABLE]
with
[TABLE]
This is also known as the open-loop response of the network corresponding with
[TABLE]
where noise component is defined by
[TABLE]
with power spectral density
[TABLE]
Some notions from graph theory will be used in the dynamic network. Modules form the interconnections / links between nodes. A node is said to be an in-neighbor of node if , and then is said to be an out-neighbor of node . A path in a network is a sequence of interconnected nodes, more precisely there exists a path through nodes if
[TABLE]
A loop is a path where .
A dynamic network model is then formally defined in the following way.
Definition 1** (dynamic network model).**
A network model of a network with nodes, and external excitation signals, with a noise process of rank is defined by the quadruple:
[TABLE]
with
- •
, diagonal entries 0;
- •
;
- •
, monic, proper, stable, with a stable inverse;
- •
, ;
- •
The network is well-posed (Dankers, 2014), implying that exists. Additionally we require that (I-G)^{-1}{\color[rgb]{0,0,0}R} is proper and stable.
Remark 2**.**
In Definition 1 we do not require modules to be proper, while in (1) we do require properness. Real systems in practice are reflected by (1), and these will typically have proper modules. In this paper we will use the freedom to allow network models to have non-proper modules. For this reason we define a representation of these networks that allows modules to be non-proper in Definition 1. Note that for a network model representation as in Definition 1, we require to be proper and stable, while the noise transfer is allowed to be non-proper, representing a non-causal mapping. This allows us to maintain a monic, proper, stable and stably invertible noise filter , which is attractive from an identification perspective.
Remark 3**.**
With the analysis provided in Weerts et al. (2018c), it is possible to extend the results in the current paper to the situation of a noise process having rank smaller than , implying that , and .
3 Equivalent network representations
The freedom that is present in dynamic network representations allows for different selections of node signals to be used for identification of a module. This freedom is formally characterized in this section. Moreover, the general concept of removing a node from a network is defined as abstraction, such that in later sections we can consider abstractions that are relevant for identification.
3.1 Transformation of the global network
Fundamentally, we need to define when two networks are equivalent descriptions of behavior, and what freedom is available to transform the network to an equivalent representation. In the network model definition, it has been stated that the external variables and nodes are known, and it is reasonable to state that equivalent networks must describe the same relation between and . The dynamic influence of on is described by the open-loop transfer function matrix , and so the equivalence of two networks additionally requires equality of the two related open-loop transfer function matrices from to . The open-loop response of the network is described by (8), i.e. . If , and are the same for two networks, then also must be the same.
Definition 4**.**
Let the network model correspond to open-loop transfer and noise spectrum for . Network models and are said to be equivalent if
[TABLE]
**
In the above definition, and are associated with and for . There is an implicit assumption in the definition that the and are the same for both .
The full freedom that is available for transformation of a network model to an equivalent network model is characterized by operations applied to the network equation. These transformations can be represented by pre-multiplying with a matrix. Consider a square rational transfer function matrix , then pre-multiplication of network equation (4) results in
[TABLE]
The above pre-multiplication typically leads to a left-hand side unequal to , in which case we need to move terms to the right-hand side until we have on the left-hand side, i.e.
[TABLE]
which is denoted as
[TABLE]
where
[TABLE]
The transfer function matrix is then transformed as
[TABLE]
A transformation of the noise model is defined in the following way. When we describe the noise model as , a pre-multiplication with does not necessarily lead to a proper, monic, stable and stably invertible filter . For that reason and are obtained through spectral factorization of the transformed noise spectrum
[TABLE]
A transformation that leads to an appropriate network representation must satisfy some conditions.
Proposition 5**.**
Let network model satisfy Definition 1. The transformation operating on as defined in (15), (16), and (17) leads to a network model that satisfies Definition 1 and that is equivalent to if and only if:
* is full rank, and* 2. 2.
diag.
Proof: Collected in the appendix.
An interesting feature of the network transformations is that the response from external variables and process noises to internal variables remains the same. A pre-multiplication as defined above leaves the transfer function matrix invariant, since
[TABLE]
where the identity is used.
A pre-multiplication that leads to a non-hollow can be used to transform a network. However, in that situation additional manipulations would be necessary to arrive at a hollow representation. Without loss of generality we will restrict to transformations that immediately result in a network representation that satisfies (1). There are some restrictions on , but a large freedom in the choice of transformation is left.
Proposition 6**.**
The equivalence transformation presented in (15), (16), and (17) can transform a network model with corresponding modules into a network model with corresponding modules using the transformation
[TABLE]
Proof: Collected in the appendix.
The proposition allows for to be transformed into an arbitrary as long as it is part of a valid network description. The consequence of transforming to an arbitrary is that the corresponding will have a complex structure
[TABLE]
The implication is that when is transformed, will compensate the changes to keep the node behavior invariant. This also holds for the noise model, which will contain additional correlations. Without any further restrictions on the choice of and , the modules represented in contain no information on the dynamic network. It is the combination of that determines the dynamic network.
3.2 Abstraction
The next step is to extend network equivalence with the option to remove nodes from the representation. To this end the concept of network abstraction is defined next. This definition is related to the notions of abstraction in Pappas and Sastry ; Woodbury et al. (2017).
Definition 7**.**
Let network model be associated with nodes , external variables , open-loop transfer , and noise spectrum . Let network model be associated with nodes , external variables , open-loop transfer , and noise spectrum . Let and let be the matrix that selects from , so define with one 1 per row, zeros everywhere else, full row rank, and such that . Network model is said to be an abstraction of if
[TABLE]
The nodes that are in , but not in are said to be abstracted from the network.
Constructing an abstraction of a network implies that some nodes are removed from the network representation, while the remaining nodes stay invariant, in the sense that for the same external signal , the second order statistical properties of the remaining node signals are invariant.
The next step is to determine how to obtain an abstraction of a network. In certain cases, abstracting nodes from a network can be done by simply pre-multiplying the network equation (4) with the selection matrix , i.e.
[TABLE]
However, this only is an abstraction if the abstracted nodes no longer appear on the right-hand side of the equation. If appears on the right-hand side of (22) then the abstracted nodes have an influence on the behavior of the nodes in , such that (21) cannot hold. It has to be determined how to define a transformation such that an abstraction can be obtained by selecting rows from the equation, as in (22).
A node influences other nodes through its out-neighbors, and these corresponding modules are located in a column in . If a node has no influence on the rest of the network, then it has no out-neighbors, and the corresponding column is 0. Abstracting node requires us to transform the network such that a 0-column is formed by transformation, after which the node can be removed. By Proposition 6 we know that such a transformation always exists. The abstraction satisfies the relations
[TABLE]
A noise model constructed as is a non-square matrix, which is difficult to handle in an identification setting. Therefore the transformed noise model , will be obtained through spectral factorization
[TABLE]
3.3 Discussion on Identifiability
In Proposition 6 we have seen that can be transformed into an arbitrary as long as it is part of a valid network description. This may give rise to the question whether we are not dealing with an unnecessarily overparameterized situation, involving , and to describe a dynamic network. However we particularly include the situation that measured external excitation signals enter into physical subsystems of a network, and thus our modules in can have an intrinsic interpretation in the physical world. The price of abstracting nodes in a network is that and can become complex, which may be impossible to identify. In the identifiability analysis provided in Weerts et al. (2018b) the following necessary condition for network identifiability can be found: For a network model set to be network identifiable it is necessary that at least entries on each row of are fixed and non-parameterized. If only the topology is known, then the absent links in the network reflected by zeros in the matrices are the only known entries. In case and become too complex, then the number of zeros is too small for the model to be embedded in a network identifiable model set. This implies that it is impossible to find a unique estimate for the model structure on the basis of the data. The approach in the next section is to define a particular abstraction method, that leads to abstracted networks that can be embedded in network identifiable model sets.
4 Abstraction of networks
We will now formulate and analyze an abstraction algorithm for dynamic networks that generalizes the procedure of immersion Dankers et al. (2016) and the indirect inputs method Linder and Enqvist (2017a). It starts by dividing the network nodes into a set of nodes that are retained after abstraction and a set of nodes that will be removed. The abstracted network will then allow us to analyze the properties of estimated models when the retained/measured node signals are employed in an identification procedure.
4.1 Generalized algorithm
It appears that the action of abstracting nodes in a network is not unique. This is particularly due to the degrees of freedom that exist in transforming network representations to equivalent forms, by premultiplying the system’s equations by appropriate transformation matrices. In order to incorporate this freedom in the abstraction, we decompose each set of node signals and into two disjunct parts:
[TABLE]
The node signals will be abstracted directly by substituting the equation for into the equations for the other node signals. The node signals have the property that they can be indirectly observed by the node signals , and therefore they can be eliminated from the network by utilizing the equation for nodes . The notion of indirect observation will be specified after the next step.
Based on these sets, the network can be represented by
[TABLE]
where and are defined according to and .
The node signals and are chosen in such a way that the signals serve as indirect observations of the node signals , meaning that the signals contain sufficient information of the indirectly observed signals , so as to replace them in an elimination procedure. This is formulated in the following definition.
Definition 8** (indirect observations).**
The node signals serve as indirect observations of the node signals if the transfer function has full column rank.
The property is satisfied if in the network there exists a sufficient number of paths from nodes to nodes that run through nonmeasured nodes only. An illustration of indirect observations is provided in Section 4.3. In the remainder of this paper it will be assumed that and are selected to satisfy the full column rank property of the above definition. Note that this assumption is not restrictive because and are chosen by the user such that the assumption is satisfied. We can always choose and as empty sets, so the assumption does not prevent us from constructing an abstracted network. However, the use of non-empty sets and will allow us to use more degrees of freedom in constructing abstracted networks.
Now we can formulate the generalized abstraction algorithm, as follows.
Algorithm 1**.**
Consider a network representation as in (27). Then the following algorithm leads to a network in which nodes are abstracted and nodes are retained:
- a
Solve the fourth equation of (27) for , and then substitute the result into the other equations, and remove the fourth equation from the network. 2. b
Solve the second equation of (27) for , and substitute the result into the first equation. 3. c
Solve the third equation of (27) for , and substitute the result into the second equation, and remove the third equation from the network. 4. d
Remove possible self-loops in the resulting network representation by shifting self-loop terms to the left hand side of the equations and scaling the equations such that an identity matrix remains at the left hand side.
The algorithm shows that the essential difference between the nodes in and is how is removed from their equation. After application of the algorithm, the abstracted network will be represented by
[TABLE]
Note that the particular type and ordering of variable substitution in Algorithm 1 essentially influences the result. This will be illustrated with examples in Section 5.
4.2 Specification through transformations
Algorithm 1 can be specified by denoting the algebraic manipulations that generate the substitution and elimination operations in the different steps of the algorithm. We will first describe the substitution operations on the set of equations, and at the end of the procedure address the removal of equations.
In step (a) the elimination of is performed, which corresponds to applying the transformation matrix
[TABLE]
to the network representation (27). This leads to a new -matrix given by
[TABLE]
where is partitioned as defined in (27), and is partitioned in the same way.
In steps (b)-(c), we first obtain a new expression for by reverting the expression for , and we substitute the original expression for into the expressions for . This corresponds to applying the transformation matrix
[TABLE]
with , and denoting its left-inverse, and , leading to
[TABLE]
where has the same partitioning as . Note that the left-inverse exists due to the indirect observations property of , as formulated in Definition 8.
The remaining part of steps (b)-(c) is now to substitute the new expression for in the first equation for , thereby eliminating the dependency of this expression on . This is achieved by applying the transformation matrix
[TABLE]
such that
[TABLE]
where has the same partitioning as . The additional term that is added in the fourth row of ensures that in the transformed network all columns that correspond to are zero in , including for the equations that will be removed.
Step (d) of the Algorithm is addressed by removing self-loops in the resulting network representation, by applying a diagonal transformation matrix with diagonal elements
[TABLE]
and being [math] elsewhere.
The total transformation that is applied to the network representation is now given by
[TABLE]
which leads to a -matrix of the transformed network representation that is structured according to
[TABLE]
The abstracted network now results by selecting the first two block rows and columns in the matrix , thereby removing the equations for the unmeasured/abstracted node variables and .
Proposition 9**.**
When applying the abstraction procedure of Algorithm 1 to a dynamic network given by (27), the obtained abstracted network is the same as the abstracted network given by (28) with
[TABLE]
Proof: Collected in the appendix.
4.3 Interpretations and discussion
Compared to selecting a set of nodes , the particular choice of the sets of nodes and creates additional degrees of freedom in the problem of constructing an abstracted network, in which the nodes (including ) are removed. The mechanism that is used is that the network equation for the node signals is inverted to become an equation that describes the node signals . This equation is then subsequently used to substitute and eliminate the signals from the abstracted network. In order to be able to use the network equation for in this way, it needs to capture full information on the node signals . This is reflected in the property of indirect observations, and the required full column rank property of , as formulated in Definition 8. This full rank property implies that . It is generically satisfied if there are dim vertex-disjoint paths present from to that run through nonmeasured/abstracted nodes only (van der Woude, 1991; Hendrickx et al., 2019). An example of the full rank assumption is shown in Figure 1. In the figure there are the two vertex-disjoint paths and for two nodes that are indirectly observed. In this case actually any selection of two nodes from would be sufficient to act as indirect observations of .
The network abstraction introduced here, generalizes two earlier introduced abstraction algorithms. For the particular choice, and , Algorithm 1 describes the method of network immersion, as introduced in Dankers et al. (2016), and developed for the situation . In that case steps (2)-(3) of the algorithm become obsolete.
If is restricted to consist of nodes that are out-neighbors of , and does not contain in-neighbors , and has full column rank, then Algorithm 1 describes the indirect inputs method as defined in Linder and Enqvist (2017b), which has been developed for the situation . If , then the indirect inputs method is equivalent to the immersion method.
Because of well-posedness of the original network, all terms in the transformation matrices are proper, except possibly for the term which may be non-proper. This implies that the introduction of the sets and may lead to a final abstracted network representation that is non-proper. Properness of the resulting network representation is guaranteed if .
4.4 Identifiability analysis
As discussed in Section 3.3, it is impossible to formulate an identifiable model set for network representations of high complexity. The underlying objective of the particular abstraction algorithm is to limit the complexity of the abstracted network. An evaluation of the structure of network representations obtained with the abstraction algorithm is made. In this way we can guarantee that an identifiable model set can be defined for the abstracted network.
A sufficient condition for network identifiability that can easily be verified is that every node has an independent external excitation. This is achieved when the columns of in
[TABLE]
can be permuted to arrive at a matrix with a leading diagonal (Weerts et al., 2018b). In order to verify whether this can be achieved we need to evaluate the structure of . The abstracted network generated by Algorithm 1 corresponds to the transformation , such that
[TABLE]
Since and are formulated in terms of the original network, we can formulate conditions for network identifiability based on the original topology. This is formally done in the next proposition.
Proposition 10**.**
Consider the abstracted network (28) obtained by abstracting the original network (4) with Algorithm 1 by using the sets of nodes . The representation of the external excitations is . The matrix can be given a leading diagonal by column operations if the original network is such that
* is diagonal, and is not an in-neighbor of nodes other than ,* 2. 2.
* is diagonal, and is not an in-neighbor of nodes other than , and* 3. 3.
* is diagonal, , , and .*
Proof: Collected in the appendix.
The proposition implies that abstracted networks that are obtained by Algorithm 1 can be embedded in network identifiable model sets, under some restrictions on the original network. Here we have analyzed network identifiability of all modules in the abstracted network using sufficient conditions. The result may be extended by using less restrictive conditions that make use of the topology present in Weerts et al. (2018b). Moreover, for consistency of the module of interest only network identifiability of that particular module is necessary. Conditions for network identifiability of a particular module are less restrictive than conditions for network identifiability of all modules (Weerts et al., 2018a), which could further reduce the imposed conditions on the structure of the network.
5 Abstraction applied to an example network
In this section we will provide an example to illustrate some of the options that are available in network abstraction. Consider the network in Figure 2 where the nodes are described by the following equations
[TABLE]
If we would like to abstract node , e.g. because this node signal cannot be measured, then we have different options for doing so. The set of retained nodes is and the set of removed nodes is .
We have to make a choice on whether nodes are used as indirect observations. If we choose that there are no indirect observations, then , such that and . We eliminate from the system equations by substituting its expression (43) in the expressions of the nodes that are retained (40)-(42). This leads to a new set of equations given by
[TABLE]
which induces a self-loop around . This can be compensated for by moving the -dependent term to the left hand side, and rewriting the equation for as
[TABLE]
with . As a result the abstracted network is obtained and sketched in Figure 3. This way of eliminating node is referred to as immersion (Dankers et al., 2016), and comes down to lifting each path in the original network that contains the node signal that is eliminated. After removing the abstracted node signals, the remaining node signals are invariant.
As an alternative for the chosen abstraction, we can choose as an indirect observation of , such that , , , and . In this situation, node signal is abstracted by utilizing the expression for . We rewrite equation (41) as
[TABLE]
and substitute this into (40) to obtain the expression for
[TABLE]
In order to obtain the new expression for we directly substitute the expression for (43) into the expression for (41). The abstracted network is sketched in Figure 4, and given by the following equations:
[TABLE]
This alternative method of eliminating node variable is referred to as the indirect inputs method introduced in Linder and Enqvist (2017a). The principle idea is that the out-neighbor of a node that needs to be abstracted contains information about that node. Then the equation of the out-neighbor is manipulated in order to obtain an explicit expression for the node to be abstracted, which is then used to eliminate the node from the network. A major difference with the method of immersion is that the inverse of modules may appear in the resulting network representation.
It can be observed that the network topology and module dynamics can change when nodes are abstracted from the network. In particular the module has changed to when the immersion method is applied, while it remains invariant when the indirect inputs method is applied. This is going to be important when considering the problem of identifying a local module on the basis of a restricted set of measured node signals, as will be discussed in the next section.
6 Identification setting for invariant modules
We have introduced an algorithm to perform network abstraction. The remaining question to answer is how this will help to select nodes for the identification of the module of interest. A central point in our reasoning will be the invariance of the target module in the abstracted network. Although for consistent identification of the target module it is not strictly necessary to have target module invariance, cf. e.g. the indirect type identification methods of Bazanella et al. (2017); Gevers et al. (2018); Hendrickx et al. (2019) or the Wiener-filter based method of Materassi and Salapaka (2015, 2019), invariant target modules are very attractive in two-stage methods (Dankers et al., 2016; Linder and Enqvist, 2017a), and they are indispensable in direct methods (Dankers et al., 2016) that have the potential to provide consistent and maximum likelihood (and thus minimum variance) results. In Dankers et al. (2016) and Linder and Enqvist (2017a) two different abstraction methods have been used to select node signals to be measured for identification of a target module, based on the invariant module principle. The prime reasoning and the formulation of generalized results are presented next.
6.1 Direct identification setup
If the target module to be identified is , then a MISO identification setup on the basis of the abstracted network can be formulated in the following way. Node is used as output, and the following nodes are inputs: with , , and possibly additional external excitations. In this way an identification algorithm can provide us with a consistent estimate of the modules of the abstracted network for all , provided that some regularity conditions are satisfied, among which sufficient excitation properties of the measured signals. We have seen in the examples of the previous section that a module may remain unchanged after abstraction for particular choices of the sets , i.e.
[TABLE]
If the modules of the abstracted network are estimated consistently, and the module of interest has remained invariant in the abstracted network, then the module of interest is estimated consistently. In the remainder of this section we will address the problem under which conditions the abstracted network has the mentioned invariance property (44) of .
6.2 Invariance of the module
When applying immersion as a specific abstraction algorithm, it has been analyzed in Dankers et al. (2016) under which conditions on the set of retained node signals, a particular module in the network will remain invariant.
Proposition 11** (Dankers et al. (2016)).**
Consider a dynamic network as defined in (4), and let be the module of interest. Denote with the module in the network that is abstracted using Algorithm 1 with the sets . Let the network abstraction be performed through immersion, i.e. with . Define the set . Then
[TABLE]
if satisfies the following conditions:
, , 2. 2.
every path from to , excluding the path , goes through a node , , 3. 3.
every loop from to goes through a node , .
According to this proposition, there are two situations that need to be checked for guaranteeing module invariance: parallel paths and loops around the output. Every path that connects input and output parallel to the target module, and every loop around the output should be “blocked” by another node that is retained in the abstracted network.
6.3 Generalization of invariance conditions
The following two examples illustrate the parallel paths and loops around the outputs, leading to a generalization of Proposition 11 that extends the applicability from the immersion abstraction algorithm to the generalized abstraction algorithm. In the next examples, noise-free networks are used in order to stick to the core reasoning.
Example 12** (Parallel paths).**
Consider the left network in Figure 5 and the module of interest . Paths that run in parallel to this module, i.e. paths from to , may lead to changes in the module of interest during abstraction. If is removed using immersion, with , then is substituted into the equation for , such that the dynamics of modules and are merged with module of interest , i.e.
[TABLE]
As stated in Proposition 11, a way to prevent these parallel paths from changing the module of interest is by including a node in every parallel path in the abstracted network, for example by measuring .
An alternative way of removing is to include as an indirect observation of , i.e. by choosing , . In this case the node is substituted with such that
[TABLE]
The substitution uses an equation that does not contain , such that remains invariant. Apparently, it is not strictly necessary to include a node in every parallel path in the abstracted network. An indirect observation of a node in the parallel path may be used to block this path.
When an additional path exists as in the right network in Figure 5, the situation changes. Now the equation for node depends on , and if the unknown node is eliminated using the indirect observation , then an additional contribution from appears such that the module of interest is changed, i.e.
[TABLE]
where is used. If in the left network of Figure 5 there is no path from to , then cannot be used as an indirect observation.
From the example it can be observed that the nodes used as indirect observations, i.e. , should not have as an in-neighbor.
Example 13** (Self-loops).**
Consider the left network in Figure 6 and suppose the module of interest is . Paths that run as a loop around the output of this module, i.e. paths from to , may lead to changes in the module of interest during abstraction. If the node of the left network in Figure 6 is eliminated by immersion, using and , then abstraction leads to the following. The equation is substituted into the equation for , after which a self-loop around is resolved. This leads to the following change in the module of interest
[TABLE]
As stated in Proposition 11, a way to prevent these loops around the output from changing the module of interest is by including a node in every such loop around in the abstracted network, for example by measuring .
An alternative way of removing is to include as an indirect observation of , i.e. by choosing and . In this case the is substituted for such that
[TABLE]
The substitution uses an equation that does not contain , such that no self-loop has to be resolved, and remains invariant. It is thus not strictly necessary to include a node in every loop around in the abstracted network. An indirect observation of a node in a loop around may be used to block this path.
If instead there is a direct link like in the right network of Figure 6, then depends directly on , and using this equation for elimination of would again lead to a dependence of on itself in the abstracted network, i.e.
[TABLE]
where is substituted for . The self-loop should be resolved, leading to
[TABLE]
where it is obvious that the module of interest has changed.
In conclusion, for verifying module invariance in abstracted networks obtained by Algorithm 1 we have to consider the following. It is not sufficient to only consider parallel paths from to and loops from to that appear in the data generating system. We have to also consider indirect observations of the nodes that are part of parallel paths and loops around the output. Paths from and to the indirect observations also have to be considered to avoid merging of paths and to keep invariant under the transformation. These observations lead to the following formal result.
Theorem 14**.**
Consider a dynamic network as defined in (4), and let be the module of interest. Denote with the module in the abstracted network that is obtained using Algorithm 1 with the sets . Assume that nodes act as indirect observations of nodes according to Definition 8, and that . Define the sets and . Then
[TABLE]
if the following conditions on the sets , and are satisfied:
- (a)
All paths from to , excluding the direct path , pass through a node , 2. (b)
All paths from to pass through a node .
Proof: Collected in the appendix.
In condition (a) the index is excluded from the set since every path that starts in contains a node in . Conditions (a) and (b) imply that there cannot be any direct paths from and to indirect observations , i.e. , and .
The set is the set of, either directly retained signals in , except for node , or indirectly observed nodes in . The result of the theorem implies that all parallel paths from to and all loops around the ’output’, i.e. all paths from to , must pass through a node in this set.
Remark 15**.**
The conditions in Theorem 14 are a generalization of the conditions for immersion. For the choice and , Algorithm 1 is equivalent to the immersion algorithm, and the results of Theorem 14 are equivalent to the conditions of Proposition 11, (Dankers et al., 2016). In the generalized situation, parallel paths and loops around the output can also be blocked by indirectly observed nodes, present in , instead of just by directly observed nodes in .
Remark 16**.**
The conditions in Theorem 14 are a generalization of the conditions for the indirect inputs method, as formulated in Linder and Enqvist (2017a). This latter method results if we consider the particular situation that indirect observations are no in-neighbors of the output node, i.e. , and that all in-neighbors of indirect observations are in , i.e. . In the generalized situation presented here, indirect observations are allowed to be in-neighbors of , and they are allowed to have abstracted nodes as in-neighbors.
7 Node selection strategy
Theorem 14 allows us to check whether a module remains invariant under abstraction if the network topology is known and we have divided the nodes into four groups. The next question is how to choose the sets of nodes, based on the network topology, such that the module of interest remains invariant.
7.1 Selecting the sets of nodes
The strategy to obtaining a set of measured nodes in Dankers et al. (2016) is as follows. First the input and output nodes of the module of interest are required to be measured. Then every parallel path from the input to the output node must be blocked by a measured node. This means that nodes are added such that each of those paths contains a measured node. Similarly every loop around the output node must be blocked by a measured node, so nodes are added such that each of those loops contains a measured node. Different nodes on a path can be chosen to block the path, so the choice of which nodes to measure is not unique.
Now, the method of choosing nodes is adapted with the possibility of using indirectly observed nodes. A parallel path or a loop can now be blocked by either a measured or an indirectly observed node. However, when we use an indirect observation to block a path, additional conditions must be satisfied. Paths from either input or output of the module of interest to the indirect observation must also be blocked by either a measured or an indirectly observed node. For each indirect observation that is added, this condition on blocking the paths is applied recursively. This selection method is demonstrated in the following example.
Example 17** (Selecting nodes).**
For an illustration of how to select nodes, consider the network in Figure 7. The module of interest is , so we select as output, and is included as a predictor input. A parallel path through node exists and must be blocked if is to remain invariant. We can either include as a predictor input, or we can choose to indirectly observe it using . When is chosen as indirect input measurement, , and the parallel path from to through should be blocked, so either should then be included as a predictor input, or can be included as the indirect observation of
7.2 External excitation
In an estimation setting, both the nodes and are used as predictor inputs to parameterized modules. The question can then be raised what the effect is on the identification setup for nodes being present in one of these sets. The external variables that are in-neighbors of the node in the abstracted network need to be included as input. Depending on the chosen nodes in and different external variables are in-neighbors of the node , so different external variables need to be chosen in the experimental setup. We have seen that placing a node in either or leads to a different transformation matrix . This leads to different locations of zeros in the transformed . The structure of can be described as follows. Let denote the structure of a matrix that is diagonal, and let denote a matrix of arbitrary structure, then
[TABLE]
where
[TABLE]
If we consider situation that the conditions of Proposition 10 are satisfied, e.g. where and are diagonal, where , , and , and considering that is diagonal, then the external excitations
[TABLE]
may be in-neighbor of . In terms of choosing a network model set for the abstracted network, the structure of (54) specifies how to choose the zero-structure of the parameterized that is to be used for estimation of the abstracted network.
7.3 The noise model
Due to the abstraction, the noise process is modified in a way that is the same as the modification of the external excitations. The following expression is obtained for the disturbances
[TABLE]
with specified in (55). The obtained noise filter above is not square, which is problematic in terms of identification. This noise model relates to a square noise filter and white noise that can be used in an identification setting. It is likely that the obtained is then no longer diagonal. The zero-structure of the obtained noise filter can be used as the zero-structure when parameterizing the network model set. Under particular conditions special noise structures can be obtained that can be exploited. If no particular structure is obtained for the noise model, then all process noises are correlated.
7.4 Identification methods
For a particular network, and a choice of target module , the choice of the node sets will determine whether the target module will remain invariant in the abstracted network. This result can be applied in the problem of identifying the target module on the basis of measured node signals. In the abstracted network we have the node signals , which we assume to be available from observations. We can now construct an identification setup in line with the methods developed in Van den Hof et al. (2013). Determine the set of input predictors as those node signals in that are in-neighbors of the output in the abstracted network. If has remained invariant in the abstracted network, i.e. when the conditions of Theorem 14 are satisfied, he identification problem of estimating the transfer functions from inputs in to output will now estimate the module from input to output that is equal to the module in the original network. Consistent identification of this module is then possible under the typical regularity conditions of the prediction error methods, as formulated in Van den Hof et al. (2013). This implies that:
- •
For the two-stage identification method, consistency of the estimate is achievable if there is a sufficient excitation by external excitation signals in the network;
- •
For the direct identification method, consistency of the estimate is achievable, if besides sufficient excitation by external excitation and disturbance signals, correlated noises between inputs and outputs are taken care of. This can be done by either choosing the node sets such that these correlated noises (or confounding variables) do not occur, (Dankers et al., 2017), or by modeling this noise correlation correctly in the model, leading to the so-called joint-direct method (Weerts et al., 2018c; Van den Hof et al., 2019).
When applied to the abstracted network, the prediction error associated with the joint-direct method is
[TABLE]
where the are structured according to the topology obtained by network abstraction, the are all retained nodes, and are all available external excitations. Then is minimized over the parameters to obtain the estimated model. A further analysis of the particular identification results is beyond the scope of this paper.
8 Conclusions
The question to be answered is which set of measured nodes can lead to consistent estimates of a target module. As a way to answer this question the concept of abstraction has been introduced as a way to remove unmeasured nodes from a network representation, as a generalization of methods present in literature. A systematic method has been introduced to select nodes such that the module of interest remains invariant in the abstracted network. Under some assumptions on external excitations and the network topology the abstracted network can be parameterized with a network identifiable model set. If the module of interest remains invariant, and the model set is identifiable, then conditions for consistent estimation can be obtained for various identification methods.
A requirement that has been imposed is that the module of interest remains invariant in the abstracted network, but this is not necessary for consistency. It may be that the module of interest can be identified in an indirect way by combining the knowledge of two or more modules present in the abstracted network. It is also possible that there are multiple sets of measured nodes that each lead to consistent estimates of a module of interest. Selecting the set of nodes that leads to the smallest variance is another question for future consideration.
Acknowledgements
The authors are grateful for the discussions with Arne Dankers on the topic of the paper.
9 Appendix
9.1 Proof of Proposition 5
Sufficiency: By (15) the diagonal of is
[TABLE]
which is 0 by condition (2), showing that is hollow. Moreover if Condition (1) is satisfied, then with (15), (16):
[TABLE]
which is proper and stable by Definition (1). A monic, proper, stable, and inversely stable and full rank are obtained through the spectral factorization in (17).
Necessity: In order for to be proper and be stable, it is required that exists. Therefore has to have full rank. In order for to be hollow, it is required that .
9.2 Proof of Proposition 6
Substituting into the definition of the transformation (15) gives
[TABLE]
which shows that is obtained by applying this transformation. Moreover the diagonal of is 0, so is an appropriate transformation.
9.3 Proof of Proposition 9
In order to prove the proposition we evaluate the expressions for each step of Algorithm 1.
Step a: The fourth equation of (27) is solved for
[TABLE]
Substituting the above equation into the remainder of the network results for in
[TABLE]
with
[TABLE]
For we obtain
[TABLE]
with
[TABLE]
For we obtain
[TABLE]
with
[TABLE]
It is straightforward to verify that the transformation , , and results in the same expressions as the algorithm.
Steps b and c: The two equations that are solved for result in
[TABLE]
and
[TABLE]
Substituting (65) into (62) results in
[TABLE]
with
[TABLE]
Substituting (65) into (63) results in
[TABLE]
with
[TABLE]
The combination of the transformations is denoted by
[TABLE]
and it is straightforward to verify that this transformation , , and results in the same expressions as the algorithm.
Step d: Removal of a self-loop in the equation for node is performed by subtracting the right-hand term for from both sides, and then dividing both sides by 1 minus the term. This is precisely the operation performed by the transformation .
9.4 Proof of Proposition 10
Transfer function matrix is
[TABLE]
where the transformations are defined in Section 4.2. The structural properties of the matrices will be evaluated. Let indicate an unstructured matrix, and let indicate a diagonal matrix structure. Then the structure of the first part of the transformation matrix is
[TABLE]
where the matrices are partitioned corresponding to the blocks and .
The transformation has the following structure
[TABLE]
where the matrices are partitioned corresponding to the blocks , and where . From the relations in (63) and (64) we obtain that
[TABLE]
and
[TABLE]
Then from condition 3 we obtain that is diagonal.
Under conditions 1 and 2, matrix has the structure
[TABLE]
such that the following is obtained
[TABLE]
Then the final structure is
[TABLE]
It is then obvious that a leading diagonal can be obtained by column operations.
9.5 Proof of Theorem 14
In order to prove the theorem, the conditions must be interpreted in terms of . Conditions (a) and (b) imply that there are no direct paths from and to indirect observations , i.e.
- (i)
, 2. (ii)
.
The conditions also imply that there are no paths from and to indirect observations and that only go through unmeasured nodes , i.e.
- (iii)
, 2. (iv)
, 3. (v)
, 4. (vi)
.
The module of interest is a part of in (37) which can be obtained as
[TABLE]
Explicit expressions can be found in the proof of Proposition 9 such that using (i)-(vi) we can see that . First it is shown that . From (67) we obtain that
[TABLE]
Then it can be observed that
[TABLE]
if we use the expression from (63) and the obtained conditions (i) and (iii). Now is evaluated using the expression in (62)
[TABLE]
Then by condition (v) we have .
Since is diagonal, all that is left to show is that its -th entry is 1. Using the expression (35) we then need to show that . From (67) we obtain that
[TABLE]
Then it can be observed that
[TABLE]
if we use the expression from (63) and the obtained conditions (ii) and (iv). Now is evaluated using the expression in (62)
[TABLE]
Since there are no self loops in , the . Then by condition (vi) we have , such that and .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bazanella et al. (2017) A.S. Bazanella, M. Gevers, J. Hendrickx, and A. Parraga. Identifiability of dynamical networks: which nodes need be measured? In Proc. 56th IEEE Conf. Decision and Control (CDC 2017) , pages 5870–5875, 2017.
- 2Chiuso and Pillonetto (2012) A. Chiuso and G. Pillonetto. A Bayesian approach to sparse dynamic network identification. Automatica , 48(8):1553––1565, 2012.
- 3Dankers (2014) A. G. Dankers. System identification in dynamic networks . Ph D thesis, Delft University of Technology, 2014.
- 4Dankers et al. (2015) A. G. Dankers, P. M. J. Van den Hof, X. Bombois, and P. S. C. Heuberger. Errors-in-variables identification in dynamic networks—consistency results for an instrumental variable approach. Automatica , 62:39–50, 2015.
- 5Dankers et al. (2016) A. G. Dankers, P. M. J. Van den Hof, P. S. C. Heuberger, and X. Bombois. Identification of dynamic models in complex networks with prediction error methods: Predictor input selection. IEEE Trans. on Automatic Control , 61(4):937–952, 2016.
- 6Dankers et al. (2017) A. G. Dankers, P. M. J. Van den Hof, D. Materassi, and H. H. M. Weerts. Conditions for handling confounding variables in dynamic networks. IFAC-Papers On Line , 50-1:3983–3988, 2017. Proc. 20th IFAC World Congress.
- 7Everitt et al. (2018) N. Everitt, G. Bottegal, and H. Hjalmarsson. An empirical Bayes approach to identification of modules in dynamic networks. Automatica , 91:144 – 151, 2018.
- 8Gevers et al. (2018) M. Gevers, A.S. Bazanella, and G. Vian da Silva. A practical method for the consistent identification of a module in a dynamical network. IFAC-Papers On Line , 51-15:862–867, 2018. Proc. 18th IFAC Symp. System Identif. (SYSID 2018).
