This paper introduces a new realization and identification algorithm for stochastic LPV state-space models with exogenous inputs, combining correlation analysis and covariance realization for efficient and consistent estimation.
Contribution
It presents a novel algorithm that integrates deterministic LPV realization with stochastic covariance methods, improving model estimation accuracy and computational efficiency.
Findings
01
Algorithm is computationally efficient.
02
Estimates LPV model matrices accurately from empirical data.
03
Validated through a numerical case study.
Abstract
In this paper, we present a realization and an identification algorithm for stochastic Linear Parameter-Varying State-Space Affine (LPV-SSA) representations. The proposed realization algorithm combines the deterministic LPV input output to LPV state-space realization scheme based on correlation analysis with a stochastic covariance realization algorithm. Based on this realization algorithm, a computationally efficient and statistically consistent identification algorithm is proposed to estimate the LPV model matrices, which are computed from the empirical covariance matrices of outputs, inputs and scheduling signal observations. The effectiveness of the proposed algorithm is shown via a numerical case study.
Tables2
Table 1. Table 1: BFR and VAF on a noise-free validation data Algorithm 4
BFR
93.56 %
VAF
99.58 %
Table 2. Table 2: True vs estimated sub-Markov parameters
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
Realization and identification algorithm for stochastic LPV state-space models with exogenous inputs
Manas Mejari
Mihály Petreczky
Centre de Recherche en Informatique, Signal et Automatique de Lille, University of Lille 1, Villeneuve- d’Ascq 59651, France (e-mail: [email protected])
Centre de Recherche en Informatique, Signal et Automatique de Lille, UMR CNRS 9189, Ecole Centrale de Lille,
Villeneuve dAscq 59651, France ([email protected])
Abstract
In this paper, we present a realization and an identification algorithm for stochastic Linear Parameter-Varying State-Space Affine (LPV-SSA) representations.
The proposed realization algorithm combines the deterministic LPV input output to LPV state-space realization scheme based on correlation analysis with a stochastic covariance realization algorithm.
Based on this realization algorithm, a computationally efficient and statistically consistent identification algorithm is proposed to estimate the LPV model matrices, which are computed from the empirical covariance matrices of outputs, inputs and scheduling signal observations.
The effectiveness of the proposed algorithm is shown via a numerical case study.
keywords:
Linear Parameter-Varying systems, stochastic realization.
††thanks: This work was partially funded by CPER Data project, which is co-financed by European Union with the financial support of European Regional Development Fund (ERDF), French State and the French Region of Hauts-de-France.
1 Introduction
Identification of Linear Parameter-Varying (LPV) models has gained significant attention over the past few years, owing to their ability to describe the behavior of many time-varying and non-linear systems. Many approaches have been proposed for the identification of LPV models, in input-output (IO) (Bamieh and Giarré, 2002; Laurain et al., 2010; Mejari et al., 2018; Piga et al., 2015) as well as state-space (SS) representations (Felici et al., 2007; Tanelli et al., 2011; van Wingerden and Verhaegen, 2009; Verdult and Verhaegen, 2005). The reader is referred to (Tóth, 2010) for a detailed summary of the available LPV identification approaches.
Controller design approaches often require the LPV models to be in SS representation with an affine dependency on the scheduling variable.
To this end, realization theory of LPV models plays a key role in understanding the conditions under which the observed behavior of a system can be realized by a state-space affine representation. It also allows to formulate identification algorithms for estimating state-space representation from a finite set of observations.
The realization theory for deterministic Linear Parameter-Varying State-Space with Affine dependence (LPV-SSA) representation has been developed in Tóth et al. (2012); Petreczky et al. (2017). The results of Tóth et al. (2012); Petreczky et al. (2017) were used to derive LPV-SS identification algorithm in Cox et al. (2015, 2018).
These methods are focused mainly on deterministic realizations, which for certain control and filtering problems are too restrictive.
In this paper, we focus on formulating a realization algorithm and a related identification algorithm for stochastic LPV-SSA representations with inputs. The main idea is to decompose the stochastic LPV-SSA realization/identification problem into two independent problems: realization/identification of deterministic part which depends only on the input,
and realization/identification of stochastic part.
To this end, the proposed algorithm is based on the combination of correlation analysis (Cox et al., 2018) for deterministic realization and stochastic covariance identification algorithm for stochastic LPV-SSA representations (Mejari and Petreczky, 2019a).
The algorithm presented in this paper extends the results of Petreczky and Vidal (2018); Mejari and Petreczky (2019a), to the case of stochastic LPV-SSA representations with exogenous inputs.
The proposed approach differs significantly from the subspace based identification methods for stochastic LPV-SSA representations (van Wingerden and Verhaegen, 2009; dos Santos et al., 2009; Favoreel et al., 1999).
First, the cited papers do not deal with the realization problem. In particular, while the possibility of decomposing the output into a deterministic and purely stochastic
components is sometimes claimed in the literature, the formal details of such a decomposition were never addressed.
Second, in contrast to the literature mentioned above, the proposed identification algorithm in this paper
is provenly consistent and it does not require local observability assumptions.
The downside is that the proposed algorithm is provenly consistent only for
a specific class of scheduling signals and stochastic LPV-SSA representations.
Moreover, the proposed algorithm avoids the curse of dimensionality, but this comes at a price of either using some
prior knowledge on the system to determine the correct selection of the rows and columns of a Hankel-matrix
or using an exhaustive search to find such a selection.
The paper is organized as follows. In Section 2, we present the problem formulation.
Section 3 presents the formal definition and basic properties of the class of LPV state-space representations
considered in this paper. In Section 4, we formalize the decomposition of outputs of such LPV state-space
representations into stochastic and deterministic components. In Section 5, we present the realization algorithm
for stochastic LPV state-space representations, and in Section 6 we present the related identification algorithm.
Finally, in Section 7 we illustrate the results with a numerical example.
Notation
In the sequel, we will use the standard terminology of probability theory (Bilingsley, 1986). In particular, all
the random variables and stochastic processes are understood w.r.t. to a fixed probability space (Ω,F,P), where F is a σ-algebra over the sample space Ω (i.e., F is a collection of subsets of Ω, that includes Ω itself, is closed under complement, is closed under countable unions and is closed under countable intersections)
and P is a probability measure on F. For two σ-algebras Fi, i=1,2, F1∨F2 denotes the smallest σ-algebra generated by the σ-algebras F1,F2. The expected value of a random variable x is denoted by E[x] and conditional expectation w.r.t. σ- algebra F is denoted by E[x∣F].
All the stochastic processes in this paper are discrete-time ones defined over the time-axis Z of the set of integers.
A discrete-time stochastic process is a collection {x(t)}t∈Z taking values in X, where x(t)∈X is a random variable for all t∈Z.
We denote by In the n×n identity matrix.
2 PROBLEM FORMULATION
Let y, u, μ be stochastic processes taking values in Rny,
Rnu and Rnμ respectively. In this paper, y represents
the output process, u is the input process, and
μ is the scheduling signal process.
We define a discrete-time Linear Parameter-Varying State-Space Affine (LPV-SSA) representation of the process (y,u,μ) as the discrete-time system
of the form
[TABLE]
where, Ai∈Rnx×nx, Bi∈Rnx×nu, Ki∈Rnx×ny, ∀i=1,…,nμ, C∈Rny×nx and D∈Rny×nu are real constant matrices, and
v is a white noise process, i.e., E[v(t)vT(s)]=0, s=t and
E[v(t)vT(t)μi(t)]=Qi>0, i=1,…,nμ. The realization and identification problems considered in this paper are as follows.
Problem 1** (Realization problem)**
For process (y,u,μ), find matrices ({Ai,Bi,Ki}i=1nμ,C,D) and processes x,v such that
(2) is a representation of (y,u,μ).
Problem 2** (Identification problem)**
Assume that y:Z→Rny is a sample path of the output process y, u:Z→Rnu is a sample path of the input process u and μ:Z→Rnμ is a sample path of the scheduling process μ, corresponding to the same random event ω∈Ω.
Given a dataset {y(t),u(t),μ(t)}t=1N consisting of N samples of the output, input and scheduling process, compute from this dataset the estimates {{A^iN,B^iN,K^iN,Q^iN}i=1nμ,C^N,D^N}, such that as N→∞, the estimated matrices {{A^iN,B^iN,K^iN,Q^iN}i=1nμ,C^N,D^N} converge to matrices {{Ai,Bi,Ki,Qi}i=1nμ,C,D} such that
the LPV-SSA (2) with Qi=E[v(t)v⊤(t)μi2(t)], i=1,…,nμ, is a representation of (y,u,μ).
3 Properties of LPV-SSA representation
In order to make Problems 1-2 well-posed, we have to
impose additional constraints on the class of processes (y,u,μ) and on the class
of LPV-SSA representations.
Next, we recall from Petreczky and Vidal (2018) the notion of Zero Mean Wide Sense Stationary w.r.t. Inputs (ZMWSSI) process, which will be a central notion for
the mathematical framework of stochastic LPV-SSA representations. To this end, we need the following
notation and terminology.
Notation 1** (Σ)**
Let Σ={1,…,nμ}.
The following terminology from automata theory is used.
A non empty word over Σ is a finite sequence of letters, i.e., w=σ1σ2⋯σk, where 0<k∈Z, σ1,σ2,…,σk∈Σ. The set of all nonempty words is denoted by Σ+. We denote an empty word by ϵ. Let Σ∗=ϵ∪Σ+.
The concatenation of two nonempty words v=a1a2⋯am and w=b1b2⋯bn is defined as vw=a1⋯amb1⋯bn for some m,n>0. Note that if w=ϵ or v=ϵ, then vϵ=v and ϵw=w, moreover, ϵϵ=ϵ. The length of the word w∈Σ∗ is denoted by ∣w∣, and ∣ϵ∣=0. Example:
for nμ=2, Σ={1,2}, Σ∗={ϵ,1,2,11,12,21,22,111,…}, for the word w=111∈Σ∗, ∣w∣=3.
Assumption 1** (White noise scheduling)**
The scheduling process μ=[1,μ2,…,μnμ]T is zero-mean independent identically distributed (i.i.d.) such that, for all t∈Z, we have μ1(t)≡1, and for each σ=2,…,nμ, μσ is a zero mean i.i.d. process.
We define scalars E[μσ2(t)]=pσ, for all t∈Z. In particular, p1=1.
For every word w∈Σ+ where w=σ1σ2⋯σk, k≥1, σ1,…,σk∈Σ, we define the process μw and
the number pw as follows
[TABLE]
We set μϵ(t)=1 and pϵ=1.
For a process r∈Rnu, for each w∈Σ+ we define the process zwr as
[TABLE]
which is interpreted as the past of r w.r.t. {μσ}σ∈Σ.
Definition 1** (ZMWSSI, Petreczky and Vidal (2018))**
A stochastic process r is Zero Mean Wide Sense Stationary w.r.t. the scheduling process
μ (ZMWSSI) if
For t∈Z, the σ-algebras generated by the random variables {r(k)}k≤t, {μσ(k)}k<t,σ∈Σ and {μσ(k)}k≥t,σ∈Σ, denoted by Ftr, Ftμ,− and Ftμ,+ respectively, are such that Ftr and Ftμ,+ are conditionally independent w.r.t. Ftμ,−.
2. 2.
The processes {r,{zwr}w∈Σ+} are zero mean, square integrable and are jointly wide sense stationary.
That is, ∀t,s,k∈Z, and for all w,v∈Σ+,
E[r(t)]=0, E[zwr(t)]=0, and
A process r is said to be square integrable w.r.t. {μσ}σ∈Σ (SII process), if ∀w∈Σ∗,t∈Z,
the random variable zwr+(t)=r(t+∣w∣)μw(t+∣w∣−1)pw1,
is square integrable.
All the process considered in this paper will be assumed to be ZMWSSI and SII process w.r.t. μ.
Definition 3** (White noise w.r.t. μ)**
A process r is called a white noise process w.r.t. μ, if r is ZMWSII w.r.t. μ,
and
E[r(t)(zwr(t))T]=0, E[zσwr(t)(zσwr(t))T]=E[zσr(t)(zσr(t))T]>0, for all w∈Σ+, σ∈Σ.
Using the concept of ZMWSSI process and white noise process w.r.t. μ, we can formulate the main
assumption regarding the processes (y,u,μ).
Assumption 2
Assume that μ satisfies Assumption 1, and
[yTuT]T is a ZMWSSI and SII process w.r.t. μ, and
u is a white noise process w.r.t. μ, and
the covariance E[zσu(t)(zσu(t))T]=E[u(t−1)(u(t−1))T]=Λu>0 does not depend on
σ∈Σ.
Next, we recall from Mejari and Petreczky (2019a) the notion of a stationary stochastic LPV-SSA representation of a process r without inputs.
Definition 4
A stationary LPV-SSA representation without inputs of a process r taking values in Rp, is a tuple ({A~σ,K~σ}σ=1nμ,C~,D~,x~,v~), where
A~σ∈Rn~×n~,K~σ∈Rn~×m~,
C~∈Rp×n~ and v is a process taking values in Rm~ such that
such that
[x~Tv~T]T* is a ZMWSSI process, and E[zσx~(t)(zσv~(t))T]=0, E[x~(t)(zwv~(t))T]=0 for
all σ∈Σ, w∈Σ+.*
2. 2.
v~* is a white noise process w.r.t. μ.*
3. 3.
The eigenvalues of the matrix ∑σ∈ΣpσA~σ⊗A~σ are inside the open unit circle.
4. 4.
We call x~ the state process and v~ the noise process.
In the terminology of Petreczky and Vidal (2018), a stationary LPV-SSA without inputs u, corresponds to a stationary generalized
bilinear system w.r.t. the scheduling inputs {μσ}σ∈Σ.
From Petreczky and Vidal (2018), if a process r has a stationary LPV-SSA representation without inputs, then r is a ZMWSSI process and
x~ is uniquely determined by v~ and the matrices (C~,D~,{A~σ,K~σ}σ∈Σ).
In order to define this notion more precisely, let us introduce the following notation.
Notation 2** (Matrix Product)**
Consider a collection of square matrices Aσ∈Rn×n, σ∈Σ. For any word w∈Σ+ of the form w=σ1σ2⋯σk, k>0 and σ1,…,σk∈Σ, we define
Aw=AσkAσk−1⋯Aσ1.
For an empty word ϵ, Aϵ=In.
From Petreczky and Vidal (2018); Mejari and Petreczky (2019a) it follows that
[TABLE]
where the infinite sum on the right-hand side is absolutely convergent in the mean square sense.
Using the notion of a stationary LPV-SSA without inputs, we can define the class of LPV-SSA representation with inputs which
will be considered in this paper.
Definition 5** (Stationary LPV-SSA)**
*The LPV-SSA representation (2) is stationary with input u, if
({Aσ,[KσBσ]}σ∈Σ,C,[InyD]x,[vTuT]T) is a stationary LPV-SSA representation of y without inputs as in Definition 4, and the orthogonality condition E[v(t)uT(t)μσ2(t)]=0,
∀σ∈Σ holds.
*
From (3) it follows that for a stationary LPV-SSA representation with input u of the form (2),
[TABLE]
where the infinite sums on the right hand side are absolutely convergent in the mean-square sense.
That is, the matrices and the noise processes determine the state process of a stationary LPV-SSA (with or without inputs) uniquely.
4 Decomposition of the output of LPV-SSA representation
It turns out that the output process of stationary LPV-SSA representations admits a decomposition into
deterministic and stochastic parts. The deterministic part depends only on the input process, while the stochastic part depends only on the noise process.
This decomposition does not depend on the particular choice of LPV-SSA representation, but only on the output process at hand.
In order to explain this decomposition in more detail, we recall from Petreczky and Vidal (2018) the
following terminology.
Notation 3** (Orthogonal projection El)**
*Recall that the set of square integrable
random variables taking values in R, forms a Hilbert-space with the scalar product defined as <z1,z2>=E[z1z2]. We denote this Hilbert-space by H1.
Let z be a square integrable vector-valued
random variable taking its values in Rk. Let M be a closed subspace of H1.
By the orthogonal projection of z onto the subspace M, denoted by El[z∣M],
we mean the vector-valued square-integrable random variable z∗=[z1∗,…,zk∗]T such that zi∗∈M is the orthogonal projection of the ith coordinate zi of z onto M, as it is usually defined for Hilbert spaces.
Let S be a subset of square integrable random variables in Rp for some integer p, and
suppose that M is generated by the coordinates of the elements of S, i.e. M is the smallest (with respect to set inclusion) closed subspace of H1
which contains the set {αTs∣s∈S,α∈Rp}.
Then instead of El[z∣M] we will use El[z∣S] to denote the projection of z to M.
*
Definition 6** (Deterministic and stochastic components)**
Assume the processes (y,u,μ) satisfy Assumption 2. Define the deterministic componentyd of y as follows
[TABLE]
Define the stochastic component of y as
[TABLE]
From the definition it follows that
[TABLE]
i.e., the process y(t) can be represented as the sum of its deterministic and stochastic
components. In case when the process admits an LPV-SSA representation, the stochastic and deterministic
components satisfy the following properties.
Lemma 1** (Decomposition of y)**
*Assume that there exists a stationary LPV-SSA representation of (y,u,μ) of the form
(2) and that (y,u,μ) satisfy Assumption 2.
It then follows that
*
[TABLE]
and ({Aσ,Bσ}σ∈Σ,C,D,xd,u) is a stationary LPV-SSA representation of yd without inputs and with noise process u, moreover,
[TABLE]
and ({Aσ,Kσ}σ∈Σ,C,Iny,xs,v)
is a stationary LPV-SSA representation of ys without inputs, where
[TABLE]
The proof of Lemma 1 is presented in (Mejari and Petreczky, 2019b, Appendix A.1).
Thus, ys depends only on the noise v, and yd does not depend on the noise but it depends
only on input u.
In fact, the converse of Lemma 1 also holds.
Lemma 2
Assume that y has a stationary LPV-SSA representation with input u.
Assume that Σd=({A^id,B^id}i=1nμ,C^d,D^d,x^d,u) is a
stationary LPV-SSA representation of yd without input such that its noise process equals the input process u.
Assume that Σs=({A^is,K^is}i=1nμ,C^s,Iny,x^s,es)
is a stationary LPV-SSA representation of ys without inputs in forward innovation form, i.e., assume that the process es is the so called innovation process of ys as defined in Mejari and Petreczky (2019a); Petreczky and Vidal (2018):
[TABLE]
Then, tuple ({A^i,K^i,B^i}i=1nμ,C^,D^,x^,es) is a stationary
LPV-SSA representation of y with input u, where
[TABLE]
Moreover, the innovation process es satisfies
[TABLE]
The proof of Lemma 2 is presented in (Mejari and Petreczky, 2019b, Appendix A.2).
Thus, the problem of realization of y can be decomposed into two problems:
P1
finding a stationary LPV-SSA representation Σd without inputs of yd, such that the noise process of Σd is u,
P2
finding a stationary LPV-SSA representation Σs without inputs of ys=y−yd, such that the noise process es of Σs is the innovation process of ys as defined in Mejari and Petreczky (2019a); Petreczky and Vidal (2018).
Moreover, the innovation process es(t) is the error of projecting y(t) onto the linear space spanned by the
products of the past values of y, u and the scheduling process μ, as defined in (12).
In order to solve problem P1, we can adapt realization theory of deterministic LPV-SSA representations.
To this end, in Section 5.2 we present an adaptation of the reduced basis Ho-Kalman algorithm from Cox et al. (2018).
Solution to problem P2 was developed in Petreczky and Vidal (2018), and a realization algorithm was formulated in
Mejari and Petreczky (2019a). The latter algorithm will be recalled in Section 5.3 which is also based on the reduced basis Ho-Kalman algorithm (Cox et al., 2018).
The combination of realization algorithm from Sections 5.2–5.3 yields a realization algorithm which can
easily be converted into a system identification algorithm.
The resulting identification algorithm will first estimate an LPV-SSA representation of yd, noise process of which is the input u, and then it
will estimate a stationary LPV-SSA representation of ys in forward innovation form.
The identification algorithm outlined above will be presented in Section 6.
5 Realization algorithms
In this section, we first recall the basis reduced Ho-Kalman realization algorithm for deterministic LPV state-space representations. In turn, this algorithm will be used for covariance realization algorithms for estimating LPV-SSA representations
of yd, ys, presented in Section 5.2–5.3.
5.1 Basis reduced Ho-Kalman realization algorithm
Recall from Petreczky et al. (2017); Cox et al. (2018) that a deterministic LPV-SSA representation (with affine dependence) is a system of the form
[TABLE]
where Ai,Bi,C,D are matrices of suitable dimensions,
x:Z→Rnx is the state trajectory
u:Z→Rnu is the input trajectory
y:Z→Rny is the output trajectory. In order to avoid technical problems,
we assume that x,u,y all have finite support, i.e. there exist a t0∈Z, such that
x(s)=0,y(s)=0,u(s)=0 for all s<t0. We identify a deterministic LPV-SSA of the form (13)
with the tuple S=({Aσ,Bσ}σ∈Σ,C,D). The number nx is called the dimension of S.
The sub-Markov parameters of S=({Aσ,Bσ}σ∈Σ,C,D) are the values of the map MS:Σ∗→Rny×nu, such
that for all w∈Σ∗,
[TABLE]
We will refer to MS as the sub-Markov function of the deterministic LPV-SSA representation of S.
From Petreczky et al. (2017) it then follows that two deterministic LPV-SSA representations S1, S2 have the same input-output behavior, if and only if
their sub-Markov parameters are equal, i.e., MS1=MS2. Moreover, the sub-Markov parameters can be
determined from the input-output behavior.
Below we recall from Cox et al. (2018) an adaptation of this Ho-Kalman-like algorithm, which uses sub-Markov parameters to
compute a deterministic LPV-SSA representation.
In order to present the algorithm, we present the notion of n-selection.
Let us define the set Σn as the set of all words w∈Σ∗ of length less than or equal to n, i.e., Σn={w∈Σ∗∣∣w∣≤n}.
Definition 7** (Selection)**
We define (n,ny,nu)-selection as a pair
(α,β) such that
α⊆Σn×{1,2,⋯,ny}* and β⊆Σ×Σn×{1,2,⋯,nu}*
2. 2.
card(α)=card(β)=n, where card denotes cardinality of the set.
When ny and nu are clear from the context, we refer to (n,ny,nu)-selections as n-selections,
and when n is also clear from the context, we use the term selection.
Consider n=2, number of outputs and inputs ny=nu==2, and scheduling signal dimension nμ=2, we have, Σn={ϵ,1,2,11,12,21,22}. Then, one of the n-selection pair (α,β) can be chosen as, for e.g.,
α={(u1,k1),(u2,k2)}={(ϵ,1),(11,2)} and β={(σ1,v1,l1),(σ2,v2,l2)}={(1,21,1),(2,22,2)}.
Let M:Σ∗→Rny×nu be a map, values of which represent potential sub-Markov parameters (14) of an LPV-SSA.
Let us now define the Hankel matrix Hα,βM∈Rn×n as follows:
i,j=1,…,n,
the (i,j)-th element of Hα,βM is of the form
[TABLE]
[M(σjvjui)]ki,lj denotes the entry of M(σjvjui) on the
ki-th row and lj-th column, and
(ui,ki)∈α,(σj,vj,lj)∈β are as in the ordering of (15). Intuitively, the rows of Hα,βM are indexed by word-index pairs (ui,ki)∈α, where ui∈Σn and ki∈{1,…,ny} and similarly, the columns of Hα,βM are indexed by word-index pairs (σjvj,lj)∈β, where σj∈Σ, vj∈Σn and lj∈{1,…,nu}, and
the element of Hα,βM with the row indexed (ui,ki) and column index (σj,vj,lj) is the (ki,lj)-th entry
of M(σjvjui).
In addition, we define the σ-shifted Hankel-matrix
Hσ,α,βM∈Rn×n as follows: its i,j-th entry is given by
[TABLE]
Moreover, let us define Hankel matrices Hα,σM∈Rn×nu and HβM∈Rny×n as follows
[TABLE]
Consider the model matrix computations summarized in Algorithm 1, using Hankel matrices and selections.
Let the (n,ny,nu)-selection (α,β) be such that rank(Hα,βM)=n, and assume that there exists a deterministic LPV-SSA representation S∗ of dimension n
such that M=MS∗. Then the tuple S^=({A^σ,B^σ}σ∈Σ,C^,D^), returned by Algorithm 1, when applied to
the matrices Hα,βM, Hσ,α,βM, Hα,σM, HβM ((16)-(19)) and M(ϵ), is a minimal dimensional deterministic LPV-SSA representation
such that MS^=M, i.e.
M(σw)=C^A^wB^σ for all w∈Σ∗.
5.2 Correlation analysis: finding an LPV-SSA representation of yd
In this section, we describe an adaptation of the correlation
analysis (CRA) method (Cox et al., 2015, 2018) for finding a stationary LPV-SSA representation of yd with noise process
u.
Let us define the map Ψu,y:Σ∗→Rny×nu as follows
It turns out that if y has a stationary LPV-SSA representation with input u, then Ψu,y is the sub-Markov function of a deterministic LPV-SSA representation.
Lemma 4
Assume that y has a realization by a stationary LPV-SSA representation with input u.
Assume that ({Aσ,Bσ},C,D,x,u) is a stationary LPV-SSA representation (without inputs, Definition 4) of yd. Then
Ψu,y in (20) equals the sub-Markov function MS (14) of the deterministic LPV-SSA representation
S=({Aσ,Bσ}σ∈Σ,C,D).
Conversely, if
S^=({A^σ,B^σ}σ∈Σ,C^,D^) is a deterministic LPV-SSA representation
such that its sub-Markov function MS^ equals Ψu,y and it is minimal dimensional among such deterministic LPV-SSA representations, then
({A^σ,B^σ},C^,D^,x^,u) is a stationary LPV-SSA representation (without inputs) of yd.
The proof of Lemma 4 is presented in (Mejari and Petreczky, 2019b, Appendix A.3).
Hence, we can adapt the basis reduced Ho-Kalman realization algorithm as described in Algorithm 2.
It is clear from Lemma 4 and Lemma 3 that Algorithm 2 is correct.
Corollary 1
If yd has a stationary LPV-SSA representation with no inputs, with noise process u, with dimension
n and rankHα,βΨu,y=n, then Algorithm 2 returns matrices
({A^σ,B^σ}σ∈Σ,C^,D^) such that
({A^σ,B^σ}σ∈Σ,C^,D^,x^,u) is a stationary
LPV-SSA representation of yd without inputs, with noise process u.
5.3 Covariance realization algorithm
In this section, we adapt the realization algorithm from Mejari and Petreczky (2019a) to estimate the stochastic part (1) of a LPV-SSA representation.
Define the covariance sequenceΨys:Σ∗→Rny×ny, where Ψys(ϵ)=Iny, and for all w∈Σ+,
[TABLE]
If ys has a stationary LPV-SSA representation, then Ψys is a sub-Markov function of a
suitable deterministic LPV-SSA representation, Petreczky and Vidal (2018); Mejari and Petreczky (2019a).
Conversely, from a deterministic LPV-SSA representation, sub-Markov function of which equals Ψys a
stationary LPV-SSA representation can be computed.
Lemma 5
If S=({A^σ,G^σ}σ∈Σ,C^,Iny)
is a minimal dimensional deterministic LPV-SSA representation such that MS=Ψys, then
({A^σs,K^σ}σ∈Σ,C^,Iny,x^,es)
is a stationary LPV-SSA representation of ys in forward innovation form, where
A^σs=pσ1A^σ, C^s=C^σ,
K^σ=limi→∞K^σi, and
{K^σi}σ∈Σ,i∈N satisfies the following recursion
[TABLE]
with P^σ0=0.
Moreover,
E[es(t)(es(t))Tμσ2(t)]=Q^σ=limi→∞Q^σi,
E[x^(t)x^T(t)μσ2]=P^σ=limi→∞P^σi for all σ∈Σ.
The proof of Lemma 5 can be found in Petreczky and Vidal (2018); Mejari and Petreczky (2019a), (Mejari and Petreczky, 2019b, Appendix A.4).
From Lemma 5, it follows that we can use the basis reduced Kalman-Ho realization algorithm
Algorithm 2, as described in Algorithm 3, in order to compute LPV-SSA representation of ys .
It is clear from Lemma 5 and Lemma 3 that Algorithm 3 is correct.
Corollary 2
If ys has a stationary LPV-SSA representation with no inputs, with dimension
n and rankHαˉ,βˉΨu,y=n, then Algorithm 3 returns matrices
({A^σs,G^σ,K^σI,Q^σI,P^σI}σ∈Σ,C^s) such that with K^σ=limI→∞K^σI, Q^σ=limI→∞Q^σI,
P^σ=limI→∞P^σI;
tuple ({A^σs,K^σ}σ∈Σ,C^s,Iny,x^,es) is a stationary
LPV-SSA representation of ys without inputs, and Q^σ=E[es(t)(es(t))Tμσ2(t)],
P^σ=E[x^(t)x^T(t)μσ2(t)], σ∈Σ.
6 Identification algorithm
In this section, we formulate an identification algorithm based on stochastic realization Algorithms 2–3 and selections, for N-length observation sequence of outputs, inputs and scheduling signals, as detailed in Algorithm 4.
Intuitively, the main idea behind Algorithm 4 is to estimate the covariances Ψu,y, Ψys and E[zσy(t)(zσy(t))T]
from the observed data and then apply Algorithms 2–3 to the thus estimated covariances.
More specifically, the following assumptions are made:
Assumption 3
(1)
The nx-selection pair (α,β) and (αˉ,βˉ) are such that
rankHα,βΨu,y=nx,
rankHαˉ,βˉΨys=nx, where nx is the state-space dimension of a minimal LPV-SSA realization of y.
(2)
The process (y,u,{μw}w∈Σ+) is ergodic and there exist sample paths
y:Z→Rny, u:Z→Rnu and μ:Z→Rnμ of the processes y, u and μ respectively such that
{y(t),u(t),{μσ(t)}σ∈Σ}t=1N is observed and the following holds:
for all w∈Σ∗,σ∈Σ,
[TABLE]
Then for all w∈Σ∗, σ∈Σ,
[TABLE]
where,
for all w=σ1σ2⋯σr∈Σ+, r>0, we have,
[TABLE]
Lemma 6** (Consistency)**
With the Assumption 3
the result of Algorithm 4 satisfies the following:
[TABLE]
and ({A~σ,B~σ,K~σ,}σ=1nμ,C~,D~,x^,es) is a stationary LPV-SSA representation of (y,u,μ),
and E[es(t)(es(t))Tμσ2(t)]=limI→∞limN→∞Q~σN,I,
σ∈Σ.
The proof sketch of Lemma 6 is presented in (Mejari and Petreczky, 2019a, Theorem 3), Mejari and Petreczky (2019b).
It can be shown that
Ψys(σw)=E[y(t)(zσwy(t))T]−E[yd(t)(zσwyd(t))T] and
E[zσys(t)(zσys(t))T]=E[zσy(t)(zσy(t))T]−E[zσyd(t)(zσyd(t))T], see Mejari and Petreczky (2019b).
Moreover, if ({A~σd,B~σd}σ∈Σ,C~d,D~d,x^d,u) is a stationary LPV-SSA representation of yd without inputs, then from Petreczky and Vidal (2018) it follows that ΛS(σw)=E[yd(t)(zσwyd(t))T] and
Tσ,σ,S=E[zσyd(t)(zσyd(t))T].
Intuitively, since Ψu,yN(w) converges to Ψu,y(w) as N→∞, ({A~σd,B~σd}σ∈Σ,C~d,D~d,x^d,u) becomes a LPV-SSA representation of yd as N→∞, and hence the right-hand side of the first and third equation of (23) converges
to E[y(t)(zσwy(t))T]−E[yd(t)(zσwyd(t))T] and
E[zσy(t)(zσy(t))T]−E[zσyd(t)(zσyd(t))T] respectively.
Remark 2** (Alternative way of computing ΨysN)**
An alternative way of estimating the covariances Ψys and E[zσys(t)(zσys(t))T]}σ∈Σ is to use
the matrices S=({A~σd,B~σd}σ∈Σ,C~d,D~d) to approximate
the sample paths yd, ys of yd and ys by
y^d(t)=D~du(t)+∑v∈Σ∗,σ∈Σ,∣v∣<t−1C~dA~vdB~σdzσvu(t), and
y^s(t)=y(t)−y^d(t) and define
[TABLE]
where zvy^s(t)=y^s(t−∣v∣)μv(t−1)pv1 for all v∈Σ+.
We can then view
ΨysN(w) as an approximation of Ψys(w), and Tσ,σN is an approximation of E[zσys(t)(zσys(t))T]}σ∈Σ.
We could modify Algorithm 4 by replacing (23) with (25). We conjecture that Lemma 6
will remain true for the modified algorithm.
7 Numerical example
In this section, we present a numerical example to test the effectiveness of our algorithm. All computations are carried out on an i5 1.8-GHz Intel core processor with 8 GB of RAM running MATLAB R2018a.
The quality of the match between estimated and true outputs is quantified on a noise-free validation data of length Nval via Best Fit Rate (BFR) and Variance Accounted For (VAF) criterion defined for each output channel yi, i=1,…,ny, as
[TABLE]
where y^i denotes the simulated one-step ahead model output and yˉi denotes the sample mean of the output over the validation set.
The LPV-SSA representation in form (2) is used for data generation with following matrices:
[TABLE]
which corresponds to state-dimension nx=3, output dimension ny=1, and scheduling signal dimension nμ=2 with Σ={1,2}.
Note that, the system corresponding to first local model A~1=A1−K1C is not observable, i.e., rank([CT(CA~1)T…(CA~1l−1)T]T)=2<nx, which is a particular assumption required in subspace based approaches (van Wingerden and Verhaegen, 2009).
Training and noise free validation output sequences of length N=100000 and Nval=100000, respectively, are generated using a white-noise input process u with uniform distribution U(−1.5,1.5) and an independent scheduling signal process μ=[μ1μ2] such that μ1(t)=1 and μ2(t) is a white-noise process with uniform distribution U(−1.5,1.5). This corresponds to the parameter values {pσ}σ∈{1,2} to be p1=E[μ12(t)]=1 and p2=E[μ22(t)]=0.75. The standard deviation of the white Gaussian noise e corrupting the training output is 1, i.e., e∼N(0,1). This corresponds to the Signal-to-Noise RatioSNR=10log∑t=1Ne2(t)∑t=1N(y(t)−e(t))2=4.7dB.
We run the version of Algorithm 4 explained in Remark 2, with I=50 iterations and with the following n-selection pairs (α,β) and (αˉ,βˉ), with n=3,
[TABLE]
which are used to choose corresponding entries of the Hankel matrices.
The mean time taken to run the algorithm is 1.55 sec.
The validation result using one-step ahead predicted outputs y^ are reported in Table 1, and true vs estimated sub-Markov parameters are reported in Table 2. The results show a good match between estimated model output w.r.t. true system output.
8 Conclusion
In this paper, we formulated a realization algorithm and an efficient identification algorithm for stochastic LPV-SSA representations with inputs, by combining correlation analysis method with a stochastic realization based identification algorithm. The proposed algorithm provides a computationally efficient alternative to the parametric subspace approaches avoiding the curse of dimensionality.
Recall from Notation 3 the definition of the Hilbert-space H1 of zero mean square integrable random variables.
Let us denote by Ht,+u, the closed subspace of H1 generated by the components of {zwu(t)}w∈Σ+∪{u(t)}.
Lemma 7
With the assumptions and notations of Lemma 1,
v(t)μσ(t) for all σ∈Σ, is orthogonal to Ht,+u.
{pf}
[Proof of Lemma 7]
Since r(t):=[vT(t)uT(t)]T is a white noise process w.r.t. μ, it follows that
E[r(t+1)(zwr(t+1))T]=0 for all w∈Σ+.
In particular, as zσr(t+1)=pσ1r(t)μσ(t), E[r(t+1)(r(t)μσ(t))T]=0 and as E[v(t)(u(t+1))Tμσ(t)] is the transpose of the lower left block of E[r(t+1)(r(t)μσ(t))T], it follows that E[v(t)(u(t+1))Tμσ(t)]=0.
Since r is ZMWSSI, it follows that for all w∈Σ+, σ1,σ∈Σ
[TABLE]
Since r is a white noise process w.r.t. μ, it follows that E[r(t)zwr(t)]=0 for all w∈Σ+.
Since E[v(t)μσ(t)(zwσ1u(t+1))T] is the upper right block of E[r(t)μσ(t)(zwσ1r(t+1))T],
it follows that E[v(t)μσ(t)(zwσ1u(t+1))T]=0 for all w∈Σ+, σ∈Σ.
That is, we have shown that E[v(t)μσ(t)(u(t+1))T]=0, E[v(t)μσ(t)(zwσ1u(t+1))T]=0 for all w∈Σ+, σ,σ1∈Σ. It is left to show that E[v(t)μσ(t)(zσ1u(t+1))T]=0. Note that E[v(t)μσ(t)(zσ1u(t+1))T] is the
upper right block of pσE[zσr(t+1)(zwσ1r(t+1))T], and the latter equals zero if σ1=σ.
Hence, for σ=σ1, E[v(t)μσ(t)(zσ1u(t+1))T]=0. If σ=σ1, then E[v(t)μσ(t)(zσu(t+1))T]=pσ1E[v(t)uT(t)μσ2(t)], and
from Definition 5 it follows that E[v(t)uT(t)μσ2(t)]=0, σ∈Σ. That is, E[v(t)μσ(t)(zσu(t+1))T]=0.
To summarize, we have shown that E[v(t)(u(t+1))Tμσ(t)]=0, E[v(t)μσ(t)(zwσ1u(t+1))T]=0 for all w∈Σ∗, σ,σ1∈Σ.
Since u(t+1), zwσ1u(t+1), w∈Σ∗, σ1∈Σ generate Ht+1,+u, the statement of the lemma follows.
■
Let us denote by Htu, the closed subspace generated by the components of {zwu(t)}w∈Σ+.
It is clear that Htu⊆Ht,+u.
Lemma 8
For any w∈Σ+, the components of zwv(t) are orthogonal to Ht+k,+u, k≥0.
{pf}
[Proof of Lemma 8]
Let us consider the case k=0.
Since r(t):=[v(t)Tu(t)T]T is a ZMWSII process , from (Petreczky and Vidal, 2018, Lemma 7) it follows that
E[zwr(t)(zvr(t))T]=0 for all v∈Σ+, v=w, and
if v=w and σ is the first letter of w, then E[zwr(t)(zwr(t))T]=E[zσr(t)(zσr(t))T].
Since E[zwv(t)(zvu(t))T] is the upper right block of E[zwr(t)(zvr(t))T], it follows that
E[zwv(t)(zvu(t))T]=0 if v=w and E[zwv(t)(zwu(t))T]=E[zσv(t)(zσu(t))T]=pσ1E[u(t−1)v(t−1)μσ2(t−1)], and from Definition 5, it follows that the latter is zero. That is, E[zwv(t)(zvu(t))T]=0 for all v∈Σ+.
Finally, as r(t):=[v(t)Tu(t)T]T is a
white noise process w.r.t. μ, it follows that E[zwr(t)(r(t))T]=0, and since E[zwv(t)(u(t))T] is the upper right block of E[zwr(t)(r(t))T], it then follows that E[zwv(t)(u(t))T]=0.
Since zwv(t) is orthogonal to the components of the random variables which generate Ht,+u, the statement of the lemma follows for k=0.
Consider now the case k>0. As μ1=1 and p1=1 it follows that zwv(t)=zw1⋯1kv(t+k) (where 1⋯1k denotes k-lenght word of 1s), and zw1⋯1kv(s)
is orthogonal to Hs,+u, according to the case k=0. By taking s=t+k for k>0, the statement of the lemma follows. ■
Lemma 9
The components of xd(t) belong to Htu and
[TABLE]
where the right-hand side of (27) converges in the mean square sense.
{pf}
[Proof of Lemma 9]
It is clear from the definition that the components of xd(t) belong to Ht,+u. Since
x(t)=∑w∈Σ∗,σ∈ΣpσwAw(Kσzσwv(t)+Bσzσwu(t))
and the fact that the map z↦El[z∣M] (where z∈H1) is a continuous linear operator for any closed subspace M, it follows that
[TABLE]
From Lemma 8 it follows that, El[zσwv(t)∣Ht,+u]=0, and since the components of zσwu(t) belong to Ht,+u, it follows that El[zσwu(t)∣Ht,+u]=zσwu(t).
Hence,
[TABLE]
Since the components of zσwu(t) belong to Htu, it follows that the components of the right-hand side of
(28) belongs to Htu and hence the components of xd(t) belong to Htu.
Note that, the convergence of the right-hand side of (28) in the mean square sense follows from the convergence of the series ∑w∈Σ∗,σ∈ΣpσwAw(Kσzσwv(t)+Bσzσwu(t)).
■
Lemma 10
The components of xs(t) belong to Htv, they are orthogonal to Ht+k,+u for any k≥0 and
[TABLE]
where the right-hand side converges in the mean-square sense.
From (27), xs(t)=x(t)−xd(t)
and
x(t)=∑w∈Σ∗,σ∈ΣpσwAw(Kσzσwv(t)+Bσzσwu(t)), it follows that (29) holds and that its right-hand side converges in the mean square sense. From Lemma 8, it follows that for any w∈Σ+, the components of zwv(t) are orthogonal to Ht+k,+u, hence
all the summands of the infinite series of (29) are orthogonal to Ht+k,+u. ■
Finally, we now state the proof of Lemma 1 (Decomposition of y).
{pf}[Proof of Lemma 1]
It follows that,
[TABLE]
Note that, u(t)μσ(t)=pσzσu(t+1), hence the components of u(t)μσ(t) belong to Ht+1,+u and
therefore
[TABLE]
We claim that,
[TABLE]
Since x(t)=xd(t)+xs(t), it follows that x(t)μσ(t)=xd(t)μσ(t)+xs(t)μσ(t).
From (Petreczky and Vidal, 2018, Lemma 9)
and Lemma 9–10, it follows that
[TABLE]
From Lemma 8, it follows that zσ′wσv(t+1) is orthogonal to Ht+1,+u for all w∈Σ+,
σ′,σ∈Σ, and hence xs(t)μσ(t) is also orthogonal to Ht+1,+u. Moreover, since the components of
zσ′wσu(t+1) belong to Ht+1,+u, it follows that xd(t)μσ(t) belongs to Ht+1,+u.
Hence,
[TABLE]
Finally, from Lemma 8, it follows that v(t)μσ(t)=pσzσv(t+1) is orthogonal to Ht+1,+u, and hence
As to the second equation of (1), notice that from Definition 6,
[TABLE]
Since the components of u(t) are among the generators of Ht,+u, and by Lemma 7, v(t)=v(t)μ1(t) is orthogonal to
Ht,+u, it follows that El[v(t)∣Ht,+u]=0 and El[u(t)∣Ht,+u]=u(t).
It then follows that the second equation of (1) holds.
From ys(t)=y(t)−yd(t), xs(t)=x(t)−xd(t) and (1), (1) follows.
It is left to show that ({Aσ,Bσ}σ∈Σ,C,D,xd,u) and ({Aσ,Kσ}σ∈Σ,C,Iny,xs,v)
are stationary LPV-SSA representations without inputs as per Definition 4.
Since ∑σ∈ΣpσAσ⊗Aσ is stable and u and v are both white noise processes w.r.t. μ, the
only thing which needs to be shown is that [(xd)TuT]T and [(xs)TvT]T
are ZMWSII. However, the latter follows from (27), (29) and (Petreczky and Vidal, 2018, Lemma 3). ■
We show that ({A^σ,B^σ,K^σ}σ∈Σ,C^,D^,x^,es) is a stationary LPV-SSA representation of y with input u, where, ({A^σ,B^σ,K^σ}σ∈Σ,C^,D^,x^,es) are as defined in (11)-(12).
To this end, assume that ({Aσ,Bσ,Kσ}σ∈Σ,C,D,x,v) is a stationary LPV-SSA representation of y with input u. 111By Assumption of Lemma 2, such a LPV-SSA representation exists.
Recall from Notation 3 that H1 denotes the Hilbert-space of zero mean square integrable random variables.
Denote by Ht,+v the closed-subspace of the Hilbert-space H1
generated by the components of {zwv}w∈Σ+∪{v(t)}, and denote by
Htv the Hilbert-space generated by {zwv}w∈Σ+.
We prove the following lemmas.
Lemma 11
Assume that ({Aσ,Bσ,Kσ}σ∈Σ,C,D,x,v) is a stationary LPV-SSA representation of y with input u The components of ys(t),zvys(t), es(t), zves(t), v∈Σ+ belong to Ht,+v.
That is, the components of ys(t) belong to Ht,+v.
In particular, from (Petreczky and Vidal, 2018, Lemma 11), it follows that the coordinates of
zwys(t) belong to Ht,+v and hence, Htys⊆Htv.
Since es(t)=ys(t)−El[ys(t)∣Htys], it follows that the components of
es(t) belong to Ht,+v. Since zvv(t)=zv1v(t+1), v(t)=z1v(t+1), it follows that
Ht,+v⊆Ht+1v and from (Petreczky and Vidal, 2018, Lemma 11) it follows that
the components of zves(t) belong to Htv⊆Ht,+v.
Lemma 12
If y has a realization by a stationary LPV-SSA representation with input u, then the components of
ys(t),zvys(t),es(t),zves(t), v∈Σ+ are orthogonal to Ht,+u,
i.e., for all v,w∈Σ+
[TABLE]
{pf}
[Lemma 12]
From Lemma 7–8 and by noticing that v(t)=v(t)μ1(t) it follows that the elements of Ht,+v are orthogonal to Ht,+u.
Hence, the coordinates of u(t), zwu(t), w∈Σ+ are orthogonal to Ht,+v.
Since the coordinates of ys(t),zvys(t),es(t),zves(t) belong to Ht,+v, it follows
that the coordinates of
ys(t),zvys(t),es(t),zves(t)
are orthogonal to Ht,+u.
Since Ht,+u is generated by the coordinates of u(t), zwu(t), w∈Σ+,
(32) follows.
Lemma 13
[(es)TuT]T* is a white noise process w.r.t. μ and E[es(t)uT(t)μσ2(t)]=0 for all σ∈Σ.*
{pf}
[Proof of Lemma 13]
In order to prove the statement of the lemma, we will first show that r(t)=[(es)TuT]T is a ZMWSII, by showing that r satisfies the conditions of Definition 1 one by one.
First, we show that the processes r(t),zwr(t),w∈Σ+ is zero mean, square integrable.
Note that u is a white noise process w.r.t. μ, in particular, it is a ZWMSII process and hence u(t),zwu(t),w∈Σ+ are zero mean, square integrable.
From the fact that Σs is a stationary LPV-SSA representation of ys it follows that es is also
a white noise process w.r.t. μ, in particular, it is also ZWMSII and thus es(t),zwes(t),w∈Σ+ is zero mean, square integrable. From this it follows that r(t)=[(es)TuT]T and
zwr(t)=[(zwes(t))T(zwu(t))T]T are zero mean and square integrable.
From Lemma 11 it follows that es(t) belongs to Ht,+v(t), where v is a noise process of a stationary LPV-SSA representation of
y with input u. From the definition of a stationary LPV-SSA representation it then follows that [vTuT]T is ZMWSII.
Hence, with the notation of
Definition 1, the σ-algebras Ft[vTuT]T and Ftμ,+ are conditionally independent w.r.t. Ftμ,−.
From the fact that es(t) belongs to Ht,+v(t) it follows that es(t) is measurable with respect to the σ-algebra generated by
{v(t)}∪{zvv(t)∣v∈Σ+} and the latter σ-algebra is a subset of Ft[vTuT]T∨Ftμ,−,
where for two σ-algebras Fi, i=1,2, F1∨F2 denotes the smallest σ-algebra generated by the σ-algebras F1,F2.
That is, es(t) is measurable w.r.t. the σ algebra Ft[vTuT]T∨Ftμ,−Ft[(es)TuT]T⊆Ft[vTuT]T∨Ftμ,−. Since Ft[vTuT]T and Fμ,+ are conditionally independent w.r.t. Ftμ,−, from (van Putten and van Schuppen, 1985, Proposition 2.4) it follows that
Ft[vTuT]T∨Ftμ,− and Ftμ,+ are conditionally independent w.r.t.
Ftμ,−, and as Ft[(es)TuT]T⊆Ft[vTuT]T∨Ftμ,−, it follows that Ft[(es)TuT]T and Fμ,+ are conditionally independent w.r.t. Ftμ,−.
Finally, from (32) it follows that r(t),zwr(t),w∈Σ+, r(t)=[(es)TuT]T
are jointly wide-sense stationary, i.e., for all s,t∈Z, s≤t, v,w∈Σ+,
[TABLE]
Indeed, from (32) and the fact that es, u are ZMWSII and hence the processes es(t),zwes(t),w∈Σ+ are jointly wide-sense stationary
and the processes u(t),zwu(t),w∈Σ+ are also jointly wide-sense stationary,
it follows that
[TABLE]
Above, we used the fact that if s<t, then u(s+k)=zhu(t+k+1), es(s+k)=zhes(t+k+1), u(s)=zhu(t+1), es(s)=zhes(t+1),
zwu(s+k)=zwhu(t+k), zves(s+k)=zvhes(t+k), where h=t−s1⋯1.
That is, we have shown that r(t)=[(es)TuT]T satisfies all the conditions of Definition 1.
Next we show that r(t) is a white noise process w.r.t. μ, i.e., E[r(t)(zwr(t))T]=0 for all w∈Σ+. From
(32) it follows that
[TABLE]
Notice es is a white noise process w.r.t. μ, since Σs=({A^is,K^is}i=1nμ,C^s,Iny,x^s,es) is a stationary
LPV-SSA representation of ys without inputs, and hence E[es(t)(zwes(t)]=0.
Furthermore, u is a white noise process w.r.t. μ by assumption, so E[u(t)(zwu(t)]=0. Hence, E[r(t)(zwr(t))T]=0.
It is left to show that E[es(t)uT(t)μσ2(t)]=0 for all σ∈Σ. Notice that E[es(t)uT(t)μσ2(t)]=E[zσes(t+1)(zσu(t+1))T]pσ by definition, and from (32) it follows that E[zσes(t+1)(zσu(t+1))T]=0.
Lemma 14
∑σ∈ΣpσA^σ⊗A^σ* is stable, where A^σ=diag(A^σd,A^σs).*
{pf}
[Proof of Lemma 14]
From (Costa et al., 2005, Proposition 2.6), it follows that ∑σ∈ΣpσA^σd⊗A^σd and ∑σ∈ΣpσA^σs⊗A^σs, are stable if ∃Qd,Qs>0 such that,
[TABLE]
It then follows that,
[TABLE]
[TABLE]
From the corollary, (Costa et al., 2005, Proposition 2.6), it follows that, ∑σ∈ΣpσA^σ⊗A^σ is stable, with A^σ=diag(A^σd,A^σs). ■
Lemma 15
The processes x^,u,e satisfy all the conditions for noise and state processes of a stationary LPV-SSA representation with no inputs, i.e.,
[x^TuT(es)T]T
is a ZMWSII,
[(es)TuT]T is a white noise process w.r.t. μ, and
E[x^(t)(zw[ues](t))T]=0 and E[zσx^(t)(zσ[ues](t))T]=0.
{pf}
[Proof of Lemma 15]
It follows from Lemma 13 that [(es)TuT]T is a white noise process w.r.t. μ.
From (Petreczky and Vidal, 2018, Lemma 2), it follows that
[TABLE]
and
[TABLE]
Thus,
[TABLE]
From Lemma 14, it follows that ∑σ∈ΣpσA^σ⊗A^σ is stable.
That is, the noise process [(es)TuT]T and the matrices {A^σ}σ∈Σ satisfy the conditions of a stationary
LPV-SSA representation without inputs. From (Petreczky and Vidal, 2018, Lemma 3) it then follows that x^ is the unique process such that
[uT(es)T]T satisfies all the conditions of a stationary LPV-SSA representation.
{pf}[Proof of Lemma 2]
It is clear from Lemmas 15, 13, 14 that
{Aσ}σ∈Σ satisfies the conditions of the definition of a stationary LPV-SSA representation without inputs. From Lemma 13, it follows that x^ and [(es)TuT]T satisfies the conditions of an LPV-SSA representation without inputs.
From Lemmas 15 it follows that the noise process es and the input u satisfy the condition of E[es(t)(u(t))Tμσ2(t)]=0, σ∈Σ.
Hence, ({A^σ,B^σ,K^σ}σ∈Σ,C^,D^,x^,es) is a stationary LPV-SSA representation of y with input u.
**If ({Aσ,Bσ}σ∈Σ,C,D,x,u) is a stationary LPV-SSA representation without inputs of yd⟹Ψu,y=MS, where S=({Aσ,Bσ}σ∈Σ,C,D). **
Recall that,
[TABLE]
Consider,
[TABLE]
This follows, as u is white noise process, E[u(t)(zwu(t))T]=0 and E[zσsu(t)(zwu(t))T]=Λu if σs=w, otherwise E[zσsu(t)(zwu(t))T]=0.
Similarly, E[yd(t)u(t)]=DΛu.
Finally, we recall that y(t)=yd(t)+ys(t).
From Lemma 12 it follows that
that the he components of ys(t) is orthogonal to Ht,+u, that is,
E[ys(t)(zwu(t))T]=0 and E[ys(t)(u(t))T]=0.
Thus, we have, E[y(t)(zwu(t))T]=E[yd(t)(zwu(t))T], E[y(t)(u(t))T]=E[yd(t)(u(t))T] and the statement of Lemma follows.
S^=({A^σ,B^σ}σ∈Σ,C^,D^)** is a minimal dimensional determinstic LPV-SSA representation such that MS^=Ψu,y⟹({A^σ,B^σ},C^,D^,x^,u) is a stationary LPV-SSA representation (without inputs) of yd**.
Consider the formal power series (Petreczky and Vidal, 2018, Appendix C)
of Ψ(w)=[MS^(1w)p1wMS^(nμw)pnμw].
Let S~=({A~σ,B~σ}σ∈Σ,C~,D~) be any deterministic LPV-SSA representation such that MS~.
Consider the recognizable representation in the sense of (Petreczky and Vidal, 2018, Appendix C) defined as
RS~=({pσA~σ}σ∈Σ,B~,C~), B~=[p1B~1⋯pnμB~nμ]. We claim that RS~ is a recognizable representation of Ψ (see
(Petreczky and Vidal, 2018, Appendix C) for the definition of a recognizable representation of a formal power series),
and if S~ is a minimal dimensional deterministic LPV-SSA representation such that
MS~=Ψu,y, then RS~ is a minimal dimensional representation of Ψ.
For the definition of the dimension of a recognizable representation, see (Petreczky and Vidal, 2018, Appendix C).
We call RS~the recognizable representation associated with the deterministic LPV-SSA representation
S.
Indeed, MS~(σw)pσw=pwC~A~wB~σpσ, hence Ψ(w)=C~FwB~, Fσ=pσA~σ,
which by definition (Petreczky and Vidal, 2018, Appendix C)
means that R is a representation of Ψ.
In order to show that RS~ is a minimal representation of Ψ, if S~ is a
minimal dimensional deterministic LPV-SSA representation such that
MS~=Ψu,y, we proceed as follows.
Consider a recognizable representation Ro=({Fˉσ}σ∈Σ,Gˉ,Cˉ) of Ψ.
Define the deterministic LPV-SSA representation
Sˉ=({{Aˉσ,Bˉσ}σ∈Σ,Cˉ,D), where Aˉσ=pσ1Fˉσ, and Gˉ=[p1B~1⋯pnμB~nμ]. Then, since Ψ(w)=CˉFˉwGˉ, w∈Σ∗, it follows that
MS~(σw)=pwpσ1CˉFˉw(pσB~σ)=CˉAˉwBˉσ, i.e., MSˉ=MS~=Ψu,y. By the assumption,
S~ is a minimal dimensional determinsitic LPV-SSA representation such that
MS~=Ψu,y, hence the dimension of Sˉ should not be smaller than that of S~. However, the dimension of S~ equals the dimension of the representation RS~, and the
dimension of Sˉ equals the dimension of Ro. That is, the dimension of any representation of Ψ cannot be smaller than the dimension of RS~, i.e., RS~ is minimal.
Since it is assumed that y has a stationary LPV-SSA representation with input u, from Lemma 1
it follows that there exists a stationary LPV-SSA representation
({Aσ,Bσ},C,D,xd,u) of yd without inputs. Then by the first part of
this lemma, the deterministic LPV-SSA representation S=({Aσ,Bσ},C,D) is such that
MS=Ψu,y. Since by definition of a stationary LPV-SSA representation without inputs,
∑σ∈ΣpσAσ⊗Aσ=∑σ∈Σ(pσAσ)⊗(pσAσ) is stable, then
using the terminology of (Petreczky and Vidal, 2018, Appendix C),
the representation RS associated with S
is a stable representation. Recall from (Petreczky and Vidal, 2018, Appendix C) that a
recognizable representation R=({Fσ}σ∈Σ,G,H) is stable, if all eigenvalues of the matrix
∑σ∈ΣFσ⊗Fσ are inside the open unit disk.
Since by the discussion above RS is a representation of Ψ,
by (Petreczky and Vidal, 2018, Theorem 6) Ψ is square summable (see (Petreczky and Vidal, 2018, Appendix C) for the definition of a square summable formal power series).
Consider now the minimal deterministic LPV-SSA representation S^ from the statement of the lemma.
From the discussion above it then follows that the associated recognizable representation RS^=({pσA^σ}σ∈Σ,B,C)
is a minimal representation of Ψ. Since Ψ is square summable and RS^ is minimal,
from (Petreczky and Vidal, 2018, Theorem 6) it follows that RS^ is stable, which means that all
the eigenvalues of
∑σ∈ΣpσA^σ⊗A^σ are inside the open unit disk.
Notice that u is a white noise process w.r.t. μ by assumption. Then it follows from
(Petreczky and Vidal, 2018, Lemma 3) that with
x^(t)=∑w∈Σ∗,σ∈ΣpσwA^wB^σzσwu(t)({A^σ,B^σ},C^,D^,x^,u) is a stationary LPV-SSA representation.
It is left to show that ({A^σ,B^σ},C^,D^,x^,u) is
a representation of yd. Consider the representation stationary LPV-SSA representation
({Aσ,Bσ},C,D,xd,u) of yd without inputs from the discussion above.
It then follows from (Petreczky and Vidal, 2018, Lemma 1) that
[TABLE]
where we used that by (Petreczky and Vidal, 2018, Lemma 3)
x^(t)=∑w∈Σ∗,σ∈ΣpσwA^wB^σzσwu(t). This means that yd(t)=C^x^(t)+D^u(t), and hence ({A^σ,B^σ},C^,D^,x^,u) is a representation of yd.
Assume S=({A^σ,G^σ}σ∈Σ,C^,Iny) is LPV-SSA representation whose sub-Markov parameters are MS=Ψys.
Let Ψ(w)=[Ψys(1w)⋯Ψys(nμw)], ∀w∈Σ∗. Then, R=({A^σ,}σ∈Σ,G^,C^), with G^=[G^1⋯G^nμ], is a representation of Ψ and by (Petreczky et al., 2017, Theorem 1, Theorem 2) and the definition of
observability and reachability for representations in (Petreczky and Vidal, 2018, Appendix C), it follows that
R is observable and reachable, and hence R is minimal by Berstel and Reutenauer (1984); Sontag (1979).
Then, the statement follows from (Petreczky and Vidal, 2018, Theorem 5 and Lemma 21). ■.
Bibliography23
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1Bamieh and Giarré (2002) Bamieh, B.A. and Giarré, L. (2002). Identification of linear parameter-varying models. International Journal of Robust Nonlinear Control , 12(9), 841–853.
2Berstel and Reutenauer (1984) Berstel, J. and Reutenauer, C. (1984). Rational series and their languages . EATCS Monographs on Theoretical Computer Science. Springer-Verlag.
3Bilingsley (1986) Bilingsley, P. (1986). Probability and measure . Wiley.
4Costa et al. (2005) Costa, O.L.V., Fragoso, M.D., and Marques, R.P. (2005). Discrete-time Markov Jump Linear Systems . Springer.
5Cox et al. (2018) Cox, P., Petreczky, M., and Tóth, R. (2018). Towards efficient maximum likelihood estimation of LPV-SS models. Automatica , 97(9), 392–403.
6Cox et al. (2015) Cox, P., Tóth, R., and Petreczky, M. (2015). Estimation of LPV-SS models with static dependency using correlation analysis. In Proc. 1st IFAC Workshop on Linear Parameter Varying Systems , 91–96. Grenoble, France.
7dos Santos et al. (2009) dos Santos, P., Ramos, J., and de Carvalho, J. (2009). Identification of bilinear systems with white noise inputs: An iterative deterministic-stochastic subspace approach. IEEE Transactions on Control Systems Technology , 17(5), 1145–1153.
8Favoreel et al. (1999) Favoreel, W., De Moor, B., and Van Overschee, P. (1999). Subspace identification of bilinear systems subject to white inputs. IEEE Transactions on Automatic Control , 44(6), 1157–1165. 10.1109/9.769370 . · doi ↗