Realization and identification algorithm for stochastic LPV state-space models with exogenous inputs

Manas Mejari; Mihaly Petreczky

arXiv:1905.10113·cs.SY·May 27, 2019


TL;DR

This paper introduces a new realization and identification algorithm for stochastic LPV state-space models with exogenous inputs, combining correlation analysis and covariance realization for efficient and consistent estimation.

Contribution

It presents a novel algorithm that integrates deterministic LPV realization with stochastic covariance methods, improving model estimation accuracy and computational efficiency.

Findings

1. The algorithm is computationally efficient.

2. It estimates the LPV model matrices accurately from empirical data.

3. It is validated through a numerical case study.

Abstract

In this paper, we present a realization and an identification algorithm for stochastic Linear Parameter-Varying State-Space Affine (LPV-SSA) representations. The proposed realization algorithm combines the deterministic LPV input-output to LPV state-space realization scheme based on correlation analysis with a stochastic covariance realization algorithm. Based on this realization algorithm, a computationally efficient and statistically consistent identification algorithm is proposed to estimate the LPV model matrices, which are computed from the empirical covariance matrices of outputs, inputs and scheduling signal observations. The effectiveness of the proposed algorithm is shown via a numerical case study.

Tables

Table 1: BFR and VAF on noise-free validation data

Metric	Algorithm 4
BFR	93.56 %
VAF	99.58 %

Table 2: True vs. estimated sub-Markov parameters

Markov parameters	True value	Estimated value
$C A_{1} B_{1}$	0.80	0.7957
$C A_{1} B_{2}$	0.40	0.3914
$C A_{1}^{2} B_{1}$	0.32	0.3147
$C A_{1} A_{2} B_{2}$	0.16	0.1549
$C A_{1}^{3} B_{1}$	0.12	0.1093

Equations

$$\begin{aligned}\mathbf{x}(t+1)&=\sum_{i=1}^{n_{\mu}}\left(A_{i}\mathbf{x}(t)+B_{i}\mathbf{u}(t)+K_{i}\mathbf{v}(t)\right)\bm{\mu}_{i}(t),\\ \mathbf{y}(t)&=C\mathbf{x}(t)+D\mathbf{u}(t)+\mathbf{v}(t)\end{aligned}$$

$$\begin{aligned}\bm{\mu}_{w}(t)&=\bm{\mu}_{\sigma_{1}}(t-k+1)\bm{\mu}_{\sigma_{2}}(t-k+2)\cdots\bm{\mu}_{\sigma_{k}}(t),\quad\forall t\in\mathbb{Z},\\ p_{w}&=p_{\sigma_{1}}p_{\sigma_{2}}\cdots p_{\sigma_{k}}\end{aligned}$$

$$\mathbf{z}^{\mathbf{r}}_{w}(t)=\mathbf{r}(t-|w|)\bm{\mu}_{w}(t-1)\frac{1}{\sqrt{p_{w}}},\quad\forall t\in\mathbb{Z}$$

$$\begin{aligned}E[\mathbf{r}(t+k)(\mathbf{z}^{\mathbf{r}}_{w}(s+k))^{T}]&=E[\mathbf{r}(t)(\mathbf{z}^{\mathbf{r}}_{w}(s))^{T}],\\ E[\mathbf{r}(t+k)(\mathbf{r}(s+k))^{T}]&=E[\mathbf{r}(t)(\mathbf{r}(s))^{T}],\\ E[\mathbf{z}^{\mathbf{r}}_{w}(t+k)(\mathbf{z}^{\mathbf{r}}_{v}(s+k))^{T}]&=E[\mathbf{z}^{\mathbf{r}}_{w}(t)(\mathbf{z}^{\mathbf{r}}_{v}(s))^{T}]\end{aligned}$$

$$\tilde{\mathbf{x}}(t)=\sum_{\sigma\in\Sigma,\,w\in\Sigma^{*}}\sqrt{p_{\sigma w}}\,\tilde{A}_{w}\tilde{K}_{\sigma}\mathbf{z}^{\tilde{\mathbf{v}}}_{\sigma w}(t)$$

$$\mathbf{x}(t)=\sum_{w\in\Sigma^{*},\,\sigma\in\Sigma}\sqrt{p_{\sigma w}}\,A_{w}\left(K_{\sigma}\mathbf{z}^{\mathbf{v}}_{\sigma w}(t)+B_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)\right)$$

$$\mathbf{y}^{d}(t)=E_{l}[\mathbf{y}(t)\mid\{\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{u}(t)\}]$$

$$\mathbf{y}^{s}(t)=\mathbf{y}(t)-\mathbf{y}^{d}(t)$$

$$\mathbf{y}(t)=\mathbf{y}^{d}(t)+\mathbf{y}^{s}(t)$$

$$\begin{aligned}\mathbf{x}^{d}(t+1)&=\sum_{i=1}^{n_{\mu}}\left(A_{i}\mathbf{x}^{d}(t)+B_{i}\mathbf{u}(t)\right)\bm{\mu}_{i}(t),\\ \mathbf{y}^{d}(t)&=C\mathbf{x}^{d}(t)+D\mathbf{u}(t)\end{aligned}$$

$$\begin{aligned}\mathbf{x}^{s}(t+1)&=\sum_{i=1}^{n_{\mu}}\left(A_{i}\mathbf{x}^{s}(t)+K_{i}\mathbf{v}(t)\right)\bm{\mu}_{i}(t),\\ \mathbf{y}^{s}(t)&=C\mathbf{x}^{s}(t)+\mathbf{v}(t)\end{aligned}$$

$$\begin{aligned}\mathbf{x}^{d}(t)&=E_{l}[\mathbf{x}(t)\mid\{\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{u}(t)\}],\\ \mathbf{x}^{s}(t)&=\mathbf{x}(t)-\mathbf{x}^{d}(t)\end{aligned}$$

$$\mathbf{e}^{s}(t)=\mathbf{y}^{s}(t)-E_{l}[\mathbf{y}^{s}(t)\mid\{\mathbf{z}^{\mathbf{y}^{s}}_{w}(t)\}_{w\in\Sigma^{+}}]$$

$$\begin{aligned}\hat{\mathbf{x}}(t)&=\begin{bmatrix}(\hat{\mathbf{x}}^{d}(t))^{T}&(\hat{\mathbf{x}}^{s}(t))^{T}\end{bmatrix}^{T},\quad\hat{A}_{\sigma}=\mathrm{diag}(\hat{A}^{d}_{\sigma},\hat{A}^{s}_{\sigma}),\quad\hat{B}_{\sigma}=\begin{bmatrix}(\hat{B}^{d}_{\sigma})^{T}&0_{n_{x}\times n_{u}}^{T}\end{bmatrix}^{T},\\ \hat{K}_{\sigma}&=\begin{bmatrix}0_{n_{x}\times n_{y}}^{T}&(\hat{K}^{s}_{\sigma})^{T}\end{bmatrix}^{T},\quad\hat{C}=\begin{bmatrix}\hat{C}^{d}&\hat{C}^{s}\end{bmatrix},\quad\hat{D}=\hat{D}^{d}\end{aligned}$$

$$\mathbf{e}^{s}(t)=\mathbf{y}(t)-E_{l}[\mathbf{y}(t)\mid\{\mathbf{z}^{\mathbf{y}}_{w}(t),\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{u}(t)\}]$$

$$\begin{aligned}x(t+1)&=\sum_{i=1}^{n_{\mu}}\left(A_{i}x(t)+B_{i}u(t)\right)\mu_{i}(t),\\ y(t)&=Cx(t)+Du(t)\end{aligned}$$

$$M_{\mathscr{S}}(w)=\begin{cases}CA_{s}B_{\sigma},&w=\sigma s,\ \sigma\in\Sigma,\ s\in\Sigma^{*},\\ D,&w=\epsilon\end{cases}$$

$$\alpha=\{(u_{i},k_{i})\}_{i=1}^{n},\quad\beta=\{(\sigma_{j},v_{j},l_{j})\}_{j=1}^{n}$$

$$[H^{M}_{\alpha,\beta}]_{i,j}=[M(\sigma_{j}v_{j}u_{i})]_{k_{i},l_{j}}$$

$$[H^{M}_{\sigma,\alpha,\beta}]_{i,j}=[M(\sigma_{j}v_{j}\sigma u_{i})]_{k_{i},l_{j}}$$

$$[H^{M}_{\alpha,\sigma}]_{i,j}$$

$$[H^{M}_{\beta}]_{i,j}$$

$$\hat{A}_{\sigma}=(H^{M}_{\alpha,\beta})^{-1}H^{M}_{\sigma,\alpha,\beta}$$

$$\hat{B}_{\sigma}=(H^{M}_{\alpha,\beta})^{-1}H^{M}_{\alpha,\sigma},\quad\hat{C}=H^{M}_{\beta}$$

$$\Psi_{\mathbf{u},\mathbf{y}}(w)=\begin{cases}\frac{1}{\sqrt{p_{w}}}E[\mathbf{y}(t)(\mathbf{z}^{\mathbf{u}}_{w}(t))^{T}]\Lambda_{u}^{-1},&\forall w\in\Sigma^{+},\\ E[\mathbf{y}(t)\mathbf{u}^{T}(t)]\Lambda_{u}^{-1},&w=\epsilon\end{cases}$$

$$\Psi_{\mathbf{y}^{s}}(w)=E[\mathbf{y}^{s}(t)(\mathbf{z}^{\mathbf{y}^{s}}_{w}(t))^{T}]$$

$$\begin{aligned}\hat{P}^{i+1}_{\sigma}&=\sum_{\sigma_{1}\in\Sigma}p_{\sigma}\left(\hat{A}^{s}_{\sigma_{1}}\hat{P}^{i}_{\sigma_{1}}(\hat{A}^{s}_{\sigma_{1}})^{T}+\hat{K}_{\sigma_{1}}\hat{Q}^{i}_{\sigma_{1}}\hat{K}_{\sigma_{1}}^{T}\right),\\ \hat{Q}^{i}_{\sigma}&=p_{\sigma}E[\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t))^{T}]-\hat{C}^{s}\hat{P}^{i}_{\sigma}(\hat{C}^{s})^{T},\\ \hat{K}^{i}_{\sigma}&=\left(\hat{G}_{\sigma}p_{\sigma}-\hat{A}^{s}_{\sigma}\hat{P}^{i}_{\sigma}(\hat{C}^{s})^{T}\right)(\hat{Q}^{i}_{\sigma})^{-1}\end{aligned}$$

$$\begin{aligned}\Psi^{N}_{u,y}(w)&=\frac{1}{\sqrt{p_{w}}}\frac{1}{N}\sum_{t=|w|}^{N}y(t)(z^{u}_{w}(t))^{T}\Lambda_{u}^{-1},\quad w\in\Sigma^{*},\\ \Lambda^{y,N}_{\sigma w}&=\frac{1}{N}\sum_{t=|w|}^{N}y(t)(z^{y}_{\sigma w}(t))^{T},\quad T^{y,N}_{\sigma,\sigma}=\frac{1}{N}\sum_{t=1}^{N}z^{y}_{\sigma}(t)(z^{y}_{\sigma}(t))^{T}\end{aligned}$$

Full text

Manas Mejari

Mihály Petreczky

Centre de Recherche en Informatique, Signal et Automatique de Lille, University of Lille 1, Villeneuve-d’Ascq 59651, France

Centre de Recherche en Informatique, Signal et Automatique de Lille, UMR CNRS 9189, Ecole Centrale de Lille, Villeneuve-d’Ascq 59651, France

Keywords: Linear Parameter-Varying systems, stochastic realization.

This work was partially funded by the CPER Data project, which is co-financed by the European Union with the financial support of the European Regional Development Fund (ERDF), the French State and the French Region of Hauts-de-France.

1 Introduction

Identification of Linear Parameter-Varying (LPV) models has gained significant attention over the past few years, owing to their ability to describe the behavior of many time-varying and non-linear systems. Many approaches have been proposed for the identification of LPV models, in input-output (IO) (Bamieh and Giarré, 2002; Laurain et al., 2010; Mejari et al., 2018; Piga et al., 2015) as well as state-space (SS) representations (Felici et al., 2007; Tanelli et al., 2011; van Wingerden and Verhaegen, 2009; Verdult and Verhaegen, 2005). The reader is referred to (Tóth, 2010) for a detailed summary of the available LPV identification approaches.

Controller design approaches often require the LPV models to be in SS representation with an affine dependency on the scheduling variable. To this end, realization theory of LPV models plays a key role in understanding the conditions under which the observed behavior of a system can be realized by a state-space affine representation. It also allows one to formulate identification algorithms for estimating a state-space representation from a finite set of observations. The realization theory for deterministic Linear Parameter-Varying State-Space with Affine dependence (LPV-SSA) representations has been developed in Tóth et al. (2012); Petreczky et al. (2017). The results of Tóth et al. (2012); Petreczky et al. (2017) were used to derive LPV-SS identification algorithms in Cox et al. (2015, 2018). These methods focus mainly on deterministic realizations, which are too restrictive for certain control and filtering problems.

In this paper, we focus on formulating a realization algorithm and a related identification algorithm for stochastic LPV-SSA representations with inputs. The main idea is to decompose the stochastic LPV-SSA realization/identification problem into two independent problems: realization/identification of the deterministic part, which depends only on the input, and realization/identification of the stochastic part. To this end, the proposed algorithm combines correlation analysis (Cox et al., 2018) for deterministic realization with the stochastic covariance identification algorithm for stochastic LPV-SSA representations (Mejari and Petreczky, 2019a).

The algorithm presented in this paper extends the results of Petreczky and Vidal (2018); Mejari and Petreczky (2019a) to the case of stochastic LPV-SSA representations with exogenous inputs. The proposed approach differs significantly from the subspace-based identification methods for stochastic LPV-SSA representations (van Wingerden and Verhaegen, 2009; dos Santos et al., 2009; Favoreel et al., 1999). First, the cited papers do not deal with the realization problem. In particular, while the possibility of decomposing the output into deterministic and purely stochastic components is sometimes claimed in the literature, the formal details of such a decomposition were never addressed. Second, in contrast to the literature mentioned above, the identification algorithm proposed in this paper is provably consistent and does not require local observability assumptions. The downside is that the proposed algorithm is provably consistent only for a specific class of scheduling signals and stochastic LPV-SSA representations. Moreover, the proposed algorithm avoids the curse of dimensionality, but this comes at the price of either using some prior knowledge of the system to determine the correct selection of the rows and columns of a Hankel matrix, or using an exhaustive search to find such a selection.

The paper is organized as follows. In Section 2, we present the problem formulation. Section 3 presents the formal definition and basic properties of the class of LPV state-space representations considered in this paper. In Section 4, we formalize the decomposition of outputs of such LPV state-space representations into stochastic and deterministic components. In Section 5, we present the realization algorithm for stochastic LPV state-space representations, and in Section 6 we present the related identification algorithm. Finally, in Section 7 we illustrate the results with a numerical example.

Notation In the sequel, we use the standard terminology of probability theory (Billingsley, 1986). In particular, all random variables and stochastic processes are understood w.r.t. a fixed probability space $\left(\Omega,\mathcal{F},\mathcal{P}\right)$, where $\mathcal{F}$ is a $\sigma$-algebra over the sample space $\Omega$ (i.e., $\mathcal{F}$ is a collection of subsets of $\Omega$ that includes $\Omega$ itself and is closed under complement, countable unions and countable intersections) and $\mathcal{P}$ is a probability measure on $\mathcal{F}$. For two $\sigma$-algebras $\mathcal{F}_{i}$, $i=1,2$, $\mathcal{F}_{1}\lor\mathcal{F}_{2}$ denotes the smallest $\sigma$-algebra generated by the $\sigma$-algebras $\mathcal{F}_{1},\mathcal{F}_{2}$. The expected value of a random variable $\mathbf{x}$ is denoted by $E[\mathbf{x}]$ and the conditional expectation w.r.t. a $\sigma$-algebra $\mathcal{F}$ is denoted by $E\left[\mathbf{x}\mid\mathcal{F}\right]$. All stochastic processes in this paper are discrete-time processes defined over the time axis $\mathbb{Z}$, the set of integers. A discrete-time stochastic process taking values in $X$ is a collection $\{\mathbf{x}(t)\}_{t\in\mathbb{Z}}$, where $\mathbf{x}(t)\in X$ is a random variable for all $t\in\mathbb{Z}$. We denote by $I_{n}$ the $n\times n$ identity matrix.

2 Problem formulation

Let $\mathbf{y}$ , $\mathbf{u}$ , $\bm{\mu}$ be stochastic processes taking values in $\mathbb{R}^{n_{y}}$ , $\mathbb{R}^{n_{u}}$ and $\mathbb{R}^{n_{\mu}}$ respectively. In this paper, $\mathbf{y}$ represents the output process, $\mathbf{u}$ is the input process, and $\bm{\mu}$ is the scheduling signal process. We define a discrete-time Linear Parameter-Varying State-Space Affine (LPV-SSA) representation of the process $(\mathbf{y},\mathbf{u},\bm{\mu})$ as the discrete-time system of the form

$$\begin{aligned}\mathbf{x}(t+1)&=\sum_{i=1}^{n_{\mu}}\left(A_{i}\mathbf{x}(t)+B_{i}\mathbf{u}(t)+K_{i}\mathbf{v}(t)\right)\bm{\mu}_{i}(t),\\ \mathbf{y}(t)&=C\mathbf{x}(t)+D\mathbf{u}(t)+\mathbf{v}(t),\end{aligned}\tag{2}$$

where $A_{i}\in\mathbb{R}^{n_{x}\times n_{x}}$, $B_{i}\in\mathbb{R}^{n_{x}\times n_{u}}$, $K_{i}\in\mathbb{R}^{n_{x}\times n_{y}}$, $\forall i=1,\ldots,n_{\mu}$, $C\in\mathbb{R}^{n_{y}\times n_{x}}$ and $D\in\mathbb{R}^{n_{y}\times n_{u}}$ are real constant matrices, and $\mathbf{v}$ is a white noise process, i.e., $E[\mathbf{v}(t)\mathbf{v}^{T}(s)]=0$, $s\neq t$ and $E[\mathbf{v}(t)\mathbf{v}^{T}(t)\bm{\mu}^{2}_{i}(t)]=Q_{i}>0$, $i=1,\ldots,n_{\mu}$. The realization and identification problems considered in this paper are as follows.

Problem 1 (Realization problem)

For process $(\mathbf{y},\mathbf{u},\bm{\mu})$ , find matrices $(\{A_{i},B_{i},K_{i}\}_{i=1}^{n_{\mu}},C,D)$ and processes $\mathbf{x},\mathbf{v}$ such that (2) is a representation of $(\mathbf{y},\mathbf{u},\bm{\mu})$ .

Problem 2 (Identification problem)

Assume that $y:\mathbb{Z}\rightarrow\mathbb{R}^{n_{y}}$ is a sample path of the output process $\mathbf{y}$ , $u:\mathbb{Z}\rightarrow\mathbb{R}^{n_{u}}$ is a sample path of the input process $\mathbf{u}$ and $\mu:\mathbb{Z}\rightarrow\mathbb{R}^{n_{\mu}}$ is a sample path of the scheduling process $\bm{\mu}$ , corresponding to the same random event $\omega\in\Omega$ . Given a dataset $\{{y}(t),u(t),\mu(t)\}_{t=1}^{N}$ consisting of $N$ samples of the output, input and scheduling process, compute from this dataset the estimates $\{\{\hat{A}_{i}^{N},\hat{B}^{N}_{i},\hat{K}^{N}_{i},\hat{Q}^{N}_{i}\}_{i=1}^{n_{\mu}},\hat{C}^{N},\hat{D}^{N}\}$ , such that as $N\rightarrow\infty$ , the estimated matrices $\{\{\hat{A}_{i}^{N},\hat{B}_{i}^{N},\hat{K}_{i}^{N},\hat{Q}_{i}^{N}\}_{i=1}^{n_{\mu}},\hat{C}^{N},\hat{D}^{N}\}$ converge to matrices $\{\{A_{i},B_{i},K_{i},Q_{i}\}_{i=1}^{n_{\mu}},C,D\}$ such that the LPV-SSA (2) with $Q_{i}=E[\mathbf{v}(t)\mathbf{v}^{\top}(t)\bm{\mu}^{2}_{i}(t)]$ , $i=1,\ldots,n_{\mu}$ , is a representation of $(\mathbf{y},\mathbf{u},\bm{\mu})$ .
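To make the dataset in Problem 2 concrete, the following minimal sketch generates one sample path by iterating the recursion of the LPV-SSA representation (2) from a zero initial state. The function name `simulate_lpv_ssa`, the matrices, the noise level, and the uniform distribution of the second scheduling component are illustrative assumptions, not values from the paper; starting at $x(0)=0$ also means the simulated process is only approximately stationary.

```python
import numpy as np

def simulate_lpv_ssa(A, B, K, C, D, u, v, mu):
    """Iterate x(t+1) = sum_i (A_i x + B_i u + K_i v) mu_i, y = C x + D u + v, from x(0) = 0."""
    N = u.shape[0]
    x = np.zeros(A[0].shape[0])
    y = np.zeros((N, C.shape[0]))
    for t in range(N):
        y[t] = C @ x + D @ u[t] + v[t]
        x = sum(mu[t, i] * (A[i] @ x + B[i] @ u[t] + K[i] @ v[t])
                for i in range(len(A)))
    return y

# Hypothetical example system with n_mu = 2 (all values chosen for illustration only)
rng = np.random.default_rng(0)
N = 200
A = [np.array([[0.4, 0.1], [0.0, 0.3]]), np.array([[0.2, 0.0], [0.1, 0.2]])]
B = [np.array([[1.0], [0.5]]), np.array([[0.5], [0.0]])]
K = [np.array([[0.1], [0.0]]), np.array([[0.0], [0.1]])]
C = np.array([[1.0, 0.0]])
D = np.array([[0.0]])
mu = np.column_stack([np.ones(N), rng.uniform(-1.5, 1.5, N)])  # first component fixed to 1
u = rng.standard_normal((N, 1))        # white input
v = 0.1 * rng.standard_normal((N, 1))  # white noise
y = simulate_lpv_ssa(A, B, K, C, D, u, v, mu)
```

The triple `(y, u, mu)` then plays the role of the dataset $\{y(t),u(t),\mu(t)\}_{t=1}^{N}$, with array row `t` standing for one time sample.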

3 Properties of LPV-SSA representation

In order to make Problems 1-2 well-posed, we have to impose additional constraints on the class of processes $(\mathbf{y},\mathbf{u},\bm{\mu})$ and on the class of LPV-SSA representations.

Next, we recall from Petreczky and Vidal (2018) the notion of Zero Mean Wide Sense Stationary w.r.t. Inputs (ZMWSSI) process, which will be a central notion for the mathematical framework of stochastic LPV-SSA representations. To this end, we need the following notation and terminology.

Notation 1 ( $\Sigma$ )

Let $\Sigma=\{1,\ldots,n_{\mu}\}$ .

The following terminology from automata theory is used. A nonempty word over $\Sigma$ is a finite sequence of letters, i.e., $w=\sigma_{1}\sigma_{2}\cdots\sigma_{k}$, where $0<k\in\mathbb{Z}$, $\sigma_{1},\sigma_{2},\ldots,\sigma_{k}\in\Sigma$. The set of all nonempty words is denoted by $\Sigma^{+}$. We denote the empty word by $\epsilon$. Let $\Sigma^{*}=\{\epsilon\}\cup\Sigma^{+}$. The concatenation of two nonempty words $v=a_{1}a_{2}\cdots a_{m}$ and $w=b_{1}b_{2}\cdots b_{n}$, $m,n>0$, is defined as $vw=a_{1}\cdots a_{m}b_{1}\cdots b_{n}$. For the empty word, $v\epsilon=v$ and $\epsilon w=w$; moreover, $\epsilon\epsilon=\epsilon$. The length of the word $w\in\Sigma^{*}$ is denoted by $|w|$, and $|\epsilon|=0$. Example: for $n_{\mu}=2$, $\Sigma=\{1,2\}$, $\Sigma^{*}=\{\epsilon,1,2,11,12,21,22,111,\ldots\}$, and for the word $w=111\in\Sigma^{*}$, $|w|=3$.
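The word terminology above maps directly onto tuples of integers. The sketch below is one possible encoding (the helper name `words` is hypothetical): the empty tuple stands for $\epsilon$, tuple concatenation for word concatenation, and `len` for $|w|$.

```python
from itertools import product

def words(n_mu, max_len):
    """All words over Sigma = {1, ..., n_mu} of length 0..max_len; () plays the role of epsilon."""
    sigma = range(1, n_mu + 1)
    return [w for k in range(max_len + 1) for w in product(sigma, repeat=k)]

W = words(2, 2)   # epsilon, 1, 2, 11, 12, 21, 22
v, w = (1, 2), (1,)
vw = v + w        # concatenation of the words 12 and 1
```

Enumerating words by increasing length in this way is convenient later, when sums over $\Sigma^{*}$ have to be truncated at a finite length.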

Assumption 1 (White noise scheduling)

The scheduling process $\bm{\mu}=[\bm{\mu}_{1},\bm{\mu}_{2},\ldots,\bm{\mu}_{n_{\mu}}]^{T}$ is such that $\bm{\mu}_{1}(t)\equiv 1$ for all $t\in\mathbb{Z}$, and for each $\sigma=2,\ldots,n_{\mu}$, $\bm{\mu}_{\sigma}$ is a zero-mean independent identically distributed (i.i.d.) process.

We define scalars $E[\bm{\mu}_{\sigma}^{2}(t)]=p_{\sigma}$ , for all $t\in\mathbb{Z}$ . In particular, $p_{1}=1$ .

For every word $w\in\Sigma^{+}$ where $w=\sigma_{1}\sigma_{2}\cdots\sigma_{k}$ , $k\geq 1$ , $\sigma_{1},\ldots,\sigma_{k}\in\Sigma$ , we define the process $\bm{\mu}_{w}$ and the number $p_{w}$ as follows

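$$\begin{aligned}\bm{\mu}_{w}(t)&=\bm{\mu}_{\sigma_{1}}(t-k+1)\bm{\mu}_{\sigma_{2}}(t-k+2)\cdots\bm{\mu}_{\sigma_{k}}(t),\quad\forall t\in\mathbb{Z},\\ p_{w}&=p_{\sigma_{1}}p_{\sigma_{2}}\cdots p_{\sigma_{k}}.\end{aligned}$$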

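We set $\bm{\mu}_{\epsilon}(t)=1$ and $p_{\epsilon}=1$. For a process $\mathbf{r}$ taking values in $\mathbb{R}^{n_{u}}$ and for each $w\in\Sigma^{+}$, we define the process $\mathbf{z}^{\mathbf{r}}_{w}$ as

$$\mathbf{z}^{\mathbf{r}}_{w}(t)=\mathbf{r}(t-|w|)\bm{\mu}_{w}(t-1)\frac{1}{\sqrt{p_{w}}},\quad\forall t\in\mathbb{Z},$$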

which is interpreted as the past of $\mathbf{r}$ w.r.t. $\{\bm{\mu}_{\sigma}\}_{\sigma\in\Sigma}$ .
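On a finite sample path, the quantities $\bm{\mu}_{w}(t)$, $p_{w}$ and $\mathbf{z}^{\mathbf{r}}_{w}(t)$ can be evaluated as in the sketch below. It assumes the $1/\sqrt{p_{w}}$ normalization that also appears in Definition 2, 1-based letters, and arrays whose row `t` holds the sample at time `t`; the function names and the small numerical example are hypothetical.

```python
import numpy as np

def mu_word(mu, w, t):
    """mu_w(t) = mu_{s1}(t-k+1) * ... * mu_{sk}(t); returns 1 for the empty word."""
    k = len(w)
    return float(np.prod([mu[t - k + 1 + j, s - 1] for j, s in enumerate(w)]))

def p_word(p, w):
    """p_w = p_{s1} * ... * p_{sk}; returns 1 for the empty word."""
    return float(np.prod([p[s - 1] for s in w]))

def z_word(r, mu, p, w, t):
    """z_w^r(t) = r(t - |w|) * mu_w(t - 1) / sqrt(p_w); needs t >= |w| on a finite record."""
    if t < len(w):
        raise ValueError("t must be at least |w| on a finite record")
    return r[t - len(w)] * mu_word(mu, w, t - 1) / np.sqrt(p_word(p, w))

# Tiny deterministic example (values chosen only to make the arithmetic easy to follow)
mu = np.array([[1.0, 2.0], [1.0, -1.0], [1.0, 0.5]])  # rows are t = 0, 1, 2
r = np.array([[1.0], [2.0], [3.0]])
p = [1.0, 4.0]                                        # stand-ins for E[mu_sigma^2]
z = z_word(r, mu, p, (2, 1), 2)                       # word w = 21, time t = 2
```

Here $\bm{\mu}_{21}(1)=\mu_{2}(0)\mu_{1}(1)=2$ and $p_{21}=4$, so `z` equals $r(0)\cdot 2/\sqrt{4}=r(0)$.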

Definition 1 (ZMWSSI, Petreczky and Vidal (2018))

A stochastic process $\mathbf{r}$ is Zero Mean Wide Sense Stationary w.r.t. the scheduling process $\bm{\mu}$ (ZMWSSI) if

1. For $t\in\mathbb{Z}$, the $\sigma$-algebras generated by the random variables $\{\mathbf{r}(k)\}_{k\leq t}$, $\{\bm{\mu}_{\sigma}(k)\}_{k<t,\sigma\in\Sigma}$ and $\{\bm{\mu}_{\sigma}(k)\}_{k\geq t,\sigma\in\Sigma}$, denoted by $\mathcal{F}_{t}^{\mathbf{r}}$, $\mathcal{F}_{t}^{\bm{\mu},-}$ and $\mathcal{F}_{t}^{\bm{\mu},+}$ respectively, are such that $\mathcal{F}_{t}^{\mathbf{r}}$ and $\mathcal{F}_{t}^{\bm{\mu},+}$ are conditionally independent w.r.t. $\mathcal{F}_{t}^{\bm{\mu},-}$.

2. The processes $\{\mathbf{r},\{\mathbf{z}^{\mathbf{r}}_{w}\}_{w\in\Sigma^{+}}\}$ are zero mean, square integrable and jointly wide sense stationary. That is, $\forall t,s,k\in\mathbb{Z}$, and for all $w,v\in\Sigma^{+}$, $E\left[\mathbf{r}(t)\right]=0$, $E\left[\mathbf{z}^{\mathbf{r}}_{w}(t)\right]=0$, and

$$\begin{aligned}E[\mathbf{r}(t+k)(\mathbf{z}^{\mathbf{r}}_{w}(s+k))^{T}]&=E[\mathbf{r}(t)(\mathbf{z}^{\mathbf{r}}_{w}(s))^{T}],\\ E[\mathbf{r}(t+k)(\mathbf{r}(s+k))^{T}]&=E[\mathbf{r}(t)(\mathbf{r}(s))^{T}],\\ E[\mathbf{z}^{\mathbf{r}}_{w}(t+k)(\mathbf{z}^{\mathbf{r}}_{v}(s+k))^{T}]&=E[\mathbf{z}^{\mathbf{r}}_{w}(t)(\mathbf{z}^{\mathbf{r}}_{v}(s))^{T}].\end{aligned}$$

Definition 2 (Petreczky and Vidal (2018))

A process $\mathbf{r}$ is said to be square integrable w.r.t. $\{\bm{\mu}_{\sigma}\}_{\sigma\in\Sigma}$ (SII process), if $\forall w\in\Sigma^{*},t\in\mathbb{Z}$ , the random variable $\mathbf{z}^{\mathbf{r}+}_{w}(t)=\mathbf{r}(t+|w|)\bm{\mu}_{w}(t+|w|-1)\frac{1}{\sqrt{p_{w}}}$ , is square integrable.

All the processes considered in this paper are assumed to be ZMWSSI and SII processes w.r.t. $\bm{\mu}$.

Definition 3 (White noise w.r.t. $\bm{\mu}$ )

A process $\mathbf{r}$ is called a white noise process w.r.t. $\bm{\mu}$ if $\mathbf{r}$ is ZMWSSI w.r.t. $\bm{\mu}$, and $E[\mathbf{r}(t)(\mathbf{z}^{\mathbf{r}}_{w}(t))^{T}]=0$, $E[\mathbf{z}_{\sigma w}^{\mathbf{r}}(t)(\mathbf{z}_{\sigma w}^{\mathbf{r}}(t))^{T}]=E[\mathbf{z}^{\mathbf{r}}_{\sigma}(t)(\mathbf{z}^{\mathbf{r}}_{\sigma}(t))^{T}]>0$, for all $w\in\Sigma^{+}$, $\sigma\in\Sigma$.

Using the concept of ZMWSSI process and white noise process w.r.t. $\bm{\mu}$ , we can formulate the main assumption regarding the processes $(\mathbf{y},\mathbf{u},\bm{\mu})$ .

Assumption 2

Assume that $\bm{\mu}$ satisfies Assumption 1, and $\begin{bmatrix}\mathbf{y}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ is a ZMWSSI and SII process w.r.t. $\bm{\mu}$ , and $\mathbf{u}$ is a white noise process w.r.t. $\bm{\mu}$ , and the covariance $E[\mathbf{z}^{\mathbf{u}}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{\sigma}(t))^{T}]=E[\mathbf{u}(t-1)(\mathbf{u}(t-1))^{T}]=\Lambda_{u}>0$ does not depend on $\sigma\in\Sigma$ .

Next, we recall from Mejari and Petreczky (2019a) the notion of a stationary stochastic LPV-SSA representation of a process $\mathbf{r}$ without inputs.

Definition 4

A stationary LPV-SSA representation without inputs of a process $\mathbf{r}$ taking values in $\mathbb{R}^{p}$ is a tuple $(\{\tilde{A}_{\sigma},\tilde{K}_{\sigma}\}_{\sigma=1}^{n_{\mu}},\tilde{C},\tilde{D},\tilde{\mathbf{x}},\tilde{\mathbf{v}})$, where $\tilde{A}_{\sigma}\in\mathbb{R}^{\tilde{n}\times\tilde{n}},\tilde{K}_{\sigma}\in\mathbb{R}^{\tilde{n}\times\tilde{m}}$, $\tilde{C}\in\mathbb{R}^{p\times\tilde{n}}$ and $\tilde{\mathbf{v}}$ is a process taking values in $\mathbb{R}^{\tilde{m}}$, such that

1. $\begin{bmatrix}\tilde{\mathbf{x}}^{T}&\tilde{\mathbf{v}}^{T}\end{bmatrix}^{T}$ is a ZMWSSI process, and $E[\mathbf{z}^{\tilde{\mathbf{x}}}_{\sigma}(t)(\mathbf{z}^{\tilde{\mathbf{v}}}_{\sigma}(t))^{T}]=0$, $E[\tilde{\mathbf{x}}(t)(\mathbf{z}^{\tilde{\mathbf{v}}}_{w}(t))^{T}]=0$ for all $\sigma\in\Sigma$, $w\in\Sigma^{+}$.

2. $\tilde{\mathbf{v}}$ is a white noise process w.r.t. $\bm{\mu}$.

3. The eigenvalues of the matrix $\sum_{\sigma\in\Sigma}p_{\sigma}\tilde{A}_{\sigma}\otimes\tilde{A}_{\sigma}$ are inside the open unit disk.

4. $\tilde{\mathbf{x}}(t+1)=\sum_{i=1}^{n_{\mu}}(\tilde{A}_{i}\tilde{\mathbf{x}}(t)+\tilde{K}_{i}\tilde{\mathbf{v}}(t))\bm{\mu}_{i}(t)$, $\mathbf{r}(t)=\tilde{C}\tilde{\mathbf{x}}(t)+\tilde{D}\tilde{\mathbf{v}}(t)$.

We call $\tilde{\mathbf{x}}$ the state process and $\tilde{\mathbf{v}}$ the noise process.
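The eigenvalue condition in item 3 of Definition 4 is easy to check numerically for given matrices. The sketch below is a minimal illustration; the function name `satisfies_spectral_condition` and the example matrices and variances are assumptions made here, not taken from the paper.

```python
import numpy as np

def satisfies_spectral_condition(A, p):
    """True if all eigenvalues of sum_sigma p_sigma * kron(A_sigma, A_sigma) lie strictly inside the unit disk."""
    M = sum(p_s * np.kron(A_s, A_s) for A_s, p_s in zip(A, p))
    return bool(np.max(np.abs(np.linalg.eigvals(M))) < 1.0)

# Hypothetical matrices and second moments p_sigma = E[mu_sigma^2]
A = [np.array([[0.4, 0.1], [0.0, 0.3]]), np.array([[0.2, 0.0], [0.1, 0.2]])]
p = [1.0, 0.75]
ok = satisfies_spectral_condition(A, p)
bad = satisfies_spectral_condition([2.0 * np.eye(2)], [1.0])
```

For the first pair the matrix norms are small enough that the condition holds, while $A_{1}=2I$ gives $A_{1}\otimes A_{1}=4I$ and the condition fails.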

In the terminology of Petreczky and Vidal (2018), a stationary LPV-SSA without inputs $\mathbf{u}$ corresponds to a stationary generalized bilinear system w.r.t. the scheduling inputs $\{\bm{\mu}_{\sigma}\}_{\sigma\in\Sigma}$. From Petreczky and Vidal (2018), if a process $\mathbf{r}$ has a stationary LPV-SSA representation without inputs, then $\mathbf{r}$ is a ZMWSSI process and $\tilde{\mathbf{x}}$ is uniquely determined by $\tilde{\mathbf{v}}$ and the matrices $(\tilde{C},\tilde{D},\{\tilde{A}_{\sigma},\tilde{K}_{\sigma}\}_{\sigma\in\Sigma})$. In order to state this dependence more precisely, let us introduce the following notation.

Notation 2 (Matrix Product)

Consider a collection of square matrices $A_{\sigma}\in\mathbb{R}^{n\times n}$ , $\sigma\in\Sigma$ . For any word $w\in\Sigma^{+}$ of the form $w=\sigma_{1}\sigma_{2}\cdots\sigma_{k}$ , $k\!>\!0$ and $\sigma_{1},\ldots,\sigma_{k}\in\Sigma$ , we define $A_{w}=A_{\sigma_{k}}A_{\sigma_{k-1}}\cdots A_{\sigma_{1}}$ . For an empty word $\epsilon$ , $A_{\epsilon}=I_{n}$ .
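Note that the product in Notation 2 reverses the order of the letters: the matrix of the last letter ends up leftmost. A minimal sketch with 1-based letters (the helper name `word_product` is hypothetical):

```python
import numpy as np

def word_product(A, w):
    """A_w = A_{s_k} ... A_{s_1} for w = (s_1, ..., s_k); identity for the empty word."""
    A_w = np.eye(A[0].shape[0])
    for s in w:              # left-multiply, so later letters end up further left
        A_w = A[s - 1] @ A_w
    return A_w

A = [np.array([[1.0, 1.0], [0.0, 1.0]]), np.array([[1.0, 0.0], [1.0, 1.0]])]
A_12 = word_product(A, (1, 2))   # equals A_2 A_1, not A_1 A_2
```

Because the two example matrices do not commute, `A_12` differs from `A[0] @ A[1]`, which makes the ordering convention visible.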

From Petreczky and Vidal (2018); Mejari and Petreczky (2019a) it follows that

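$$\tilde{\mathbf{x}}(t)=\sum_{\sigma\in\Sigma,\,w\in\Sigma^{*}}\sqrt{p_{\sigma w}}\,\tilde{A}_{w}\tilde{K}_{\sigma}\mathbf{z}^{\tilde{\mathbf{v}}}_{\sigma w}(t),\tag{3}$$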

where the infinite sum on the right-hand side is absolutely convergent in the mean square sense.
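A finite-horizon analogue of this expansion can be checked numerically. The sketch below assumes a zero initial state $x(0)=0$, so that only words $\sigma w$ of length at most $T$ contribute and the sum is finite; this is an illustration of the algebra, not of the stationary infinite sum itself. The matrices, the horizon and the scheduling distribution are hypothetical, and letters are 0-based inside the code.

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(1)
n_mu, T = 2, 4
A = [np.array([[0.4, 0.1], [0.0, 0.3]]), np.array([[0.2, 0.0], [0.1, 0.2]])]
K = [np.array([[1.0], [0.5]]), np.array([[0.5], [-1.0]])]
p = [1.0, 0.75]   # E[mu_sigma^2] for mu_1 = 1 and mu_2 uniform on [-1.5, 1.5]
mu = np.column_stack([np.ones(T), rng.uniform(-1.5, 1.5, T)])
v = rng.standard_normal((T, 1))

# Recursion x(t+1) = sum_sigma (A_sigma x(t) + K_sigma v(t)) mu_sigma(t), with x(0) = 0
x = np.zeros(2)
for t in range(T):
    x = sum(mu[t, s] * (A[s] @ x + K[s] @ v[t]) for s in range(n_mu))

# Expansion: sum over words sigma w of sqrt(p_{sigma w}) A_w K_sigma z_{sigma w}^v(T)
x_sum = np.zeros(2)
for k in range(1, T + 1):
    for word in product(range(n_mu), repeat=k):   # word = sigma followed by w
        mu_w = np.prod([mu[T - k + j, s] for j, s in enumerate(word)])  # mu_{sigma w}(T-1)
        p_w = np.prod([p[s] for s in word])
        z = v[T - k] * mu_w / np.sqrt(p_w)        # z_{sigma w}^v(T)
        A_w = np.eye(2)
        for s in word[1:]:                        # w is the word without its first letter
            A_w = A[s] @ A_w
        x_sum += np.sqrt(p_w) * (A_w @ K[word[0]] @ z)
```

The factor $\sqrt{p_{\sigma w}}$ cancels the normalization inside $\mathbf{z}^{\mathbf{v}}_{\sigma w}$, so `x` and `x_sum` agree up to rounding.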

Using the notion of a stationary LPV-SSA without inputs, we can define the class of LPV-SSA representation with inputs which will be considered in this paper.

Definition 5 (Stationary LPV-SSA)

The LPV-SSA representation (2) is stationary with input $\mathbf{u}$ if $(\{A_{\sigma},\begin{bmatrix}K_{\sigma}&B_{\sigma}\end{bmatrix}\}_{\sigma\in\Sigma},C,\begin{bmatrix}I_{n_{y}}&D\end{bmatrix},\mathbf{x},\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T})$ is a stationary LPV-SSA representation of $\mathbf{y}$ without inputs as in Definition 4, and the orthogonality condition $E[\mathbf{v}(t)\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)]=0$, $\forall\sigma\in\Sigma$, holds.

From (3) it follows that for a stationary LPV-SSA representation with input $\mathbf{u}$ of the form (2),

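$$\mathbf{x}(t)=\sum_{w\in\Sigma^{*},\,\sigma\in\Sigma}\sqrt{p_{\sigma w}}\,A_{w}\left(K_{\sigma}\mathbf{z}^{\mathbf{v}}_{\sigma w}(t)+B_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)\right),$$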

where the infinite sums on the right hand side are absolutely convergent in the mean-square sense. That is, the matrices and the noise processes determine the state process of a stationary LPV-SSA (with or without inputs) uniquely.

4 Decomposition of the output of LPV-SSA representation

It turns out that the output process of stationary LPV-SSA representations admits a decomposition into deterministic and stochastic parts. The deterministic part depends only on the input process, while the stochastic part depends only on the noise process. This decomposition does not depend on the particular choice of LPV-SSA representation, but only on the output process at hand.

In order to explain this decomposition in more detail, we recall from Petreczky and Vidal (2018) the following terminology.

Notation 3 (Orthogonal projection $E_{l}$ )

Recall that the set of square integrable random variables taking values in $\mathbb{R}$ forms a Hilbert space with the scalar product defined as $\langle\mathbf{z}_{1},\mathbf{z}_{2}\rangle=E[\mathbf{z}_{1}\mathbf{z}_{2}]$. We denote this Hilbert space by $\mathcal{H}_{1}$. Let $\mathbf{z}$ be a square integrable vector-valued random variable taking its values in $\mathbb{R}^{k}$. Let $M$ be a closed subspace of $\mathcal{H}_{1}$. By the orthogonal projection of $\mathbf{z}$ onto the subspace $M$, denoted by $E_{l}[\mathbf{z}\mid M]$, we mean the vector-valued square-integrable random variable $\mathbf{z}^{*}=\begin{bmatrix}\mathbf{z}_{1}^{*},\ldots,\mathbf{z}_{k}^{*}\end{bmatrix}^{T}$ such that $\mathbf{z}_{i}^{*}\in M$ is the orthogonal projection of the $i$th coordinate $\mathbf{z}_{i}$ of $\mathbf{z}$ onto $M$, as it is usually defined for Hilbert spaces. Let $\mathfrak{S}$ be a subset of square integrable random variables in $\mathbb{R}^{p}$ for some integer $p$, and suppose that $M$ is generated by the coordinates of the elements of $\mathfrak{S}$, i.e., $M$ is the smallest (with respect to set inclusion) closed subspace of $\mathcal{H}_{1}$ which contains the set $\{\alpha^{T}s\mid s\in\mathfrak{S},\alpha\in\mathbb{R}^{p}\}$. Then instead of $E_{l}[\mathbf{z}\mid M]$ we will use $E_{l}[\mathbf{z}\mid\mathfrak{S}]$ to denote the projection of $\mathbf{z}$ onto $M$.
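When the generating set is finite and expectations are replaced by sample averages, the projection $E_{l}$ reduces to an ordinary least-squares fit. The following sketch illustrates that sample-based analogue only; the function name `linear_projection` and the synthetic data are assumptions, and the infinite generating sets used later in the paper would have to be truncated before such a fit is possible.

```python
import numpy as np

def linear_projection(Z, S):
    """Least-squares projection of the columns of Z (N x k) onto the span of the columns of S (N x m)."""
    coef, *_ = np.linalg.lstsq(S, Z, rcond=None)
    return S @ coef

rng = np.random.default_rng(2)
N = 500
S = rng.standard_normal((N, 3))   # N samples of three generating variables
Z = S @ np.array([[1.0], [-2.0], [0.5]]) + 0.1 * rng.standard_normal((N, 1))
Z_proj = linear_projection(Z, S)
residual = Z - Z_proj             # sample analogue of z - E_l[z | S]
```

The defining property of an orthogonal projection carries over: the residual is orthogonal, in the sample inner product, to every generating variable.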

Definition 6 (Deterministic and stochastic components)

Assume the processes $(\mathbf{y},\mathbf{u},\bm{\mu})$ satisfy Assumption 2. Define the deterministic component $\mathbf{y}^{d}$ of $\mathbf{y}$ as follows

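$$\mathbf{y}^{d}(t)=E_{l}[\mathbf{y}(t)\mid\{\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{u}(t)\}].$$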

Define the stochastic component of $\mathbf{y}$ as

$$\mathbf{y}^{s}(t)=\mathbf{y}(t)-\mathbf{y}^{d}(t).$$

From the definition it follows that

$$\mathbf{y}(t)=\mathbf{y}^{d}(t)+\mathbf{y}^{s}(t),$$

i.e., the process $\mathbf{y}(t)$ can be represented as the sum of its deterministic and stochastic components. When the process admits an LPV-SSA representation, the stochastic and deterministic components satisfy the following properties.

Lemma 1 (Decomposition of $\mathbf{y}$ )

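Assume that there exists a stationary LPV-SSA representation of $(\mathbf{y},\mathbf{u},\bm{\mu})$ of the form (2) and that $(\mathbf{y},\mathbf{u},\bm{\mu})$ satisfy Assumption 2. It then follows that

$$\begin{aligned}\mathbf{x}^{d}(t+1)&=\sum_{i=1}^{n_{\mu}}\left(A_{i}\mathbf{x}^{d}(t)+B_{i}\mathbf{u}(t)\right)\bm{\mu}_{i}(t),\\ \mathbf{y}^{d}(t)&=C\mathbf{x}^{d}(t)+D\mathbf{u}(t),\end{aligned}$$

and $(\{A_{\sigma},B_{\sigma}\}_{\sigma\in\Sigma},C,D,\mathbf{x}^{d},\mathbf{u})$ is a stationary LPV-SSA representation of $\mathbf{y}^{d}$ without inputs and with noise process $\mathbf{u}$. Moreover,

$$\begin{aligned}\mathbf{x}^{s}(t+1)&=\sum_{i=1}^{n_{\mu}}\left(A_{i}\mathbf{x}^{s}(t)+K_{i}\mathbf{v}(t)\right)\bm{\mu}_{i}(t),\\ \mathbf{y}^{s}(t)&=C\mathbf{x}^{s}(t)+\mathbf{v}(t),\end{aligned}$$

and $(\{A_{\sigma},K_{\sigma}\}_{\sigma\in\Sigma},C,I_{n_{y}},\mathbf{x}^{s},\mathbf{v})$ is a stationary LPV-SSA representation of $\mathbf{y}^{s}$ without inputs, where

$$\begin{aligned}\mathbf{x}^{d}(t)&=E_{l}[\mathbf{x}(t)\mid\{\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{u}(t)\}],\\ \mathbf{x}^{s}(t)&=\mathbf{x}(t)-\mathbf{x}^{d}(t).\end{aligned}$$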

The proof of Lemma 1 is presented in (Mejari and Petreczky, 2019b, Appendix A.1). Thus, $\mathbf{y}^{s}$ depends only on the noise $\mathbf{v}$, while $\mathbf{y}^{d}$ does not depend on the noise and depends only on the input $\mathbf{u}$. In fact, the converse of Lemma 1 also holds.

Lemma 2

Assume that $\mathbf{y}$ has a stationary LPV-SSA representation with input $\mathbf{u}$. Assume that $\Sigma_{d}=(\{\hat{A}^{d}_{i},\hat{B}^{d}_{i}\}_{i=1}^{n_{\mu}},\hat{C}^{d},\hat{D}^{d},\hat{\mathbf{x}}^{d},\mathbf{u})$ is a stationary LPV-SSA representation of $\mathbf{y}^{d}$ without inputs such that its noise process equals the input process $\mathbf{u}$. Assume that $\Sigma_{s}=(\{\hat{A}^{s}_{i},\hat{K}^{s}_{i}\}_{i=1}^{n_{\mu}},\hat{C}^{s},I_{n_{y}},\hat{\mathbf{x}}^{s},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $\mathbf{y}^{s}$ without inputs in forward innovation form, i.e., assume that the process $\mathbf{e}^{s}$ is the so-called innovation process of $\mathbf{y}^{s}$ as defined in Mejari and Petreczky (2019a); Petreczky and Vidal (2018):

$$\mathbf{e}^{s}(t)=\mathbf{y}^{s}(t)-E_{l}[\mathbf{y}^{s}(t)\mid\{\mathbf{z}^{\mathbf{y}^{s}}_{w}(t)\}_{w\in\Sigma^{+}}].$$

Then, tuple $(\{\hat{A}_{i},\hat{K}_{i},\hat{B}_{i}\}_{i=1}^{n_{\mu}},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $\mathbf{y}$ with input $\mathbf{u}$ , where

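$$\begin{aligned}\hat{\mathbf{x}}(t)&=\begin{bmatrix}(\hat{\mathbf{x}}^{d}(t))^{T}&(\hat{\mathbf{x}}^{s}(t))^{T}\end{bmatrix}^{T},\quad\hat{A}_{\sigma}=\mathrm{diag}(\hat{A}^{d}_{\sigma},\hat{A}^{s}_{\sigma}),\quad\hat{B}_{\sigma}=\begin{bmatrix}(\hat{B}^{d}_{\sigma})^{T}&0_{n_{x}\times n_{u}}^{T}\end{bmatrix}^{T},\\ \hat{K}_{\sigma}&=\begin{bmatrix}0_{n_{x}\times n_{y}}^{T}&(\hat{K}^{s}_{\sigma})^{T}\end{bmatrix}^{T},\quad\hat{C}=\begin{bmatrix}\hat{C}^{d}&\hat{C}^{s}\end{bmatrix},\quad\hat{D}=\hat{D}^{d}.\end{aligned}$$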

Moreover, the innovation process $\mathbf{e}^{s}$ satisfies

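$$\mathbf{e}^{s}(t)=\mathbf{y}(t)-E_{l}[\mathbf{y}(t)\mid\{\mathbf{z}^{\mathbf{y}}_{w}(t),\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{u}(t)\}].\tag{12}$$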

The proof of Lemma 2 is presented in (Mejari and Petreczky, 2019b, Appendix A.2). Thus, the problem of realization of $\mathbf{y}$ can be decomposed into two problems:

P1: finding a stationary LPV-SSA representation $\Sigma_{d}$ without inputs of $\mathbf{y}^{d}$, such that the noise process of $\Sigma_{d}$ is $\mathbf{u}$,

P2: finding a stationary LPV-SSA representation $\Sigma_{s}$ without inputs of $\mathbf{y}^{s}=\mathbf{y}-\mathbf{y}^{d}$, such that the noise process $\mathbf{e}^{s}$ of $\Sigma_{s}$ is the innovation process of $\mathbf{y}^{s}$ as defined in Mejari and Petreczky (2019a); Petreczky and Vidal (2018).

Moreover, the innovation process $\mathbf{e}^{s}(t)$ is the error of projecting $\mathbf{y}(t)$ onto the linear space spanned by the products of the past values of $\mathbf{y}$ , $\mathbf{u}$ and the scheduling process $\bm{\mu}$ , as defined in (12).
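The way Lemma 2 glues the solutions of P1 and P2 together is purely structural: the two state vectors are stacked and the matrices are arranged block-wise. The sketch below assembles those blocks for given sub-models; the function name `combine` and the example matrices are hypothetical, and the zero blocks are sized by the dimensions of the respective sub-model.

```python
import numpy as np

def combine(Ad, Bd, Cd, Dd, As, Ks, Cs):
    """Block matrices of the combined model from the deterministic (d) and stochastic (s) parts."""
    nd, ns = Ad[0].shape[0], As[0].shape[0]
    nu, ny = Bd[0].shape[1], Ks[0].shape[1]
    A = [np.block([[a_d, np.zeros((nd, ns))], [np.zeros((ns, nd)), a_s]])
         for a_d, a_s in zip(Ad, As)]                        # diag(A^d, A^s)
    B = [np.vstack([b_d, np.zeros((ns, nu))]) for b_d in Bd]  # input drives only x^d
    K = [np.vstack([np.zeros((nd, ny)), k_s]) for k_s in Ks]  # noise drives only x^s
    C = np.hstack([Cd, Cs])
    return A, B, K, C, Dd

# Hypothetical sub-models: two deterministic states, one stochastic state, n_mu = 2
Ad = [0.5 * np.eye(2), 0.1 * np.eye(2)]
Bd = [np.ones((2, 1)), np.zeros((2, 1))]
Cd, Dd = np.array([[1.0, 0.0]]), np.array([[0.0]])
As = [np.array([[0.3]]), np.array([[0.2]])]
Ks = [np.array([[1.0]]), np.array([[0.5]])]
Cs = np.array([[1.0]])
A, B, K, C, D = combine(Ad, Bd, Cd, Dd, As, Ks, Cs)
```

Because the state matrices are block diagonal, the two sub-states evolve independently and the combined output is the sum of the two sub-model outputs.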

In order to solve problem P1, we can adapt realization theory of deterministic LPV-SSA representations. To this end, in Section 5.2 we present an adaptation of the reduced basis Ho-Kalman algorithm from Cox et al. (2018). Solution to problem P2 was developed in Petreczky and Vidal (2018), and a realization algorithm was formulated in Mejari and Petreczky (2019a). The latter algorithm will be recalled in Section 5.3 which is also based on the reduced basis Ho-Kalman algorithm (Cox et al., 2018).

Combining the realization algorithms from Sections 5.2–5.3 yields a realization algorithm which can easily be converted into a system identification algorithm. The resulting identification algorithm first estimates an LPV-SSA representation of $\mathbf{y}^{d}$ whose noise process is the input $\mathbf{u}$ , and then estimates a stationary LPV-SSA representation of $\mathbf{y}^{s}$ in forward innovation form. This identification algorithm is presented in Section 6.

5 Realization algorithms

In this section, we first recall the basis reduced Ho-Kalman realization algorithm for deterministic LPV state-space representations. This algorithm is then used in the covariance realization algorithms for computing LPV-SSA representations of $\mathbf{y}^{d}$ and $\mathbf{y}^{s}$ , presented in Sections 5.2–5.3.

5.1 Basis reduced Ho-Kalman realization algorithm

Recall from Petreczky et al. (2017); Cox et al. (2018) that a deterministic LPV-SSA representation (with affine dependence) is a system of the form

[TABLE]

where $A_{i},B_{i},C,D$ are matrices of suitable dimensions, $x:\mathbb{Z}\rightarrow\mathbb{R}^{n_{x}}$ is the state trajectory, $u:\mathbb{Z}\rightarrow\mathbb{R}^{n_{u}}$ is the input trajectory, and $y:\mathbb{Z}\rightarrow\mathbb{R}^{n_{y}}$ is the output trajectory. In order to avoid technical problems, we assume that $x,u,y$ all have finite support, i.e., there exists a $t_{0}\in\mathbb{Z}$ such that $x(s)=0,y(s)=0,u(s)=0$ for all $s<t_{0}$ . We identify a deterministic LPV-SSA of the form (13) with the tuple $\mathscr{S}=(\{A_{\sigma},B_{\sigma}\}_{\sigma\in\Sigma},C,D)$ . The number $n_{x}$ is called the dimension of $\mathscr{S}$ . The sub-Markov parameters of $\mathscr{S}=(\{A_{\sigma},B_{\sigma}\}_{\sigma\in\Sigma},C,D)$ are the values of the map $M_{\mathscr{S}}:\Sigma^{*}\rightarrow\mathbb{R}^{n_{y}\times n_{u}}$ , such that for all $w\in\Sigma^{*}$ ,

[TABLE]

We will refer to $M_{\mathscr{S}}$ as the sub-Markov function of the deterministic LPV-SSA representation $\mathscr{S}$ . From Petreczky et al. (2017) it then follows that two deterministic LPV-SSA representations $\mathscr{S}_{1}$ , $\mathscr{S}_{2}$ have the same input-output behavior if and only if their sub-Markov parameters are equal, i.e., $M_{\mathscr{S}_{1}}=M_{\mathscr{S}_{2}}$ . Moreover, the sub-Markov parameters can be determined from the input-output behavior.
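As a small illustration of the sub-Markov function, the following Python sketch evaluates $M_{\mathscr{S}}$ on a few words for a toy system. The matrices are hypothetical and are not taken from the paper. The sketch assumes $M_{\mathscr{S}}(\epsilon)=D$ and $M_{\mathscr{S}}(\sigma w)=CA_{w}B_{\sigma}$ with $A_{w}=A_{\sigma_{r}}\cdots A_{\sigma_{1}}$ for $w=\sigma_{1}\cdots\sigma_{r}$ ; the ordering of the factors in $A_{w}$ is an assumed convention.

```python
def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def sub_markov(word, A, B, C, D):
    """Return M(word), where word is a tuple of symbols from Sigma.

    Assumed convention: M(()) = D and, for word = (sigma,) + w with
    w = (s_1, ..., s_r), M(word) = C A_{s_r} ... A_{s_1} B_sigma.
    """
    if len(word) == 0:
        return D
    sigma, w = word[0], word[1:]
    result = B[sigma]
    for s in w:  # A_{s_1} is applied first, A_{s_r} last
        result = matmul(A[s], result)
    return matmul(C, result)


# Hypothetical toy system: n_x = 2, n_u = n_y = 1, Sigma = {1, 2}.
A = {1: [[0.5, 0.0], [0.0, 0.2]], 2: [[0.0, 1.0], [0.0, 0.0]]}
B = {1: [[1.0], [0.0]], 2: [[0.0], [1.0]]}
C = [[1.0, 1.0]]
D = [[0.0]]

m_1 = sub_markov((1,), A, B, C, D)         # C B_1
m_11 = sub_markov((1, 1), A, B, C, D)      # C A_1 B_1
m_212 = sub_markov((2, 1, 2), A, B, C, D)  # C A_2 A_1 B_2
```

Two systems with equal values of this map on all words would, by the result recalled above, have the same input-output behavior.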

Below we recall from Cox et al. (2018) a Ho-Kalman-like algorithm, which uses sub-Markov parameters to compute a deterministic LPV-SSA representation. In order to present the algorithm, we introduce the notion of $n$ -selection. Let us define the set $\Sigma^{n}$ as the set of all words $w\in\Sigma^{*}$ of length less than or equal to $n$ , i.e., $\Sigma^{n}=\{w\in\Sigma^{*}\mid|w|\leq n\}$ .

Definition 7 (Selection)

We define $(n,n_{y},n_{u})$ -selection as a pair $\left(\alpha,\beta\right)$ such that

(1) $\alpha\subseteq\Sigma^{n}\times\{1,2,\cdots,n_{y}\}$ and $\beta\subseteq\Sigma\times\Sigma^{n}\times\{1,2,\cdots,n_{u}\}$ ,

(2) $\mathrm{card}(\alpha)=\mathrm{card}(\beta)=n$ , where $\mathrm{card}$ denotes the cardinality of a set.

When $n_{y}$ and $n_{u}$ are clear from the context, we refer to $(n,n_{y},n_{u})$ -selections as $n$ -selections, and when $n$ is also clear from the context, we use the term selection.

We will fix the following ordering of $\alpha$ and $\beta$ .

[TABLE]

where $u_{i}\in\Sigma^{n}$ , $k_{i}\in\{1,2,\cdots,n_{y}\}$ , $\sigma_{j}\in\Sigma$ , $v_{j}\in\Sigma^{n}$ , $l_{j}\in\{1,2,\cdots,n_{u}\}$ .

Example 1

Consider $n=2$ , number of outputs and inputs $n_{y}=n_{u}=2$ , and scheduling signal dimension $n_{\mu}=2$ , so that $\Sigma^{n}=\{\epsilon,1,2,11,12,21,22\}$ . Then one possible $n$ -selection pair $\left(\alpha,\beta\right)$ is, e.g., $\alpha=\{\left(u_{1},k_{1}\right),\left(u_{2},k_{2}\right)\}=\{\left(\epsilon,1\right),\left(11,2\right)\}$ and $\beta=\{\left(\sigma_{1},v_{1},l_{1}\right),\left(\sigma_{2},v_{2},l_{2}\right)\}=\{\left(1,21,1\right),\left(2,22,2\right)\}$ .
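The set $\Sigma^{n}$ and the selection of Example 1 can be reproduced with a short Python sketch. Words are represented as tuples of symbols, with the empty tuple standing for $\epsilon$ ; the helper names `words_up_to` and `is_selection` are illustrative and do not come from the paper.

```python
from itertools import product


def words_up_to(alphabet, n):
    """All words over `alphabet` of length <= n, shortest first.

    The empty word epsilon is represented by the empty tuple ().
    """
    return [w for length in range(n + 1)
            for w in product(alphabet, repeat=length)]


def is_selection(alpha, beta, alphabet, n, n_y, n_u):
    """Check the two conditions of Definition 7 for a candidate (alpha, beta)."""
    words = set(words_up_to(alphabet, n))
    alpha_ok = all(u in words and 1 <= k <= n_y for (u, k) in alpha)
    beta_ok = all(s in alphabet and v in words and 1 <= l <= n_u
                  for (s, v, l) in beta)
    return alpha_ok and beta_ok and len(set(alpha)) == len(set(beta)) == n


# Example 1: n = 2, n_y = n_u = 2, Sigma = {1, 2}.
sigma_n = words_up_to((1, 2), 2)
alpha = [((), 1), ((1, 1), 2)]
beta = [(1, (2, 1), 1), (2, (2, 2), 2)]
valid = is_selection(alpha, beta, (1, 2), 2, 2, 2)
```

Here `sigma_n` lists the seven words $\epsilon,1,2,11,12,21,22$ in that order, and `valid` confirms that the pair from Example 1 satisfies both conditions of Definition 7.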

Let $M:\Sigma^{*}\rightarrow\mathbb{R}^{n_{y}\times n_{u}}$ be a map whose values represent potential sub-Markov parameters (14) of an LPV-SSA. Let us now define the Hankel matrix $\mathcal{H}_{\alpha,\beta}^{M}\in\mathbb{R}^{n\times n}$ as follows: for $i,j=1,\ldots,n$ , the $(i,j)$ -th element of $\mathcal{H}_{\alpha,\beta}^{M}$ is of the form

[TABLE]

where $\left[M(\sigma_{j}v_{j}u_{i})\right]_{k_{i},l_{j}}$ denotes the entry of $M(\sigma_{j}v_{j}u_{i})$ in the $k_{i}$ -th row and $l_{j}$ -th column, and $\left(u_{i},k_{i}\right)\in\alpha,\left(\sigma_{j},v_{j},l_{j}\right)\in\beta$ are as in the ordering of (15). Intuitively, the rows of $\mathcal{H}_{\alpha,\beta}^{M}$ are indexed by word-index pairs $\left(u_{i},k_{i}\right)\in\alpha$ , where $u_{i}\in\Sigma^{n}$ and $k_{i}\in\{1,\ldots,n_{y}\}$ . Similarly, the columns of $\mathcal{H}_{\alpha,\beta}^{M}$ are indexed by the triples $\left(\sigma_{j},v_{j},l_{j}\right)\in\beta$ , where $\sigma_{j}\in\Sigma$ , $v_{j}\in\Sigma^{n}$ and $l_{j}\in\{1,\ldots,n_{u}\}$ . The element of $\mathcal{H}_{\alpha,\beta}^{M}$ with row index $(u_{i},k_{i})$ and column index $(\sigma_{j},v_{j},l_{j})$ is the $(k_{i},l_{j})$ -th entry of $M(\sigma_{j}v_{j}u_{i})$ .
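The indexing of the Hankel matrix can be made concrete with a minimal Python sketch. It builds $\mathcal{H}_{\alpha,\beta}^{M}$ entry by entry from the description above, for a hypothetical scalar map `M` generated by a one-dimensional toy system (the numbers `a`, `b` and the function names are assumptions made for illustration only).

```python
def hankel(M, alpha, beta):
    """Build H_{alpha,beta}^M for ordered selections alpha and beta.

    M maps a word (tuple of symbols) to an n_y x n_u matrix (list of rows);
    alpha is a list of (u_i, k_i) and beta a list of (sigma_j, v_j, l_j),
    with 1-based output/input indices k_i and l_j as in the text.
    Entry (i, j) is the (k_i, l_j) entry of M(sigma_j v_j u_i).
    """
    return [[M((sigma,) + v + u)[k - 1][l - 1] for (sigma, v, l) in beta]
            for (u, k) in alpha]


# Hypothetical scalar toy system (n_x = n_y = n_u = 1, C = 1, D = 0):
# M(sigma w) = a_{s_r} ... a_{s_1} b_sigma for w = (s_1, ..., s_r).
a = {1: 0.5, 2: 0.25}
b = {1: 1.0, 2: 2.0}


def M(word):
    if not word:
        return [[0.0]]
    value = b[word[0]]
    for s in word[1:]:
        value *= a[s]
    return [[value]]


alpha = [((), 1), ((1,), 1)]
beta = [(1, (), 1), (2, (), 1)]
H = hankel(M, alpha, beta)
det_H = H[0][0] * H[1][1] - H[0][1] * H[1][0]
```

In this toy case the $2\times 2$ matrix is singular, which is consistent with the generating system having dimension one: a selection of size $n$ can only give a rank- $n$ Hankel matrix if the underlying minimal dimension is at least $n$ .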

In addition, we define the $\sigma$ -shifted Hankel-matrix $\mathcal{H}_{\sigma,\alpha,\beta}^{M}\in\mathbb{R}^{n\times n}$ as follows: its $i,j$ -th entry is given by

[TABLE]

Moreover, let us define Hankel matrices $\mathcal{H}_{\alpha,\sigma}^{M}\in\mathbb{R}^{n\times n_{u}}$ and $\mathcal{H}^{M}_{\beta}\in\mathbb{R}^{n_{y}\times n}$ as follows

[TABLE]

Consider the model matrix computations summarized in Algorithm 1, using Hankel matrices and selections.

Lemma 3 (Adapted from (Cox et al., 2018))

Let the $(n,n_{y},n_{u})$ -selection $(\alpha,\beta)$ be such that $\text{rank}(\mathcal{H}_{\alpha,\beta}^{M})=n$ , and assume that there exists a deterministic LPV-SSA representation $\mathscr{S}_{*}$ of dimension $n$ such that $M=M_{\mathscr{S}_{*}}$ . Then the tuple $\hat{\mathscr{S}}=(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D})$ , returned by Algorithm 1, when applied to the matrices $\mathcal{H}_{\alpha,\beta}^{M}$ , $\mathcal{H}^{M}_{\sigma,\alpha,\beta}$ , $\mathcal{H}_{\alpha,\sigma}^{M}$ , $\mathcal{H}_{\beta}^{M}$ ((16)-(19)) and $M(\epsilon)$ , is a minimal dimensional deterministic LPV-SSA representation such that $M_{\hat{\mathscr{S}}}=M$ , i.e. $M(\sigma w)=\hat{C}\hat{A}_{w}\hat{B}_{\sigma}$ for all $w\in\Sigma^{*}$ .

5.2 Correlation analysis: finding an LPV-SSA representation of $\mathbf{y}^{d}$

In this section, we describe an adaptation of the correlation analysis (CRA) method (Cox et al., 2015, 2018) for finding a stationary LPV-SSA representation of $\mathbf{y}^{d}$ with noise process $\mathbf{u}$ .

Let us define the map $\Psi_{\mathbf{u},\mathbf{y}}:\Sigma^{*}\rightarrow\mathbb{R}^{n_{y}\times n_{u}}$ as follows

[TABLE]

where we recall from Assumption 2, $\Lambda_{u}\!=\!\mathrm{var}(\mathbf{u})$ .

It turns out that if $\mathbf{y}$ has a stationary LPV-SSA representation with input $\mathbf{u}$ , then $\Psi_{\mathbf{u},\mathbf{y}}$ is the sub-Markov function of a deterministic LPV-SSA representation.

Lemma 4

Assume that $\mathbf{y}$ has a realization by a stationary LPV-SSA representation with input $\mathbf{u}$ . Assume that $(\{A_{\sigma},B_{\sigma}\},C,D,\mathbf{x},\mathbf{u})$ is a stationary LPV-SSA representation (without inputs, Definition 4) of $\mathbf{y}^{d}$ . Then $\Psi_{\mathbf{u},\mathbf{y}}$ in (20) equals the sub-Markov function $M_{\mathscr{S}}$ (14) of the deterministic LPV-SSA representation $\mathscr{S}=(\{A_{\sigma},B_{\sigma}\}_{\sigma\in\Sigma},C,D)$ . Conversely, if $\hat{\mathscr{S}}=(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D})$ is a deterministic LPV-SSA representation such that its sub-Markov function $M_{\hat{\mathscr{S}}}$ equals $\Psi_{\mathbf{u},\mathbf{y}}$ and it is minimal dimensional among such deterministic LPV-SSA representations, then $(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{u})$ is a stationary LPV-SSA representation (without inputs) of $\mathbf{y}^{d}$ .

The proof of Lemma 4 is presented in (Mejari and Petreczky, 2019b, Appendix A.3).

Hence, we can adapt the basis reduced Ho-Kalman realization algorithm as described in Algorithm 2.

It is clear from Lemma 4 and Lemma 3 that Algorithm 2 is correct.

Corollary 1

If $\mathbf{y}^{d}$ has a stationary LPV-SSA representation with no inputs, with noise process $\mathbf{u}$ , with dimension $n$ and $\mathrm{rank}$ $\mathcal{H}_{\alpha,\beta}^{\Psi_{\mathbf{u},\mathbf{y}}}=n$ , then Algorithm 2 returns matrices $(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D})$ such that $(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{u})$ is a stationary LPV-SSA representation of $\mathbf{y}^{d}$ without inputs, with noise process $\mathbf{u}$ .

5.3 Covariance realization algorithm

In this section, we adapt the realization algorithm from Mejari and Petreczky (2019a) to estimate the stochastic part (1) of an LPV-SSA representation.

Define the covariance sequence $\Psi_{\mathbf{y}^{s}}:\Sigma^{*}\rightarrow\mathbb{R}^{n_{y}\times n_{y}}$ , where $\Psi_{\mathbf{y}^{s}}(\epsilon)=I_{n_{y}}$ , and for all $w\in\Sigma^{+}$ ,

[TABLE]

If $\mathbf{y}^{s}$ has a stationary LPV-SSA representation, then $\Psi_{\mathbf{y}^{s}}$ is a sub-Markov function of a suitable deterministic LPV-SSA representation (Petreczky and Vidal, 2018; Mejari and Petreczky, 2019a).

Conversely, from a deterministic LPV-SSA representation whose sub-Markov function equals $\Psi_{\mathbf{y}^{s}}$ , a stationary LPV-SSA representation can be computed.

Lemma 5

If $\mathscr{S}=(\{\hat{A}_{\sigma},\hat{G}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},I_{n_{y}})$ is a minimal dimensional deterministic LPV-SSA representation such that $M_{\mathscr{S}}=\Psi_{\mathbf{y}^{s}}$ , then $(\{\hat{A}^{s}_{\sigma},\hat{K}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},I_{n_{y}},\hat{\mathbf{x}},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $\mathbf{y}^{s}$ in forward innovation form, where $\hat{A}^{s}_{\sigma}=\frac{1}{\sqrt{p_{\sigma}}}\hat{A}_{\sigma}$ , $\hat{C}^{s}=\hat{C}$ , $\hat{K}_{\sigma}=\lim_{i\rightarrow\infty}\hat{K}_{\sigma}^{i}$ , and $\{\hat{K}_{\sigma}^{i}\}_{\sigma\in\Sigma,i\in\mathbb{N}}$ satisfies the following recursion

[TABLE]

with $\hat{P}^{0}_{\sigma}=0$ . Moreover, $E[\mathbf{e}^{s}(t)(\mathbf{e}^{s}(t))^{T}\bm{\mu}^{2}_{\sigma}(t)]=\hat{Q}_{\sigma}=\lim_{i\rightarrow\infty}\hat{Q}_{\sigma}^{i}$ , $E[\hat{\mathbf{x}}(t)\hat{\mathbf{x}}^{T}(t)\bm{\mu}^{2}_{\sigma}]=\hat{P}_{\sigma}=\lim_{i\rightarrow\infty}\hat{P}_{\sigma}^{i}$ for all $\sigma\in\Sigma$ .

The proof of Lemma 5 can be found in Petreczky and Vidal (2018); Mejari and Petreczky (2019a), (Mejari and Petreczky, 2019b, Appendix A.4). From Lemma 5, it follows that we can use the basis reduced Ho-Kalman realization algorithm Algorithm 2, as described in Algorithm 3, in order to compute an LPV-SSA representation of $\mathbf{y}^{s}$ .

It is clear from Lemma 5 and Lemma 3 that Algorithm 3 is correct.

Corollary 2

If $\mathbf{y}^{s}$ has a stationary LPV-SSA representation with no inputs, with dimension $n$ and $\mathrm{rank}\ \mathcal{H}_{\bar{\alpha},\bar{\beta}}^{\Psi_{\mathbf{y}^{s}}}=n$ , then Algorithm 3 returns matrices $(\{\hat{A}^{s}_{\sigma},\hat{G}_{\sigma},\hat{K}^{\mathcal{I}}_{\sigma},\hat{Q}^{\mathcal{I}}_{\sigma},\hat{P}^{\mathcal{I}}_{\sigma}\}_{\sigma\in\Sigma},\hat{C}^{s})$ such that, with $\hat{K}_{\sigma}=\lim_{\mathcal{I}\rightarrow\infty}\hat{K}_{\sigma}^{\mathcal{I}}$ , $\hat{Q}_{\sigma}=\lim_{\mathcal{I}\rightarrow\infty}\hat{Q}_{\sigma}^{\mathcal{I}}$ , $\hat{P}_{\sigma}=\lim_{\mathcal{I}\rightarrow\infty}\hat{P}_{\sigma}^{\mathcal{I}}$ , the tuple $(\{\hat{A}_{\sigma}^{s},\hat{K}_{\sigma}\}_{\sigma\in\Sigma},\hat{C}^{s},I_{n_{y}},\hat{\mathbf{x}},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $\mathbf{y}^{s}$ without inputs, and $\hat{Q}_{\sigma}=E[\mathbf{e}^{s}(t)(\mathbf{e}^{s}(t))^{T}\bm{\mu}_{\sigma}^{2}(t)]$ , $\hat{P}_{\sigma}=E[\hat{\mathbf{x}}(t)\hat{\mathbf{x}}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)]$ , $\sigma\in\Sigma$ .

6 Identification algorithm

In this section, we formulate an identification algorithm based on the stochastic realization Algorithms 2–3 and selections, for a length- $N$ observation sequence of outputs, inputs and scheduling signals, as detailed in Algorithm 4. Intuitively, the main idea behind Algorithm 4 is to estimate the covariances $\Psi_{\mathbf{u},\mathbf{y}}$ , $\Psi_{\mathbf{y}^{s}}$ and $E[\mathbf{z}^{\mathbf{y}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}}_{\sigma}(t))^{T}]$ from the observed data and then apply Algorithms 2–3 to the estimated covariances. More specifically, the following assumptions are made:

Assumption 3

(1) The $n_{x}$ -selection pairs $\left(\alpha,\beta\right)$ and $\left(\bar{\alpha},\bar{\beta}\right)$ are such that $\mathrm{rank}\ \mathcal{H}_{\alpha,\beta}^{\Psi_{\mathbf{u},\mathbf{y}}}=n_{x}$ , $\mathrm{rank}\ \mathcal{H}_{\bar{\alpha},\bar{\beta}}^{\Psi_{\mathbf{y}^{s}}}=n_{x}$ , where $n_{x}$ is the state-space dimension of a minimal LPV-SSA realization of $\mathbf{y}$ .

(2) The process $(\mathbf{y},\mathbf{u},\{\bm{\mu}_{w}\}_{w\in\Sigma^{+}})$ is ergodic and there exist sample paths $y:\mathbb{Z}\rightarrow\mathbb{R}^{n_{y}}$ , $u:\mathbb{Z}\rightarrow\mathbb{R}^{n_{u}}$ and $\mu:\mathbb{Z}\rightarrow\mathbb{R}^{n_{\mu}}$ of the processes $\mathbf{y}$ , $\mathbf{u}$ and $\bm{\mu}$ respectively such that $\{y(t),u(t),\{{\mu}_{\sigma}(t)\}_{\sigma\in\Sigma}\}_{t=1}^{N}$ is observed and the following holds: for all $w\in\Sigma^{*},\sigma\in\Sigma$ ,

[TABLE]

Then for all $w\in\Sigma^{*}$ , $\sigma\in\Sigma$ ,

[TABLE]

where, for all $w=\sigma_{1}\sigma_{2}\cdots\sigma_{r}\in\Sigma^{+}$ , $r>0$ , we have,

[TABLE]

Lemma 6 (Consistency)

Under Assumption 3, the result of Algorithm 4 satisfies the following:

[TABLE]

and $(\{\tilde{A}_{\sigma},\tilde{B}_{\sigma},\tilde{K}_{\sigma}\}_{\sigma=1}^{n_{\mu}},\tilde{C},\tilde{D},\hat{\mathbf{x}},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $(\mathbf{y},\mathbf{u},\bm{\mu})$ , and $E[\mathbf{e}^{s}(t)(\mathbf{e}^{s}(t))^{T}\bm{\mu}_{\sigma}^{2}(t)]=\lim_{\mathcal{I}\rightarrow\infty}\lim_{N\rightarrow\infty}\tilde{Q}^{N,\mathcal{I}}_{\sigma}$ , $\sigma\in\Sigma$ .

The proof sketch of Lemma 6 is presented in (Mejari and Petreczky, 2019a, Theorem 3), Mejari and Petreczky (2019b).

Remark 1 (Intuition behind (23))

It can be shown that $\Psi_{\mathbf{y}^{s}}(\sigma w)=E[\mathbf{y}(t)(\mathbf{z}_{\sigma w}^{\mathbf{y}}(t))^{T}]-E[\mathbf{y}^{d}(t)(\mathbf{z}_{\sigma w}^{\mathbf{y}^{d}}(t))^{T}]$ and $E[\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t))^{T}]=E[\mathbf{z}^{\mathbf{y}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}}_{\sigma}(t))^{T}]-E[\mathbf{z}^{\mathbf{y}^{d}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}^{d}}_{\sigma}(t))^{T}]$ , see Mejari and Petreczky (2019b). Moreover, if $(\{\tilde{A}_{\sigma}^{d},\tilde{B}_{\sigma}^{d}\}_{\sigma\in\Sigma},\tilde{C}^{d},\tilde{D}^{d},\hat{\mathbf{x}}^{d},\mathbf{u})$ is a stationary LPV-SSA representation of $\mathbf{y}^{d}$ without inputs, then from Petreczky and Vidal (2018) it follows that $\Lambda_{\mathscr{S}}(\sigma w)=E[\mathbf{y}^{d}(t)(\mathbf{z}_{\sigma w}^{\mathbf{y}^{d}}(t))^{T}]$ and $T_{\sigma,\sigma,\mathscr{S}}=E[\mathbf{z}^{\mathbf{y}^{d}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}^{d}}_{\sigma}(t))^{T}]$ . Intuitively, since $\Psi_{\mathbf{u},\mathbf{y}}^{N}(w)$ converges to $\Psi_{\mathbf{u},\mathbf{y}}(w)$ as $N\rightarrow\infty$ , the tuple $(\{\tilde{A}_{\sigma}^{d},\tilde{B}_{\sigma}^{d}\}_{\sigma\in\Sigma},\tilde{C}^{d},\tilde{D}^{d},\hat{\mathbf{x}}^{d},\mathbf{u})$ becomes an LPV-SSA representation of $\mathbf{y}^{d}$ as $N\rightarrow\infty$ , and hence the right-hand sides of the first and third equations of (23) converge to $E[\mathbf{y}(t)(\mathbf{z}_{\sigma w}^{\mathbf{y}}(t))^{T}]-E[\mathbf{y}^{d}(t)(\mathbf{z}_{\sigma w}^{\mathbf{y}^{d}}(t))^{T}]$ and $E[\mathbf{z}^{\mathbf{y}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}}_{\sigma}(t))^{T}]-E[\mathbf{z}^{\mathbf{y}^{d}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}^{d}}_{\sigma}(t))^{T}]$ , respectively.

Remark 2 (Alternative way of computing $\Psi_{\mathbf{y}^{s}}^{N}$ )

An alternative way of estimating the covariances $\Psi_{\mathbf{y}^{s}}$ and $\{E[\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t))^{T}]\}_{\sigma\in\Sigma}$ is to use the matrices $\mathscr{S}=(\{\tilde{A}_{\sigma}^{d},\tilde{B}_{\sigma}^{d}\}_{\sigma\in\Sigma},\tilde{C}^{d},\tilde{D}^{d})$ to approximate the sample paths $y^{d}$ , $y^{s}$ of $\mathbf{y}^{d}$ and $\mathbf{y}^{s}$ by $\hat{y}^{d}(t)=\tilde{D}^{d}u(t)+\sum_{v\in\Sigma^{*},\sigma\in\Sigma,|v|<t-1}\tilde{C}^{d}\tilde{A}_{v}^{d}\tilde{B}_{\sigma}^{d}z_{\sigma v}^{u}(t)$ and $\hat{y}^{s}(t)=y(t)-\hat{y}^{d}(t)$ , and to define

[TABLE]

where ${z}^{\hat{y}^{s}}_{v}(t)=\hat{y}^{s}(t-|v|)\mu_{v}(t-1)\frac{1}{\sqrt{p_{v}}}$ for all $v\in\Sigma^{+}$ . We can then view $\Psi_{\mathbf{y}^{s}}^{N}(w)$ as an approximation of $\Psi_{\mathbf{y}^{s}}(w)$ , and $T_{\sigma,\sigma}^{N}$ as an approximation of $E[\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t)(\mathbf{z}^{\mathbf{y}^{s}}_{\sigma}(t))^{T}]$ , $\sigma\in\Sigma$ . We could modify Algorithm 4 by replacing (23) with (25). We conjecture that Lemma 6 remains true for the modified algorithm.
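The scaled, shifted signal $z_{v}(t)$ used in Remark 2 can be computed from recorded samples as in the following Python sketch. It is only an illustration for scalar signals: the data are made up, time is indexed from zero, and the conventions $\mu_{v}(t-1)=\mu_{\sigma_{1}}(t-r)\cdots\mu_{\sigma_{r}}(t-1)$ and $p_{v}=p_{\sigma_{1}}\cdots p_{\sigma_{r}}$ for $v=\sigma_{1}\cdots\sigma_{r}$ are assumed, since they are defined outside this section.

```python
import math


def z(signal, mu, p, v, t):
    """Sample value z_v(t) = signal(t-|v|) * mu_v(t-1) / sqrt(p_v).

    Assumed conventions: for v = (s_1, ..., s_r),
    mu_v(t-1) = mu_{s_1}(t-r) * ... * mu_{s_r}(t-1) and
    p_v = p_{s_1} * ... * p_{s_r}. `signal` is a list of scalars indexed
    by time (starting at 0), and mu[s] is the list of samples of mu_s.
    """
    r = len(v)
    if r == 0 or t - r < 0:
        raise ValueError("need a non-empty word and t >= |v|")
    mu_v = 1.0
    p_v = 1.0
    for i, s in enumerate(v):
        mu_v *= mu[s][t - r + i]
        p_v *= p[s]
    return signal[t - r] * mu_v / math.sqrt(p_v)


# Made-up samples; p[2] is chosen only to keep the arithmetic simple.
signal = [2.0, 3.0, 5.0, 7.0]
mu = {1: [1.0, 1.0, 1.0, 1.0], 2: [0.5, -1.0, 1.5, 0.0]}
p = {1: 1.0, 2: 0.25}

z_2 = z(signal, mu, p, (2,), 2)      # signal[1] * mu_2[1] / sqrt(p_2)
z_12 = z(signal, mu, p, (1, 2), 3)   # signal[1] * mu_1[1] * mu_2[2] / sqrt(p_1 p_2)
```

Averaging products of such samples over $t$ is the kind of empirical covariance that the remark proposes as a replacement for (23).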

7 Numerical example

In this section, we present a numerical example to test the effectiveness of our algorithm. All computations are carried out on a 1.8 GHz Intel Core i5 processor with 8 GB of RAM running MATLAB R2018a.

The quality of the match between estimated and true outputs is quantified on noise-free validation data of length $N_{\mathrm{val}}$ via the Best Fit Rate (BFR) and Variance Accounted For (VAF) criteria, defined for each output channel $y_{i}$ , $i=1,\ldots,n_{y}$ , as

[TABLE]

where $\hat{y}_{i}$ denotes the simulated one-step ahead model output and $\bar{y}_{i}$ denotes the sample mean of the output over the validation set.
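For readers who want to reproduce such scores, the Python sketch below implements BFR and VAF for a single output channel. Since the defining equation is not reproduced here, the code assumes the commonly used forms $\mathrm{BFR}=\max\{1-\|y-\hat{y}\|_{2}/\|y-\bar{y}\|_{2},0\}\times 100\%$ and $\mathrm{VAF}=\max\{1-\mathrm{var}(y-\hat{y})/\mathrm{var}(y),0\}\times 100\%$ ; the paper's exact expressions may differ in detail.

```python
import math


def bfr(y, y_hat):
    """Best Fit Rate in percent for one output channel (assumed standard form)."""
    mean_y = sum(y) / len(y)
    num = math.sqrt(sum((a - b) ** 2 for a, b in zip(y, y_hat)))
    den = math.sqrt(sum((a - mean_y) ** 2 for a in y))
    return max(1.0 - num / den, 0.0) * 100.0


def variance(x):
    """Population variance of a list of scalars."""
    m = sum(x) / len(x)
    return sum((a - m) ** 2 for a in x) / len(x)


def vaf(y, y_hat):
    """Variance Accounted For in percent (assumed standard form)."""
    residual = [a - b for a, b in zip(y, y_hat)]
    return max(1.0 - variance(residual) / variance(y), 0.0) * 100.0


y = [1.0, 2.0, 3.0, 4.0]
perfect = list(y)                  # exact prediction
mean_only = [2.5] * 4              # predicts the sample mean everywhere
shifted = [a + 1.0 for a in y]     # exact shape, constant bias

scores = {
    "perfect": (bfr(y, perfect), vaf(y, perfect)),
    "mean_only": (bfr(y, mean_only), vaf(y, mean_only)),
    "shifted": (bfr(y, shifted), vaf(y, shifted)),
}
```

Under these assumed definitions a constant bias leaves VAF at 100 % while lowering BFR, which is one reason for a BFR value to be lower than the VAF value on the same data.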

The LPV-SSA representation in form (2) is used for data generation with following matrices:

[TABLE]

which corresponds to state dimension $n_{x}=3$ , output dimension $n_{y}=1$ , and scheduling signal dimension $n_{\mu}=2$ with $\Sigma=\{1,2\}$ . Note that the system corresponding to the first local model $\tilde{A}_{1}=A_{1}-K_{1}C$ is not observable, i.e., $\mathrm{rank}([C^{T}(C\tilde{A}_{1})^{T}\ldots(C\tilde{A}^{l-1}_{1})^{T}]^{T})=2<n_{x}$ , whereas observability of this local model is an assumption required in subspace-based approaches (van Wingerden and Verhaegen, 2009).
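The rank test quoted above can be carried out as in the following Python sketch, which stacks $C, C\tilde{A}, \ldots, C\tilde{A}^{l-1}$ and computes the rank exactly over the rationals. The matrices used here are hypothetical stand-ins chosen to give rank 2 with $n_{x}=3$ ; they are not the matrices of the example.

```python
from fractions import Fraction


def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def rank(rows):
    """Rank by Gaussian elimination over exact rationals."""
    m = [[Fraction(x) for x in row] for row in rows]
    r = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][col] / m[r][col]
            m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
    return r


def observability_matrix(C, A, l):
    """Stack C, C A, ..., C A^(l-1) row-wise."""
    blocks, current = [], C
    for _ in range(l):
        blocks.extend(current)
        current = matmul(current, A)
    return blocks


# Hypothetical local model: the third state never reaches the output.
A_tilde = [[Fraction(1, 2), 0, 0],
           [0, Fraction(1, 4), 0],
           [0, 0, Fraction(1, 8)]]
C = [[Fraction(1), Fraction(1), Fraction(0)]]

obs_rank = rank(observability_matrix(C, A_tilde, 3))
```

With these stand-in matrices `obs_rank` is 2, so the pair is not observable in the sense used above.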

Training and noise-free validation output sequences of length $N=100000$ and $N_{\mathrm{val}}=100000$ , respectively, are generated using a white-noise input process $\mathbf{u}$ with uniform distribution $\mathcal{U}(-1.5,1.5)$ and an independent scheduling signal process $\bm{\mu}=[\bm{\mu}_{1}\ \ \bm{\mu}_{2}]$ such that $\bm{\mu}_{1}(t)=1$ and $\bm{\mu}_{2}(t)$ is a white-noise process with uniform distribution $\mathcal{U}(-1.5,1.5)$ . The corresponding parameter values $\{p_{\sigma}\}_{\sigma\in\{1,2\}}$ are $p_{1}=E[\mu^{2}_{1}(t)]=1$ and $p_{2}=E[\mu^{2}_{2}(t)]=0.75$ . The standard deviation of the white Gaussian noise $\mathbf{e}$ corrupting the training output is $1$ , i.e., $\mathbf{e}\sim\mathcal{N}(0,1)$ . This corresponds to the Signal-to-Noise Ratio $\mathrm{SNR}=10\log{\frac{\sum_{t=1}^{N}\left(y(t)-e(t)\right)^{2}}{\sum_{t=1}^{N}e^{2}(t)}}=4.7\ \mathrm{dB}.$
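The value $p_{2}=0.75$ follows from the second moment of a uniform distribution, $E[X^{2}]=(a^{2}+ab+b^{2})/3$ for $X\sim\mathcal{U}(a,b)$ . The Python sketch below checks this with exact arithmetic and also gives a helper for the SNR expression; the helper assumes a base-10 logarithm, as is usual for decibels, and its example inputs are made up.

```python
from fractions import Fraction
import math


def uniform_second_moment(a, b):
    """E[X^2] for X ~ U(a, b), i.e. (a^2 + a*b + b^2) / 3."""
    a, b = Fraction(a), Fraction(b)
    return (a * a + a * b + b * b) / 3


def snr_db(y, e):
    """10*log10( sum (y - e)^2 / sum e^2 ) for scalar sequences y and e."""
    signal = sum((yi - ei) ** 2 for yi, ei in zip(y, e))
    noise = sum(ei ** 2 for ei in e)
    return 10.0 * math.log10(signal / noise)


p1 = Fraction(1)  # mu_1(t) = 1 for all t
p2 = uniform_second_moment(Fraction(-3, 2), Fraction(3, 2))

# Made-up two-sample example: noise-free part (2, -2), noise (1, 1).
example_snr = snr_db([3.0, -1.0], [1.0, 1.0])
```

Here `p2` equals $3/4$ exactly, in agreement with the value stated above.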

We run the version of Algorithm 4 explained in Remark 2, with $\mathcal{I}=50$ iterations and with the following $n$ -selection pairs $(\alpha,\beta)$ and $(\bar{\alpha},\bar{\beta})$ , with $n=3$ ,

[TABLE]

which are used to choose corresponding entries of the Hankel matrices. The mean time taken to run the algorithm is $1.55$ sec.

The validation results using one-step-ahead predicted outputs $\hat{y}$ are reported in Table 1, and true vs. estimated sub-Markov parameters are reported in Table 2. The results show a good match between the estimated model output and the true system output.

8 Conclusion

In this paper, we formulated a realization algorithm and an efficient identification algorithm for stochastic LPV-SSA representations with inputs, by combining the correlation analysis method with a stochastic realization based identification algorithm. The proposed algorithm provides a computationally efficient alternative to the parametric subspace approaches, avoiding the curse of dimensionality.

Appendix A Proofs

A.1 Proof of Lemma 1

Recall from Notation 3 the definition of the Hilbert-space $\mathcal{H}_{1}$ of zero mean square integrable random variables. Let us denote by $\mathcal{H}_{t,+}^{\mathbf{u}}$ , the closed subspace of $\mathcal{H}_{1}$ generated by the components of $\{\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{u}(t)\}$ .

Lemma 7

With the assumptions and notations of Lemma 1, $\mathbf{v}(t)\bm{\mu}_{\sigma}(t)$ for all $\sigma\in\Sigma$ , is orthogonal to $\mathcal{H}_{t,+}^{\mathbf{u}}$ .

Proof of Lemma 7. Since $\mathbf{r}(t):=\begin{bmatrix}\mathbf{v}^{T}(t)&\mathbf{u}^{T}(t)\end{bmatrix}^{T}$ is a white noise process w.r.t. $\bm{\mu}$ , it follows that $E[\mathbf{r}(t+1)(\mathbf{z}^{\mathbf{r}}_{w}(t+1))^{T}]=0$ for all $w\in\Sigma^{+}$ . In particular, as $\mathbf{z}^{\mathbf{r}}_{\sigma}(t+1)=\frac{1}{\sqrt{p_{\sigma}}}\mathbf{r}(t)\bm{\mu}_{\sigma}(t)$ , $E[\mathbf{r}(t+1)(\mathbf{r}(t)\bm{\mu}_{\sigma}(t))^{T}]=0$ and as $E[\mathbf{v}(t)(\mathbf{u}(t+1))^{T}\bm{\mu}_{\sigma}(t)]$ is the transpose of the lower left block of $E[\mathbf{r}(t+1)(\mathbf{r}(t)\bm{\mu}_{\sigma}(t))^{T}]$ , it follows that $E[\mathbf{v}(t)(\mathbf{u}(t+1))^{T}\bm{\mu}_{\sigma}(t)]=0$ .

Since $\mathbf{r}$ is ZMWSSI, it follows that for all $w\in\Sigma^{+}$ , $\sigma_{1},\sigma\in\Sigma$

[TABLE]

Since $\mathbf{r}$ is a white noise process w.r.t. $\bm{\mu}$ , it follows that $E[\mathbf{r}(t)\mathbf{z}_{w}^{\mathbf{r}}(t)]=0$ for all $w\in\Sigma^{+}$ .

Since $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{w\sigma_{1}}(t+1))^{T}]$ is the upper right block of $E[\mathbf{r}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{r}}_{w\sigma_{1}}(t+1))^{T}]$ , it follows that $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{w\sigma_{1}}(t+1))^{T}]=0$ for all $w\in\Sigma^{+}$ , $\sigma\in\Sigma$ .

That is, we have shown that $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{u}(t+1))^{T}]=0$ , $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{w\sigma_{1}}(t+1))^{T}]=0$ for all $w\in\Sigma^{+}$ , $\sigma,\sigma_{1}\in\Sigma$ . It is left to show that $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{\sigma_{1}}(t+1))^{T}]=0$ . Note that $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{\sigma_{1}}(t+1))^{T}]$ is the upper right block of $p_{\sigma}E[\mathbf{z}^{\mathbf{r}}_{\sigma}(t+1)(\mathbf{z}_{w\sigma_{1}}^{\mathbf{r}}(t+1))^{T}]$ , and the latter equals zero if $\sigma_{1}\neq\sigma$ . Hence, for $\sigma\neq\sigma_{1}$ , $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{\sigma_{1}}(t+1))^{T}]=0$ . If $\sigma=\sigma_{1}$ , then $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{\sigma}(t+1))^{T}]=\frac{1}{\sqrt{p_{\sigma}}}E[\mathbf{v}(t)\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)]$ , and from Definition 5 it follows that $E[\mathbf{v}(t)\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)]=0$ , $\sigma\in\Sigma$ . That is, $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{\sigma}(t+1))^{T}]=0$ .

To summarize, we have shown that $E[\mathbf{v}(t)(\mathbf{u}(t+1))^{T}\bm{\mu}_{\sigma}(t)]=0$ , $E[\mathbf{v}(t)\bm{\mu}_{\sigma}(t)(\mathbf{z}^{\mathbf{u}}_{w\sigma_{1}}(t+1))^{T}]=0$ for all $w\in\Sigma^{*}$ , $\sigma,\sigma_{1}\in\Sigma$ . Since $\mathbf{u}(t+1)$ , $\mathbf{z}^{\mathbf{u}}_{w\sigma_{1}}(t+1)$ , $w\in\Sigma^{*}$ , $\sigma_{1}\in\Sigma$ generate $\mathcal{H}_{t+1,+}^{\mathbf{u}}$ , the statement of the lemma follows. $\blacksquare$

Let us denote by $\mathcal{H}_{t}^{\mathbf{u}}$ , the closed subspace generated by the components of $\{\mathbf{z}^{\mathbf{u}}_{w}(t)\}_{w\in\Sigma^{+}}$ . It is clear that $\mathcal{H}_{t}^{\mathbf{u}}\subseteq\mathcal{H}_{t,+}^{\mathbf{u}}$ .

Lemma 8

For any $w\in\Sigma^{+}$ , the components of $\mathbf{z}_{w}^{\mathbf{v}}(t)$ are orthogonal to $\mathcal{H}_{t+k,+}^{\mathbf{u}}$ , $k\geq 0$ .

Proof of Lemma 8. Let us consider the case $k=0$ . Since $\mathbf{r}(t):=\begin{bmatrix}\mathbf{v}(t)^{T}&\mathbf{u}(t)^{T}\end{bmatrix}^{T}$ is a ZMWSII process, from (Petreczky and Vidal, 2018, Lemma 7) it follows that $E[\mathbf{z}_{w}^{\mathbf{r}}(t)(\mathbf{z}^{\mathbf{r}}_{v}(t))^{T}]=0$ for all $v\in\Sigma^{+}$ , $v\neq w$ , and if $v=w$ and $\sigma$ is the first letter of $w$ , then $E[\mathbf{z}_{w}^{\mathbf{r}}(t)(\mathbf{z}^{\mathbf{r}}_{w}(t))^{T}]=E[\mathbf{z}_{\sigma}^{\mathbf{r}}(t)(\mathbf{z}^{\mathbf{r}}_{\sigma}(t))^{T}]$ .

Since $E[\mathbf{z}_{w}^{\mathbf{v}}(t)(\mathbf{z}^{\mathbf{u}}_{v}(t))^{T}]$ is the upper right block of $E[\mathbf{z}_{w}^{\mathbf{r}}(t)(\mathbf{z}^{\mathbf{r}}_{v}(t))^{T}]$ , it follows that $E[\mathbf{z}_{w}^{\mathbf{v}}(t)(\mathbf{z}^{\mathbf{u}}_{v}(t))^{T}]=0$ if $v\neq w$ and $E[\mathbf{z}_{w}^{\mathbf{v}}(t)(\mathbf{z}^{\mathbf{u}}_{w}(t))^{T}]=E[\mathbf{z}_{\sigma}^{\mathbf{v}}(t)(\mathbf{z}^{\mathbf{u}}_{\sigma}(t))^{T}]=\frac{1}{p_{\sigma}}E[\mathbf{u}(t-1)\mathbf{v}(t-1)\bm{\mu}_{\sigma}^{2}(t-1)]$ , and from Definition 5, it follows that the latter is zero. That is, $E[\mathbf{z}_{w}^{\mathbf{v}}(t)(\mathbf{z}^{\mathbf{u}}_{v}(t))^{T}]=0$ for all $v\in\Sigma^{+}$ .

Finally, as $\mathbf{r}(t):=\begin{bmatrix}\mathbf{v}(t)^{T}&\mathbf{u}(t)^{T}\end{bmatrix}^{T}$ is a white noise process w.r.t. $\bm{\mu}$ , it follows that $E[\mathbf{z}_{w}^{\mathbf{r}}(t)(\mathbf{r}(t))^{T}]=0$ , and since $E[\mathbf{z}_{w}^{\mathbf{v}}(t)(\mathbf{u}(t))^{T}]$ is the upper right block of $E[\mathbf{z}_{w}^{\mathbf{r}}(t)(\mathbf{r}(t))^{T}]$ , it then follows that $E[\mathbf{z}_{w}^{\mathbf{v}}(t)(\mathbf{u}(t))^{T}]=0$ . Since $\mathbf{z}_{w}^{\mathbf{v}}(t)$ is orthogonal to the components of the random variables which generate $\mathcal{H}_{t,+}^{\mathbf{u}}$ , the statement of the lemma follows for $k=0$ .

Consider now the case $k>0$ . As $\bm{\mu}_{1}=1$ and $p_{1}=1$ , it follows that $\mathbf{z}_{w}^{\mathbf{v}}(t)=\mathbf{z}_{w\underline{1\cdots 1}_{k}}^{\mathbf{v}}(t+k)$ (where $\underline{1\cdots 1}_{k}$ denotes the word of length $k$ consisting of $1$ s), and $\mathbf{z}_{w\underline{1\cdots 1}_{k}}^{\mathbf{v}}(s)$ is orthogonal to $\mathcal{H}_{s,+}^{\mathbf{u}}$ , according to the case $k=0$ . By taking $s=t+k$ for $k>0$ , the statement of the lemma follows. $\blacksquare$

Lemma 9

The components of $\mathbf{x}^{d}(t)$ belong to $\mathcal{H}_{t}^{\mathbf{u}}$ and

[TABLE]

where the right-hand side of (27) converges in the mean square sense.

Proof of Lemma 9. It is clear from the definition that the components of $\mathbf{x}^{d}(t)$ belong to $\mathcal{H}_{t,+}^{\mathbf{u}}$ . Since $\mathbf{x}(t)=\sum_{w\in\Sigma^{*},\sigma\in\Sigma}\sqrt{p_{\sigma w}}A_{w}\left(K_{\sigma}\mathbf{z}^{\mathbf{v}}_{\sigma w}(t)+B_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)\right)$ and the map $z\mapsto E_{l}[z\mid M]$ (where $z\in\mathcal{H}_{1}$ ) is a continuous linear operator for any closed subspace $M$ , it follows that

[TABLE]

From Lemma 8 it follows that $E_{l}[\mathbf{z}^{\mathbf{v}}_{\sigma w}(t)\mid\mathcal{H}_{t,+}^{\mathbf{u}}]=0$ , and since the components of $\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)$ belong to $\mathcal{H}_{t,+}^{\mathbf{u}}$ , it follows that $E_{l}[\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)\mid\mathcal{H}_{t,+}^{\mathbf{u}}]=\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)$ .

Hence,

[TABLE]

Since the components of $\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)$ belong to $\mathcal{H}_{t}^{\mathbf{u}}$ , it follows that the components of the right-hand side of (28) belong to $\mathcal{H}_{t}^{\mathbf{u}}$ and hence the components of $\mathbf{x}^{d}(t)$ belong to $\mathcal{H}_{t}^{\mathbf{u}}$ . Note that the convergence of the right-hand side of (28) in the mean square sense follows from the convergence of the series $\sum_{w\in\Sigma^{*},\sigma\in\Sigma}\sqrt{p_{\sigma w}}A_{w}\left(K_{\sigma}\mathbf{z}^{\mathbf{v}}_{\sigma w}(t)+B_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)\right)$ . $\blacksquare$

Lemma 10

The components of $\mathbf{x}^{s}(t)$ belong to $\mathcal{H}_{t}^{\mathbf{v}}$ , they are orthogonal to $\mathcal{H}_{t+k,+}^{\mathbf{u}}$ for any $k\geq 0$ and

[TABLE]

where the right-hand side converges in the mean-square sense.

Proof of Lemma 10.

From (27), $\mathbf{x}^{s}(t)=\mathbf{x}(t)-\mathbf{x}^{d}(t)$ and $\mathbf{x}(t)\!\!=\!\!\sum_{w\in\Sigma^{*},\sigma\in\Sigma}\sqrt{p_{\sigma w}}A_{w}\left(K_{\sigma}\mathbf{z}^{\mathbf{v}}_{\sigma w}(t)+B_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)\right)$ , it follows that (29) holds and that its right-hand side converges in the mean square sense. From Lemma 8, it follows that for any $w\in\Sigma^{+}$ , the components of $\mathbf{z}_{w}^{\mathbf{v}}(t)$ are orthogonal to $\mathcal{H}_{t+k,+}^{\mathbf{u}}$ , hence all the summands of the infinite series of (29) are orthogonal to $\mathcal{H}_{t+k,+}^{\mathbf{u}}$ . $\blacksquare$

Finally, we give the proof of Lemma 1 (Decomposition of $\mathbf{y}$ ).

Proof of Lemma 1. It follows that

[TABLE]

Note that, $\mathbf{u}(t)\bm{\mu}_{\sigma}(t)=\sqrt{p_{\sigma}}\mathbf{z}^{\mathbf{u}}_{\sigma}(t+1)$ , hence the components of $\mathbf{u}(t)\bm{\mu}_{\sigma}(t)$ belong to $\mathcal{H}_{t+1,+}^{\mathbf{u}}$ and therefore

[TABLE]

We claim that,

[TABLE]

Since $\mathbf{x}(t)=\mathbf{x}^{d}(t)+\mathbf{x}^{s}(t)$ , it follows that $\mathbf{x}(t)\bm{\mu}_{\sigma}(t)=\mathbf{x}^{d}(t)\bm{\mu}_{\sigma}(t)+\mathbf{x}^{s}(t)\bm{\mu}_{\sigma}(t)$ .

From (Petreczky and Vidal, 2018, Lemma 9) and Lemma 9–10, it follows that

[TABLE]

From Lemma 8, it follows that $\mathbf{z}^{\mathbf{v}}_{\sigma^{\prime}w\sigma}(t+1)$ is orthogonal to $\mathcal{H}_{t+1,+}^{\mathbf{u}}$ for all $w\in\Sigma^{+}$ , $\sigma^{\prime},\sigma\in\Sigma$ , and hence $\mathbf{x}^{s}(t)\bm{\mu}_{\sigma}(t)$ is also orthogonal to $\mathcal{H}_{t+1,+}^{\mathbf{u}}$ . Moreover, since the components of $\mathbf{z}^{\mathbf{u}}_{\sigma^{\prime}w\sigma}(t+1)$ belong to $\mathcal{H}_{t+1,+}^{\mathbf{u}}$ , it follows that $\mathbf{x}^{d}(t)\bm{\mu}_{\sigma}(t)$ belongs to $\mathcal{H}_{t+1,+}^{\mathbf{u}}$ . Hence,

[TABLE]

Finally, from Lemma 8, it follows that $\mathbf{v}(t)\bm{\mu}_{\sigma}(t)=\sqrt{p_{\sigma}}\mathbf{z}_{\sigma}^{\mathbf{v}}(t+1)$ is orthogonal to $\mathcal{H}_{t+1,+}^{\mathbf{u}}$ , and hence

[TABLE]

By collecting all these facts, we can show that

[TABLE]

That is, the first equation of (1) holds.

As to the second equation of (1), notice that from Definition 6,

[TABLE]

Since the components of $\mathbf{u}(t)$ are among the generators of $\mathcal{H}_{t,+}^{\mathbf{u}}$ , and by Lemma 7, $\mathbf{v}(t)=\mathbf{v}(t)\bm{\mu}_{1}(t)$ is orthogonal to $\mathcal{H}_{t,+}^{\mathbf{u}}$ , it follows that $E_{l}[\mathbf{v}(t)\mid\mathcal{H}_{t,+}^{\mathbf{u}}]=0$ and $E_{l}[\mathbf{u}(t)\mid\mathcal{H}_{t,+}^{\mathbf{u}}]=\mathbf{u}(t)$ . It then follows that the second equation of (1) holds.

From $\mathbf{y}^{s}(t)=\mathbf{y}(t)-\mathbf{y}^{d}(t)$ , $\mathbf{x}^{s}(t)=\mathbf{x}(t)-\mathbf{x}^{d}(t)$ and (1), (1) follows.

It is left to show that $(\{A_{\sigma},B_{\sigma}\}_{\sigma\in\Sigma},C,D,\mathbf{x}^{d},\mathbf{u})$ and $(\{A_{\sigma},K_{\sigma}\}_{\sigma\in\Sigma},C,I_{n_{y}},\mathbf{x}^{s},\mathbf{v})$ are stationary LPV-SSA representations without inputs as per Definition 4.

Since $\sum_{\sigma\in\Sigma}p_{\sigma}A_{\sigma}\otimes A_{\sigma}$ is stable and $\mathbf{u}$ and $\mathbf{v}$ are both white noise processes w.r.t. $\bm{\mu}$ , the only thing which needs to be shown is that $\begin{bmatrix}(\mathbf{x}^{d})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ and $\begin{bmatrix}(\mathbf{x}^{s})^{T}&\mathbf{v}^{T}\end{bmatrix}^{T}$ are ZMWSII. However, the latter follows from (27), (29) and (Petreczky and Vidal, 2018, Lemma 3). $\blacksquare$

A.2 Proof of Lemma 2

We show that $(\{\hat{A}_{\sigma},\hat{B}_{\sigma},\hat{K}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $\mathbf{y}$ with input $\mathbf{u}$, where $(\{\hat{A}_{\sigma},\hat{B}_{\sigma},\hat{K}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{e}^{s})$ is as defined in (11)-(12).

To this end, assume that $(\{A_{\sigma},B_{\sigma},K_{\sigma}\}_{\sigma\in\Sigma},C,D,\mathbf{x},\mathbf{v})$ is a stationary LPV-SSA representation of $\mathbf{y}$ with input $\mathbf{u}$. (By the assumption of Lemma 2, such an LPV-SSA representation exists.) Recall from Notation 3 that $\mathcal{H}_{1}$ denotes the Hilbert space of zero mean square integrable random variables. Denote by $\mathcal{H}_{t,+}^{\mathbf{v}}$ the closed subspace of the Hilbert space $\mathcal{H}_{1}$ generated by the components of $\{\mathbf{z}^{\mathbf{v}}_{w}(t)\}_{w\in\Sigma^{+}}\cup\{\mathbf{v}(t)\}$, and denote by $\mathcal{H}_{t}^{\mathbf{v}}$ the Hilbert space generated by the components of $\{\mathbf{z}^{\mathbf{v}}_{w}(t)\}_{w\in\Sigma^{+}}$. We prove the following lemmas.

Lemma 11

Assume that $(\{A_{\sigma},B_{\sigma},K_{\sigma}\}_{\sigma\in\Sigma},C,D,\mathbf{x},\mathbf{v})$ is a stationary LPV-SSA representation of $\mathbf{y}$ with input $\mathbf{u}$. Then the components of $\mathbf{y}^{s}(t),\mathbf{z}^{\mathbf{y}^{s}}_{v}(t)$, $\mathbf{e}^{s}(t)$, $\mathbf{z}^{\mathbf{e}^{s}}_{v}(t)$, $v\in\Sigma^{+}$, belong to $\mathcal{H}_{t,+}^{\mathbf{v}}$.

Proof of Lemma 11. Recall from Lemma 9,

[TABLE]

and hence,

[TABLE]

That is, the components of $\mathbf{y}^{s}(t)$ belong to $\mathcal{H}_{t,+}^{\mathbf{v}}$ . In particular, from (Petreczky and Vidal, 2018, Lemma 11), it follows that the coordinates of $\mathbf{z}^{\mathbf{y}^{s}}_{w}(t)$ belong to $\mathcal{H}_{t,+}^{\mathbf{v}}$ and hence, $\mathcal{H}_{t}^{\mathbf{y}^{s}}\subseteq\mathcal{H}_{t}^{\mathbf{v}}$ . Since $\mathbf{e}^{s}(t)=\mathbf{y}^{s}(t)-E_{l}[\mathbf{y}^{s}(t)\mid\mathcal{H}_{t}^{\mathbf{y}^{s}}]$ , it follows that the components of $\mathbf{e}^{s}(t)$ belong to $\mathcal{H}_{t,+}^{\mathbf{v}}$ . Since $\mathbf{z}_{v}^{\mathbf{v}}(t)=\mathbf{z}_{v1}^{\mathbf{v}}(t+1)$ , $\mathbf{v}(t)=\mathbf{z}_{1}^{\mathbf{v}}(t+1)$ , it follows that $\mathcal{H}_{t,+}^{\mathbf{v}}\subseteq\mathcal{H}_{t+1}^{\mathbf{v}}$ and from (Petreczky and Vidal, 2018, Lemma 11) it follows that the components of $\mathbf{z}_{v}^{\mathbf{e}^{s}}(t)$ belong to $\mathcal{H}_{t}^{\mathbf{v}}\subseteq\mathcal{H}_{t,+}^{\mathbf{v}}$ .
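The index shifts $\mathbf{v}(t)=\mathbf{z}_{1}^{\mathbf{v}}(t+1)$ and $\mathbf{z}_{v}^{\mathbf{v}}(t)=\mathbf{z}_{v1}^{\mathbf{v}}(t+1)$ used in the last step can be checked on a sample path. The sketch below is only an illustration under an assumed convention: for $w=\sigma_{1}\cdots\sigma_{k}$ it takes $\mathbf{z}_{w}^{\mathbf{v}}(t)=\mathbf{v}(t-k)\bm{\mu}_{\sigma_{1}}(t-k)\cdots\bm{\mu}_{\sigma_{k}}(t-1)/\sqrt{p_{w}}$ with $\bm{\mu}_{1}\equiv 1$ and $p_{1}=1$, which agrees with the relation $\mathbf{v}(t)\bm{\mu}_{\sigma}(t)=\sqrt{p_{\sigma}}\mathbf{z}_{\sigma}^{\mathbf{v}}(t+1)$ used in the proof of Lemma 1. The helper `z` and all numerical values are hypothetical.

```python
import numpy as np

def z(v, mu, p, w, t):
    """Assumed convention: for w = (s1, ..., sk),
    z_w(t) = v(t-k) * mu_{s1}(t-k) * ... * mu_{sk}(t-1) / sqrt(p_w),
    where p_w = p[s1] * ... * p[sk]."""
    k = len(w)
    scale = 1.0
    for j, s in enumerate(w):
        scale *= mu[s][t - k + j] / np.sqrt(p[s])
    return v[t - k] * scale

rng = np.random.default_rng(0)
T = 20
v = rng.standard_normal(T)                       # scalar noise sample path
mu = {1: np.ones(T), 2: rng.uniform(-1, 1, T)}   # mu_1 is identically 1
p = {1: 1.0, 2: 1.0 / 3.0}                       # p_sigma = E[mu_sigma^2] (illustrative)

t = 10
w = (2, 1, 2)
# v(t) = z_1(t+1)
shift_noise = bool(np.isclose(v[t], z(v, mu, p, (1,), t + 1)))
# z_w(t) = z_{w1}(t+1): appending the symbol 1 and advancing time changes nothing
shift_word = bool(np.isclose(z(v, mu, p, w, t), z(v, mu, p, w + (1,), t + 1)))
print(shift_noise, shift_word)
```

Both checks hold pathwise, because appending the symbol $1$ only multiplies by $\bm{\mu}_{1}(t)/\sqrt{p_{1}}=1$.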

Lemma 12

If $\mathbf{y}$ has a realization by a stationary LPV-SSA representation with input $\mathbf{u}$ , then the components of $\mathbf{y}^{s}(t),\mathbf{z}^{\mathbf{y}^{s}}_{v}(t),\mathbf{e}^{s}(t),\mathbf{z}^{\mathbf{e}^{s}}_{v}(t)$ , $v\in\Sigma^{+}$ are orthogonal to $\mathcal{H}_{t,+}^{\mathbf{u}}$ , i.e., for all $v,w\in\Sigma^{+}$

[TABLE]

Proof of Lemma 12. From Lemmas 7–8 and by noticing that $\mathbf{v}(t)=\mathbf{v}(t)\bm{\mu}_{1}(t)$, it follows that the elements of $\mathcal{H}_{t,+}^{\mathbf{v}}$ are orthogonal to $\mathcal{H}_{t,+}^{\mathbf{u}}$. Hence, the coordinates of $\mathbf{u}(t)$, $\mathbf{z}_{w}^{\mathbf{u}}(t)$, $w\in\Sigma^{+}$, are orthogonal to $\mathcal{H}_{t,+}^{\mathbf{v}}$. Since the coordinates of $\mathbf{y}^{s}(t),\mathbf{z}^{\mathbf{y}^{s}}_{v}(t),\mathbf{e}^{s}(t),\mathbf{z}^{\mathbf{e}^{s}}_{v}(t)$ belong to $\mathcal{H}_{t,+}^{\mathbf{v}}$, it follows that the coordinates of $\mathbf{y}^{s}(t),\mathbf{z}^{\mathbf{y}^{s}}_{v}(t),\mathbf{e}^{s}(t),\mathbf{z}^{\mathbf{e}^{s}}_{v}(t)$ are orthogonal to $\mathcal{H}_{t,+}^{\mathbf{u}}$. Since $\mathcal{H}_{t,+}^{\mathbf{u}}$ is generated by the coordinates of $\mathbf{u}(t)$, $\mathbf{z}_{w}^{\mathbf{u}}(t)$, $w\in\Sigma^{+}$, (32) follows.

Lemma 13

$\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ is a white noise process w.r.t. $\bm{\mu}$ and $E[\mathbf{e}^{s}(t)\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)]=0$ for all $\sigma\in\Sigma$.

Proof of Lemma 13. In order to prove the statement of the lemma, we first show that $\mathbf{r}(t)=\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ is ZMWSII, by showing that $\mathbf{r}$ satisfies the conditions of Definition 1 one by one. First, we show that the processes $\mathbf{r}(t),\mathbf{z}_{w}^{\mathbf{r}}(t)$, $w\in\Sigma^{+}$, are zero mean and square integrable.

Note that $\mathbf{u}$ is a white noise process w.r.t. $\bm{\mu}$; in particular, it is a ZMWSII process and hence $\mathbf{u}(t),\mathbf{z}_{w}^{\mathbf{u}}(t)$, $w\in\Sigma^{+}$, are zero mean and square integrable. From the fact that $\Sigma_{s}$ is a stationary LPV-SSA representation of $\mathbf{y}^{s}$ it follows that $\mathbf{e}^{s}$ is also a white noise process w.r.t. $\bm{\mu}$; in particular, it is also ZMWSII and thus $\mathbf{e}^{s}(t),\mathbf{z}_{w}^{\mathbf{e}^{s}}(t)$, $w\in\Sigma^{+}$, are zero mean and square integrable. From this it follows that $\mathbf{r}(t)=\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ and $\mathbf{z}_{w}^{\mathbf{r}}(t)=\begin{bmatrix}(\mathbf{z}^{\mathbf{e}^{s}}_{w}(t))^{T}&(\mathbf{z}^{\mathbf{u}}_{w}(t))^{T}\end{bmatrix}^{T}$ are zero mean and square integrable.

From Lemma 11 it follows that $\mathbf{e}^{s}(t)$ belongs to $\mathcal{H}_{t,+}^{\mathbf{v}}$, where $\mathbf{v}$ is a noise process of a stationary LPV-SSA representation of $\mathbf{y}$ with input $\mathbf{u}$. From the definition of a stationary LPV-SSA representation it then follows that $\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ is ZMWSII. Hence, with the notation of Definition 1, the $\sigma$-algebras $\mathcal{F}_{t}^{\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}$ and $\mathcal{F}_{t}^{\bm{\mu},+}$ are conditionally independent w.r.t. $\mathcal{F}_{t}^{\bm{\mu},-}$.

From the fact that $\mathbf{e}^{s}(t)$ belongs to $\mathcal{H}_{t,+}^{\mathbf{v}}$ it follows that $\mathbf{e}^{s}(t)$ is measurable with respect to the $\sigma$-algebra generated by $\{\mathbf{v}(t)\}\cup\{\mathbf{z}_{v}^{\mathbf{v}}(t)\mid v\in\Sigma^{+}\}$, and the latter $\sigma$-algebra is a subset of $\mathcal{F}^{\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}\lor\mathcal{F}^{\bm{\mu},-}_{t}$, where for two $\sigma$-algebras $\mathcal{F}_{i}$, $i=1,2$, $\mathcal{F}_{1}\lor\mathcal{F}_{2}$ denotes the smallest $\sigma$-algebra containing $\mathcal{F}_{1}$ and $\mathcal{F}_{2}$. That is, $\mathbf{e}^{s}(t)$ is measurable w.r.t. the $\sigma$-algebra $\mathcal{F}^{\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}\lor\mathcal{F}^{\bm{\mu},-}_{t}$, and hence $\mathcal{F}_{t}^{\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}\subseteq\mathcal{F}^{\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}\lor\mathcal{F}^{\bm{\mu},-}_{t}$. Since $\mathcal{F}^{\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}$ and $\mathcal{F}^{\bm{\mu},+}_{t}$ are conditionally independent w.r.t. $\mathcal{F}^{\bm{\mu},-}_{t}$, from (van Putten and van Schuppen, 1985, Proposition 2.4) it follows that $\mathcal{F}^{\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}\lor\mathcal{F}^{\bm{\mu},-}_{t}$ and $\mathcal{F}^{\bm{\mu},+}_{t}$ are conditionally independent w.r.t. $\mathcal{F}^{\bm{\mu},-}_{t}$. As $\mathcal{F}^{\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}\subseteq\mathcal{F}^{\begin{bmatrix}\mathbf{v}^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}\lor\mathcal{F}^{\bm{\mu},-}_{t}$, it follows that $\mathcal{F}^{\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}}_{t}$ and $\mathcal{F}^{\bm{\mu},+}_{t}$ are conditionally independent w.r.t. $\mathcal{F}^{\bm{\mu},-}_{t}$.

Finally, from (32) it follows that the processes $\mathbf{r}(t),\mathbf{z}_{w}^{\mathbf{r}}(t)$, $w\in\Sigma^{+}$, with $\mathbf{r}(t)=\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$, are jointly wide-sense stationary, i.e., for all $s,t\in\mathbb{Z}$, $s\leq t$, $v,w\in\Sigma^{+}$,

[TABLE]

Indeed, $\mathbf{e}^{s}$ and $\mathbf{u}$ are ZMWSII, hence the processes $\mathbf{e}^{s}(t),\mathbf{z}_{w}^{\mathbf{e}^{s}}(t)$, $w\in\Sigma^{+}$, are jointly wide-sense stationary and the processes $\mathbf{u}(t),\mathbf{z}_{w}^{\mathbf{u}}(t)$, $w\in\Sigma^{+}$, are also jointly wide-sense stationary. Together with (32), it follows that

[TABLE]

Above, we used the fact that if $s<t$ , then $\mathbf{u}(s+k)=\mathbf{z}_{h}^{\mathbf{u}}(t+k+1)$ , $\mathbf{e}^{s}(s+k)=\mathbf{z}_{h}^{\mathbf{e}^{s}}(t+k+1)$ , $\mathbf{u}(s)=\mathbf{z}_{h}^{\mathbf{u}}(t+1)$ , $\mathbf{e}^{s}(s)=\mathbf{z}_{h}^{\mathbf{e}^{s}}(t+1)$ , $\mathbf{z}^{\mathbf{u}}_{w}(s+k)=\mathbf{z}_{wh}^{\mathbf{u}}(t+k)$ , $\mathbf{z}_{v}^{\mathbf{e}^{s}}(s+k)=\mathbf{z}_{vh}^{\mathbf{e}^{s}}(t+k)$ , where $h=\underbrace{1\cdots 1}_{t-s}$ .

That is, we have shown that $\mathbf{r}(t)=\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ satisfies all the conditions of Definition 1.

Next we show that $\mathbf{r}(t)$ is a white noise process w.r.t. $\bm{\mu}$ , i.e., $E[\mathbf{r}(t)(\mathbf{z}^{\mathbf{r}}_{w}(t))^{T}]=0$ for all $w\in\Sigma^{+}$ . From (32) it follows that

[TABLE]

Notice that $\mathbf{e}^{s}$ is a white noise process w.r.t. $\bm{\mu}$, since $\Sigma_{s}=(\{\hat{A}^{s}_{i},\hat{K}^{s}_{i}\}_{i=1}^{n_{\mu}},\hat{C}^{s},I_{n_{y}},\hat{\mathbf{x}}^{s},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $\mathbf{y}^{s}$ without inputs, and hence $E\left[\mathbf{e}^{s}(t)(\mathbf{z}_{w}^{\mathbf{e}^{s}}(t))^{T}\right]=0$. Furthermore, $\mathbf{u}$ is a white noise process w.r.t. $\bm{\mu}$ by assumption, so $E\left[\mathbf{u}(t)(\mathbf{z}_{w}^{\mathbf{u}}(t))^{T}\right]=0$. Hence, $E\left[\mathbf{r}(t)(\mathbf{z}^{\mathbf{r}}_{w}(t))^{T}\right]=0$.

It is left to show that $E[\mathbf{e}^{s}(t)\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)]=0$ for all $\sigma\in\Sigma$ . Notice that $E[\mathbf{e}^{s}(t)\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)]=E[\mathbf{z}^{\mathbf{e}^{s}}_{\sigma}(t+1)(\mathbf{z}^{\mathbf{u}}_{\sigma}(t+1))^{T}]p_{\sigma}$ by definition, and from (32) it follows that $E[\mathbf{z}^{\mathbf{e}^{s}}_{\sigma}(t+1)(\mathbf{z}^{\mathbf{u}}_{\sigma}(t+1))^{T}]=0$ .
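The identity invoked "by definition" can be written out in one line. The display below is a sketch that assumes the same convention as in the proof of Lemma 1, namely $\mathbf{r}(t)\bm{\mu}_{\sigma}(t)=\sqrt{p_{\sigma}}\,\mathbf{z}^{\mathbf{r}}_{\sigma}(t+1)$ for $\mathbf{r}\in\{\mathbf{e}^{s},\mathbf{u}\}$:

```latex
\begin{align*}
E\big[\mathbf{z}^{\mathbf{e}^{s}}_{\sigma}(t+1)(\mathbf{z}^{\mathbf{u}}_{\sigma}(t+1))^{T}\big]\,p_{\sigma}
&= E\Big[\tfrac{1}{\sqrt{p_{\sigma}}}\,\mathbf{e}^{s}(t)\bm{\mu}_{\sigma}(t)\cdot
         \tfrac{1}{\sqrt{p_{\sigma}}}\,\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}(t)\Big]\,p_{\sigma} \\
&= E\big[\mathbf{e}^{s}(t)\mathbf{u}^{T}(t)\bm{\mu}_{\sigma}^{2}(t)\big].
\end{align*}
```

The two factors $1/\sqrt{p_{\sigma}}$ cancel against $p_{\sigma}$, so the orthogonality in (32) transfers directly to the weighted cross-covariance.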

Lemma 14

$\sum_{\sigma\in\Sigma}p_{\sigma}\hat{A}_{\sigma}\otimes\hat{A}_{\sigma}$ is stable, where $\hat{A}_{\sigma}=\mathrm{diag}(\hat{A}^{d}_{\sigma},\hat{A}^{s}_{\sigma})$.

Proof of Lemma 14. From (Costa et al., 2005, Proposition 2.6), it follows that $\sum_{\sigma\in\Sigma}p_{\sigma}\hat{A}^{d}_{\sigma}\otimes\hat{A}^{d}_{\sigma}$ and $\sum_{\sigma\in\Sigma}p_{\sigma}\hat{A}^{s}_{\sigma}\otimes\hat{A}^{s}_{\sigma}$ are stable if there exist $Q^{d},Q^{s}>0$ such that

[TABLE]

It then follows that,

[TABLE]

From the corollary, (Costa et al., 2005, Proposition 2.6), it follows that, $\sum_{\sigma\in\Sigma}p_{\sigma}\hat{A}_{\sigma}\otimes\hat{A}_{\sigma}$ is stable, with $\hat{A}_{\sigma}=\mathrm{diag}(\hat{A}^{d}_{\sigma},\hat{A}^{s}_{\sigma})$ . $\blacksquare$
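The conclusion of Lemma 14 can be checked numerically on a small example. The following sketch is not part of the paper: the matrices and the values of $p_{\sigma}$ are hypothetical, and stability is tested directly through the spectral radius of $\sum_{\sigma}p_{\sigma}A_{\sigma}\otimes A_{\sigma}$ rather than through the Lyapunov-type inequality used in the proof.

```python
import numpy as np

def kron_sum(A_list, p):
    """Return sum_sigma p_sigma * kron(A_sigma, A_sigma)."""
    return sum(ps * np.kron(A, A) for A, ps in zip(A_list, p))

def spectral_radius(M):
    """Largest eigenvalue modulus of a square matrix."""
    return float(np.max(np.abs(np.linalg.eigvals(M))))

def blkdiag(X, Y):
    """Block-diagonal matrix diag(X, Y)."""
    return np.block([
        [X, np.zeros((X.shape[0], Y.shape[1]))],
        [np.zeros((Y.shape[0], X.shape[1])), Y],
    ])

# Hypothetical data: two scheduling symbols, illustrative p_sigma values.
p = [1.0, 0.25]
Ad = [np.array([[0.4, 0.0], [0.1, 0.3]]), np.array([[0.2, 0.1], [0.0, 0.5]])]
As = [np.array([[0.5]]), np.array([[-0.6]])]
Ahat = [blkdiag(X, Y) for X, Y in zip(Ad, As)]  # A_hat_sigma = diag(A^d, A^s)

rho_d = spectral_radius(kron_sum(Ad, p))
rho_s = spectral_radius(kron_sum(As, p))
rho_hat = spectral_radius(kron_sum(Ahat, p))
print(rho_d, rho_s, rho_hat)
```

In this example all three spectral radii are below one. Note that $\hat{A}_{\sigma}\otimes\hat{A}_{\sigma}$ also contains the cross blocks $\hat{A}^{d}_{\sigma}\otimes\hat{A}^{s}_{\sigma}$, which is why the lemma needs an argument (here the block-diagonal Lyapunov certificate) and does not follow from inspecting the two diagonal blocks alone.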

Lemma 15

The processes $\hat{\mathbf{x}},\mathbf{u},\mathbf{e}^{s}$ satisfy all the conditions for noise and state processes of a stationary LPV-SSA representation with no inputs, i.e., $\begin{bmatrix}\hat{\mathbf{x}}^{T}&\mathbf{u}^{T}&(\mathbf{e}^{s})^{T}\end{bmatrix}^{T}$ is ZMWSII, $\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ is a white noise process w.r.t. $\bm{\mu}$, and $E[\hat{\mathbf{x}}(t)(\mathbf{z}^{[\mathbf{u}\ \mathbf{e}^{s}]}_{w}(t))^{T}]=0$ and $E[\mathbf{z}_{\sigma}^{\hat{\mathbf{x}}}(t)(\mathbf{z}^{[\mathbf{u}\ \mathbf{e}^{s}]}_{\sigma}(t))^{T}]=0$.

Proof of Lemma 15. It follows from Lemma 13 that $\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ is a white noise process w.r.t. $\bm{\mu}$.

From (Petreczky and Vidal, 2018, Lemma 2), it follows that

[TABLE]

and

[TABLE]

Thus,

[TABLE]

From Lemma 14, it follows that $\sum_{\sigma\in\Sigma}p_{\sigma}\hat{A}_{\sigma}\otimes\hat{A}_{\sigma}$ is stable. That is, the noise process $\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ and the matrices $\{\hat{A}_{\sigma}\}_{\sigma\in\Sigma}$ satisfy the conditions of a stationary LPV-SSA representation without inputs. From (Petreczky and Vidal, 2018, Lemma 3) it then follows that $\hat{\mathbf{x}}$ is the unique process such that $\begin{bmatrix}\mathbf{u}^{T}&(\mathbf{e}^{s})^{T}\end{bmatrix}^{T}$ satisfies all the conditions of a stationary LPV-SSA representation.

Proof of Lemma 2. It is clear from Lemmas 13, 14 and 15 that $\{\hat{A}_{\sigma}\}_{\sigma\in\Sigma}$ satisfies the conditions of the definition of a stationary LPV-SSA representation without inputs. From Lemma 15, it follows that $\hat{\mathbf{x}}$ and $\begin{bmatrix}(\mathbf{e}^{s})^{T}&\mathbf{u}^{T}\end{bmatrix}^{T}$ satisfy the conditions of an LPV-SSA representation without inputs. From Lemma 13 it follows that the noise process $\mathbf{e}^{s}$ and the input $\mathbf{u}$ satisfy the condition $E[\mathbf{e}^{s}(t)(\mathbf{u}(t))^{T}\bm{\mu}_{\sigma}^{2}(t)]=0$, $\sigma\in\Sigma$. Hence, $(\{\hat{A}_{\sigma},\hat{B}_{\sigma},\hat{K}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{e}^{s})$ is a stationary LPV-SSA representation of $\mathbf{y}$ with input $\mathbf{u}$.

A.3 Proof of Lemma 4


**If $(\{A_{\sigma},B_{\sigma}\}_{\sigma\in\Sigma},C,D,\mathbf{x},\mathbf{u})$ is a stationary LPV-SSA representation of $\mathbf{y}^{d}$ without inputs, then $\Psi_{\mathbf{u},\mathbf{y}}=M_{\mathscr{S}}$, where $\mathscr{S}=(\{A_{\sigma},B_{\sigma}\}_{\sigma\in\Sigma},C,D)$.**

Recall that,

[TABLE]

Consider,

[TABLE]

This follows since $\mathbf{u}$ is a white noise process, so that $E[\mathbf{u}(t)(\mathbf{z}_{w}^{\mathbf{u}}(t))^{T}]=0$, and $E[\mathbf{z}^{\mathbf{u}}_{\sigma s}(t)(\mathbf{z}_{w}^{\mathbf{u}}(t))^{T}]=\Lambda_{\mathbf{u}}$ if $\sigma s=w$, while $E[\mathbf{z}^{\mathbf{u}}_{\sigma s}(t)(\mathbf{z}_{w}^{\mathbf{u}}(t))^{T}]=0$ otherwise.

Similarly, $E[\mathbf{y}^{d}(t)(\mathbf{u}(t))^{T}]=D\Lambda_{\mathbf{u}}$.
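For completeness, this step can be sketched as follows. The derivation assumes the output equation $\mathbf{y}^{d}(t)=C\mathbf{x}(t)+D\mathbf{u}(t)$ of the representation, the series expansion $\mathbf{x}(t)=\sum_{w\in\Sigma^{*},\sigma\in\Sigma}\sqrt{p_{\sigma w}}A_{w}B_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)$ from (Petreczky and Vidal, 2018, Lemma 3), and $\Lambda_{\mathbf{u}}=E[\mathbf{u}(t)\mathbf{u}^{T}(t)]$, as the stated identity requires; exchanging expectation and the infinite sum is taken for granted here.

```latex
\begin{align*}
E[\mathbf{y}^{d}(t)\mathbf{u}^{T}(t)]
&= C\,E[\mathbf{x}(t)\mathbf{u}^{T}(t)] + D\,E[\mathbf{u}(t)\mathbf{u}^{T}(t)] \\
&= C\sum_{w\in\Sigma^{*},\,\sigma\in\Sigma}\sqrt{p_{\sigma w}}\,A_{w}B_{\sigma}\,
   E[\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)\mathbf{u}^{T}(t)] + D\Lambda_{\mathbf{u}} \\
&= D\Lambda_{\mathbf{u}},
\end{align*}
```

where the last equality uses $E[\mathbf{u}(t)(\mathbf{z}^{\mathbf{u}}_{w}(t))^{T}]=0$ for all $w\in\Sigma^{+}$, i.e., the white noise property of $\mathbf{u}$.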

Finally, we recall that $\mathbf{y}(t)=\mathbf{y}^{d}(t)+\mathbf{y}^{s}(t)$. From Lemma 12 it follows that the components of $\mathbf{y}^{s}(t)$ are orthogonal to $\mathcal{H}^{\mathbf{u}}_{t,+}$, that is, $E[\mathbf{y}^{s}(t)(\mathbf{z}^{\mathbf{u}}_{w}(t))^{T}]=0$ and $E[\mathbf{y}^{s}(t)(\mathbf{u}(t))^{T}]=0$. Thus, we have $E[\mathbf{y}(t)(\mathbf{z}^{\mathbf{u}}_{w}(t))^{T}]=E[\mathbf{y}^{d}(t)(\mathbf{z}^{\mathbf{u}}_{w}(t))^{T}]$ and $E[\mathbf{y}(t)(\mathbf{u}(t))^{T}]=E[\mathbf{y}^{d}(t)(\mathbf{u}(t))^{T}]$, and the statement of the lemma follows.

**If $\hat{\mathscr{S}}=(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},\hat{D})$ is a minimal dimensional deterministic LPV-SSA representation such that $M_{\hat{\mathscr{S}}}=\Psi_{\mathbf{u},\mathbf{y}}$, then $(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{u})$ is a stationary LPV-SSA representation (without inputs) of $\mathbf{y}^{d}$.**

Consider the formal power series (Petreczky and Vidal, 2018, Appendix C) $\Psi(w)=\left[M_{\hat{\mathscr{S}}}(1w)\sqrt{p_{1w}}\ \cdots\ M_{\hat{\mathscr{S}}}(n_{\mu}w)\sqrt{p_{{n_{\mu}}w}}\right]$. Let $\tilde{\mathscr{S}}=(\{\tilde{A}_{\sigma},\tilde{B}_{\sigma}\}_{\sigma\in\Sigma},\tilde{C},\tilde{D})$ be any deterministic LPV-SSA representation such that $M_{\tilde{\mathscr{S}}}=\Psi_{\mathbf{u},\mathbf{y}}$. Consider the recognizable representation in the sense of (Petreczky and Vidal, 2018, Appendix C) defined as $R_{\tilde{\mathscr{S}}}=(\{\sqrt{p_{\sigma}}\tilde{A}_{\sigma}\}_{\sigma\in\Sigma},\tilde{B},\tilde{C})$, $\tilde{B}=\left[\sqrt{p_{1}}\tilde{B}_{1}\ \cdots\ \sqrt{p_{n_{\mu}}}\tilde{B}_{n_{\mu}}\right]$. We claim that $R_{\tilde{\mathscr{S}}}$ is a recognizable representation of $\Psi$ (see (Petreczky and Vidal, 2018, Appendix C) for the definition of a recognizable representation of a formal power series), and if $\tilde{\mathscr{S}}$ is a minimal dimensional deterministic LPV-SSA representation such that $M_{\tilde{\mathscr{S}}}=\Psi_{\mathbf{u},\mathbf{y}}$, then $R_{\tilde{\mathscr{S}}}$ is a minimal dimensional representation of $\Psi$. For the definition of the dimension of a recognizable representation, see (Petreczky and Vidal, 2018, Appendix C). We call $R_{\tilde{\mathscr{S}}}$ the recognizable representation associated with the deterministic LPV-SSA representation $\tilde{\mathscr{S}}$.

Indeed, $M_{\tilde{\mathscr{S}}}(\sigma w)\sqrt{p_{\sigma w}}=\sqrt{p_{w}}\tilde{C}\tilde{A}_{w}\tilde{B}_{\sigma}\sqrt{p_{\sigma}}$, hence $\Psi(w)=\tilde{C}F_{w}\tilde{B}$ with $F_{\sigma}=\sqrt{p_{\sigma}}\tilde{A}_{\sigma}$, which by definition (Petreczky and Vidal, 2018, Appendix C) means that $R_{\tilde{\mathscr{S}}}$ is a representation of $\Psi$.
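The scaling identity behind this claim can be verified numerically. The sketch below is illustrative only: the matrices are random, the values of $p_{\sigma}$ are hypothetical, and the helper names are not from the paper. It assumes the matrix product convention $A_{w}=A_{\sigma_{k}}\cdots A_{\sigma_{1}}$ for $w=\sigma_{1}\cdots\sigma_{k}$, $p_{w}=p_{\sigma_{1}}\cdots p_{\sigma_{k}}$, and $M_{\mathscr{S}}(\sigma w)=CA_{w}B_{\sigma}$.

```python
import numpy as np
from itertools import product

def word_product(mats, w):
    """Assumed convention: for w = (s1, ..., sk), M_w = M_{sk} @ ... @ M_{s1};
    the empty word gives the identity."""
    n = next(iter(mats.values())).shape[0]
    out = np.eye(n)
    for s in w:
        out = mats[s] @ out
    return out

def word_prob(p, w):
    """p_w = p_{s1} * ... * p_{sk}; the empty word gives 1."""
    return float(np.prod([p[s] for s in w]))

rng = np.random.default_rng(1)
Sigma = (1, 2)
p = {1: 1.0, 2: 0.5}                       # illustrative p_sigma
n, n_u, n_y = 3, 2, 1
A = {s: 0.5 * rng.standard_normal((n, n)) for s in Sigma}
B = {s: rng.standard_normal((n, n_u)) for s in Sigma}
C = rng.standard_normal((n_y, n))

# Associated recognizable representation: F_sigma = sqrt(p_sigma) A_sigma,
# G = [sqrt(p_1) B_1 ... sqrt(p_{n_mu}) B_{n_mu}].
F = {s: np.sqrt(p[s]) * A[s] for s in Sigma}
G = np.hstack([np.sqrt(p[s]) * B[s] for s in Sigma])

def psi_from_markov(w):
    """Psi(w) built block by block from M(sigma w) * sqrt(p_{sigma w})."""
    return np.hstack([
        np.sqrt(p[s] * word_prob(p, w)) * (C @ word_product(A, w) @ B[s])
        for s in Sigma
    ])

def psi_from_representation(w):
    """Psi(w) = C F_w G."""
    return C @ word_product(F, w) @ G

ok = all(
    np.allclose(psi_from_markov(w), psi_from_representation(w))
    for k in range(4)
    for w in product(Sigma, repeat=k)
)
print(ok)
```

The check covers all words of length at most three; the identity itself holds for every word because $F_{w}=\sqrt{p_{w}}A_{w}$.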

In order to show that $R_{\tilde{\mathscr{S}}}$ is a minimal representation of $\Psi$, if $\tilde{\mathscr{S}}$ is a minimal dimensional deterministic LPV-SSA representation such that $M_{\tilde{\mathscr{S}}}=\Psi_{\mathbf{u},\mathbf{y}}$, we proceed as follows. Consider a recognizable representation $R_{\mathrm{o}}=(\{\bar{F}_{\sigma}\}_{\sigma\in\Sigma},\bar{G},\bar{C})$ of $\Psi$. Define the deterministic LPV-SSA representation $\bar{\mathscr{S}}=(\{\bar{A}_{\sigma},\bar{B}_{\sigma}\}_{\sigma\in\Sigma},\bar{C},D)$, where $\bar{A}_{\sigma}=\frac{1}{\sqrt{p_{\sigma}}}\bar{F}_{\sigma}$, and the matrices $\bar{B}_{\sigma}$ are defined by the partition $\bar{G}=\left[\sqrt{p_{1}}\bar{B}_{1}\ \cdots\ \sqrt{p_{n_{\mu}}}\bar{B}_{n_{\mu}}\right]$. Then, since $\Psi(w)=\bar{C}\bar{F}_{w}\bar{G}$, $w\in\Sigma^{*}$, it follows that $M_{\tilde{\mathscr{S}}}(\sigma w)=\frac{1}{\sqrt{p_{w}}\sqrt{p_{\sigma}}}\bar{C}\bar{F}_{w}(\sqrt{p_{\sigma}}\bar{B}_{\sigma})=\bar{C}\bar{A}_{w}\bar{B}_{\sigma}$, i.e., $M_{\bar{\mathscr{S}}}=M_{\tilde{\mathscr{S}}}=\Psi_{\mathbf{u},\mathbf{y}}$. By assumption, $\tilde{\mathscr{S}}$ is a minimal dimensional deterministic LPV-SSA representation such that $M_{\tilde{\mathscr{S}}}=\Psi_{\mathbf{u},\mathbf{y}}$, hence the dimension of $\bar{\mathscr{S}}$ cannot be smaller than that of $\tilde{\mathscr{S}}$. However, the dimension of $\tilde{\mathscr{S}}$ equals the dimension of the representation $R_{\tilde{\mathscr{S}}}$, and the dimension of $\bar{\mathscr{S}}$ equals the dimension of $R_{\mathrm{o}}$. That is, the dimension of any representation of $\Psi$ cannot be smaller than the dimension of $R_{\tilde{\mathscr{S}}}$, i.e., $R_{\tilde{\mathscr{S}}}$ is minimal.

Since it is assumed that $\mathbf{y}$ has a stationary LPV-SSA representation with input $\mathbf{u}$ , from Lemma 1 it follows that there exists a stationary LPV-SSA representation $(\{A_{\sigma},B_{\sigma}\},C,D,\mathbf{x}^{d},\mathbf{u})$ of $\mathbf{y}^{d}$ without inputs. Then by the first part of this lemma, the deterministic LPV-SSA representation $\mathscr{S}=(\{A_{\sigma},B_{\sigma}\},C,D)$ is such that $M_{\mathscr{S}}=\Psi_{\mathbf{u},\mathbf{y}}$ . Since by definition of a stationary LPV-SSA representation without inputs, $\sum_{\sigma\in\Sigma}p_{\sigma}A_{\sigma}\otimes A_{\sigma}=\sum_{\sigma\in\Sigma}(\sqrt{p_{\sigma}}A_{\sigma})\otimes(\sqrt{p_{\sigma}}A_{\sigma})$ is stable, then using the terminology of (Petreczky and Vidal, 2018, Appendix C), the representation $R_{\mathscr{S}}$ associated with $\mathscr{S}$ is a stable representation. Recall from (Petreczky and Vidal, 2018, Appendix C) that a recognizable representation $R=(\{F_{\sigma}\}_{\sigma\in\Sigma},G,H)$ is stable, if all eigenvalues of the matrix $\sum_{\sigma\in\Sigma}F_{\sigma}\otimes F_{\sigma}$ are inside the open unit disk. Since by the discussion above $R_{\mathscr{S}}$ is a representation of $\Psi$ , by (Petreczky and Vidal, 2018, Theorem 6) $\Psi$ is square summable (see (Petreczky and Vidal, 2018, Appendix C) for the definition of a square summable formal power series).

Consider now the minimal deterministic LPV-SSA representation $\hat{\mathscr{S}}$ from the statement of the lemma. From the discussion above it then follows that the associated recognizable representation $R_{\hat{\mathscr{S}}}=(\{\sqrt{p_{\sigma}}\hat{A}_{\sigma}\}_{\sigma\in\Sigma},\hat{B},\hat{C})$ is a minimal representation of $\Psi$. Since $\Psi$ is square summable and $R_{\hat{\mathscr{S}}}$ is minimal, from (Petreczky and Vidal, 2018, Theorem 6) it follows that $R_{\hat{\mathscr{S}}}$ is stable, which means that all the eigenvalues of $\sum_{\sigma\in\Sigma}p_{\sigma}\hat{A}_{\sigma}\otimes\hat{A}_{\sigma}$ are inside the open unit disk. Notice that $\mathbf{u}$ is a white noise process w.r.t. $\bm{\mu}$ by assumption. Then it follows from (Petreczky and Vidal, 2018, Lemma 3) that, with $\hat{\mathbf{x}}(t)=\sum_{w\in\Sigma^{*},\sigma\in\Sigma}\sqrt{p_{\sigma w}}\hat{A}_{w}\hat{B}_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)$, the tuple $(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{u})$ is a stationary LPV-SSA representation.

It is left to show that $(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{u})$ is a representation of $\mathbf{y}^{d}$. Consider the stationary LPV-SSA representation $(\{A_{\sigma},B_{\sigma}\},C,D,\mathbf{x}^{d},\mathbf{u})$ of $\mathbf{y}^{d}$ without inputs from the discussion above. It then follows from (Petreczky and Vidal, 2018, Lemma 1) that

[TABLE]

where we used that by (Petreczky and Vidal, 2018, Lemma 3) $\hat{\mathbf{x}}(t)=\sum_{w\in\Sigma^{*},\sigma\in\Sigma}\sqrt{p_{\sigma w}}\hat{A}_{w}\hat{B}_{\sigma}\mathbf{z}^{\mathbf{u}}_{\sigma w}(t)$ . This means that $\mathbf{y}^{d}(t)=\hat{C}\hat{\mathbf{x}}(t)+\hat{D}\mathbf{u}(t)$ , and hence $(\{\hat{A}_{\sigma},\hat{B}_{\sigma}\},\hat{C},\hat{D},\hat{\mathbf{x}},\mathbf{u})$ is a representation of $\mathbf{y}^{d}$ .

A.4 Proof of Lemma 5

Assume $\mathscr{S}=(\{\hat{A}_{\sigma},\hat{G}_{\sigma}\}_{\sigma\in\Sigma},\hat{C},I_{n_{y}})$ is an LPV-SSA representation whose sub-Markov parameters are $M_{\mathscr{S}}=\Psi_{\mathbf{y}^{s}}$.

Let $\Psi(w)=\left[\Psi_{\mathbf{y}^{s}}(1w)\cdots\Psi_{\mathbf{y}^{s}}(n_{\mu}w)\right]$, $\forall w\in\Sigma^{*}$. Then, $R=(\{\hat{A}_{\sigma}\}_{\sigma\in\Sigma},\hat{G},\hat{C})$, with $\hat{G}=\left[\hat{G}_{1}\cdots\hat{G}_{n_{\mu}}\right]$, is a representation of $\Psi$, and by (Petreczky et al., 2017, Theorem 1, Theorem 2) and the definition of observability and reachability for representations in (Petreczky and Vidal, 2018, Appendix C), it follows that $R$ is observable and reachable, and hence $R$ is minimal by Berstel and Reutenauer (1984); Sontag (1979).
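To make the reachability and observability conditions concrete, the sketch below runs a rank test on a small hypothetical representation $R=(\{F_{\sigma}\}_{\sigma\in\Sigma},G,H)$ of dimension $n$. It assumes the usual finite characterization (reachability: the columns of $F_{w}G$ over words with $|w|<n$ span $\mathbb{R}^{n}$; observability: the rows of $HF_{w}$ over the same words have rank $n$) and the product convention $F_{w}=F_{\sigma_{k}}\cdots F_{\sigma_{1}}$. The function names and the matrices are illustrative, not taken from the paper.

```python
import numpy as np
from itertools import product

def word_product(F, w, n):
    """Assumed convention: F_w = F_{sk} @ ... @ F_{s1}; empty word gives I."""
    out = np.eye(n)
    for s in w:
        out = F[s] @ out
    return out

def words_up_to(symbols, max_len):
    """All words over `symbols` of length 0, 1, ..., max_len."""
    for k in range(max_len + 1):
        yield from product(symbols, repeat=k)

def reachability_rank(F, G, n):
    """Rank of [F_w G] stacked column-wise over all words with |w| < n."""
    cols = [word_product(F, w, n) @ G for w in words_up_to(tuple(F), n - 1)]
    return int(np.linalg.matrix_rank(np.hstack(cols)))

def observability_rank(F, H, n):
    """Rank of [H F_w] stacked row-wise over all words with |w| < n."""
    rows = [H @ word_product(F, w, n) for w in words_up_to(tuple(F), n - 1)]
    return int(np.linalg.matrix_rank(np.vstack(rows)))

# Hypothetical two-dimensional representation over Sigma = {1, 2}.
n = 2
F = {1: np.array([[0.5, 0.0], [0.0, 0.2]]),
     2: np.array([[0.0, 0.3], [0.1, 0.0]])}
G = np.array([[1.0], [0.0]])
H = np.array([[1.0, 0.0]])

r_reach = reachability_rank(F, G, n)
r_obs = observability_rank(F, H, n)
print(r_reach, r_obs)
```

Here both ranks equal $n=2$: $G$ alone spans only the first coordinate, but $F_{2}G$ adds the second, and similarly $HF_{2}$ complements $H$. A representation passing both tests is minimal in the sense used above.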

Then, the statement follows from (Petreczky and Vidal, 2018, Theorem 5 and Lemma 21). $\blacksquare$

Bibliography


Bamieh, B.A. and Giarré, L. (2002). Identification of linear parameter-varying models. International Journal of Robust and Nonlinear Control, 12(9), 841–853.
Berstel, J. and Reutenauer, C. (1984). Rational series and their languages. EATCS Monographs on Theoretical Computer Science. Springer-Verlag.
Billingsley, P. (1986). Probability and measure. Wiley.
Costa, O.L.V., Fragoso, M.D., and Marques, R.P. (2005). Discrete-time Markov Jump Linear Systems. Springer.
Cox, P., Petreczky, M., and Tóth, R. (2018). Towards efficient maximum likelihood estimation of LPV-SS models. Automatica, 97(9), 392–403.
Cox, P., Tóth, R., and Petreczky, M. (2015). Estimation of LPV-SS models with static dependency using correlation analysis. In Proc. 1st IFAC Workshop on Linear Parameter Varying Systems, 91–96. Grenoble, France.
dos Santos, P., Ramos, J., and de Carvalho, J. (2009). Identification of bilinear systems with white noise inputs: An iterative deterministic-stochastic subspace approach. IEEE Transactions on Control Systems Technology, 17(5), 1145–1153.
Favoreel, W., De Moor, B., and Van Overschee, P. (1999). Subspace identification of bilinear systems subject to white inputs. IEEE Transactions on Automatic Control, 44(6), 1157–1165. doi: 10.1109/9.769370.
