This paper introduces an automated controller synthesis method for retarded jump-diffusion systems, enabling the design of controllers with probabilistic guarantees by using finite abstractions and stability analysis.
Contribution
It presents the first automated, correct-by-construction control synthesis scheme for infinite-dimensional retarded jump-diffusion systems, including finite abstraction construction and stability-based refinement.
Findings
01
Finite abstractions are approximately bisimilar to original systems.
02
A probabilistic bound on output trajectory differences is established.
03
Controller synthesis successfully applied to temperature regulation in a delayed building model.
Abstract
In this paper, we provide for the first time an automated, correct-by-construction, controller synthesis scheme for a class of infinite dimensional stochastic systems, namely, retarded jump-diffusion systems. First, we construct finite abstractions approximately bisimilar to non-probabilistic retarded systems corresponding to the original systems having some stability property, namely, incremental input-to-state stability. Then, we provide a result on quantifying the distance between output trajectory of the obtained finite abstraction and that of the original retarded jump-diffusion system in a probabilistic setting. Using the proposed result, one can refine the control policy synthesized using finite abstractions to the original systems while providing guarantee on the probability of satisfaction of high-level requirements. Moreover, we provide sufficient conditions for the proposed…
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
Symbolic Models for Retarded Jump-Diffusion Systems
Pushpak Jagtap1
and
Majid Zamani2,3
1Department of Electrical and Computer Engineering, Technical University of Munich, Germany.
3Computer Science Department, Ludwig Maximilian University of Munich, Germany.
Abstract.
In this paper, we provide for the first time an automated, correct-by-construction, controller synthesis scheme for a class of infinite dimensional stochastic systems, namely, retarded jump-diffusion systems. First, we construct finite abstractions approximately bisimilar to non-probabilistic retarded systems corresponding to the original systems having some stability property, namely, incremental input-to-state stability. Then, we provide a result on quantifying the distance between output trajectory of the obtained finite abstraction and that of the original retarded jump-diffusion system in a probabilistic setting. Using the proposed result, one can refine the control policy synthesized using finite abstractions to the original systems while providing guarantee on the probability of satisfaction of high-level requirements. Moreover, we provide sufficient conditions for the proposed notion of incremental stability in terms of the existence of incremental Lyapunov functions which reduce to some matrix inequalities for the linear systems. Finally, the effectiveness of the proposed results is illustrated by synthesizing a controller regulating the temperatures in a ten-room building modelled as a delayed jump-diffusion system.
1. Introduction
Finite (a.k.a. symbolic) abstraction techniques have gained significant attention in the last few years since they provide tools for automated, correct-by-construction, controller synthesis for several classes of control systems. In particular, such abstractions provide approximate models that are related to concrete systems by aggregating concrete states and inputs to the symbolic ones. Having such finite abstractions, one can make use of the existing automata-theoretic techniques [MPS95] to synthesize hybrid controllers enforcing rich complex specifications (usually expressed as linear temporal logic formulae or automata on infinite strings) over the original systems.
In the past few years, there have been several results providing bisimilar finite abstractions for various continuous-time non-probabilistic as well as stochastic systems. The results include construction of approximately bisimilar abstractions for incrementally stable control systems [PGT08], switched systems [GPT10], stochastic control systems [ZMEM*+*14], and randomly switched stochastic systems [ZA14]. However, the abstractions obtained in these results are based on state-space quantization which suffer severely from the curse of dimensionality, i.e., the computational complexity increases exponentially with respect to the state-space dimension of the concrete system.
To alleviate this issue, authors in [CGG13] proposed an alternative approach for constructing approximately bisimilar abstractions for incrementally stable non-probabilistic switched systems without discretizing the state-space. The concept is further extended to provide finite abstractions for incrementally stable stochastic switched systems [ZAG15], stochastic control systems [ZTA17], and infinite dimensional non-probabilistic control systems [Gir14].
For a comparison between state-space discretization based and free approaches, we refer the interested readers to the discussion in Section 5.4 in [ZTA17].
On the other hand, retarded stochastic systems are widely used to model various processes in finance, ecology, medical, and engineering (see, e.g., [Sha13, BSP08, KM13]). However, the construction of symbolic models for such classes of systems are still unaddressed due to underlying challenges such as infinite-dimensional functional state-space and dependency on state history. The authors in [PPDBT10, PPDB15] provide the construction of abstractions for incrementally stable non-probabilistic time-delayed systems by spline-based approximation of functional spaces. However, the proposed results are complex from the implementation point of view and also suffer from the curse of dimensionality with respect to the state-space dimension of the concrete system. This motivates our work in this paper to provide a scheme for the construction of finite abstractions for a class of infinite dimensional stochastic systems, namely, retarded jump-diffusion systems without discretizing the state-space.
The main contribution of this paper is twofold. First, we introduce a notion of incremental input-to-state stability for retarded jump-diffusion systems and provide sufficient conditions for it in terms of the existence of a notion of incremental Lyapunov functions. In the linear case, we show that the sufficient conditions reduce to a matrix inequality. Second, under the assumption of incremental stability property over retarded jump-diffusion systems, we provide a construction of finite abstractions which are approximately bisimilar to the corresponding non-probabilistic version of retarded systems. Then, under some mild assumptions over incremental Lyapunov functions, we obtain a lower bound on the probability such that the distance between output trajectories of the obtained finite abstraction and those of the original retarded jump-diffusion system remains close over a finite time horizon. One can leverage the proposed probability closeness to synthesize a control policy using constructed finite abstractions and refine it back to the original system while providing guarantee on the probability of satisfaction over the original system. Further, we demonstrate the effectiveness of the proposed results by synthesizing a controller keeping temperatures in a comfort zone in a ten-room building modelled as a linear delayed jump-diffusion system.
2. Retarded Jump-Diffusion Systems
2.1. Notations
Let the triplet (Ω,F,\mathdsP) denote a probability space with a sample space Ω, filtration F, and the probability measure \mathdsP. The filtration \mathdsF=(Fs)s≥0 satisfies the usual conditions of right continuity and completeness [ØS05]. The symbols N, N0, Z, R, R+, and R0+ denote the set of natural, nonnegative integer, integer, real, positive, and nonnegative real numbers, respectively. We use Rn×m to denote a vector space of real matrices with n rows and m columns. Symbol ei∈Rn denotes a vector whose all elements are zero, except the ith one, which is one.
Let the family of continuous functions ζ:[−τ,0]→Rn be denoted by C([−τ,0];Rn) with a norm ∥ζ∥[−τ,0]:=sup−τ≤θ≤0∥ζ(θ)∥, where ∥⋅∥ denotes the Euclidean norm in Rn. We denote by ζ≡c a constant function ζ∈C([−τ,0];Rn) with value c∈Rn. For q>0 and t∈R0+, LFtq([−τ,0];Rn) denotes the family of all Ft-measurable C([−τ,0];Rn)-valued random processes ϕ:={ϕ(θ)∣−τ≤θ≤0} such that sup−τ≤θ≤0\mathdsE[∥ϕ(θ)∥q]<∞. The notation CF0b([−τ,0];Rn) denotes the family of all bounded F0-measurable C([−τ,0];Rn)-valued random variables. For a matrix A∈Rn×m,
∥A∥ represents the Euclidean norm of A.
We use λmin(A) and λmax(A) to denote the minimum and maximum eigenvalue of a symmetric matrix A, respectively. The diagonal set △⊂R2n is defined as △={(x,x)∣x∈Rn}.
The closed ball centered at x∈Rm with radius R is defined by BR(x)={y∈Rm∣∥x−y∥≤R}. A set B⊆Rm is called a box if B=∏i=1m[ci,di], where ci,di∈R with ci<di for each i∈{1,…,m}. The span of a box B is defined as span(B)=min{∣di−ci∣∣i=1,…,m}, where ∣a∣ represents the absolute value of a∈R. By defining [Rm]η:={z∈Rm∣zi=mkiη,ki∈Z}, the set ⋃b∈[Rm]ηBR(b) is a countable covering of Rm for any η∈R+ and R≥η/2. For a box B⊆Rm and η≤span(B), define the η-approximation [B]η:=[Rm]η∩B. Note that [B]η=∅ for any η≤span(B). We extend the notions of span and of approximation to finite unions of boxes as follows. Let A=⋃j=1MAj, where each Aj is a box. Define span(A)=min{span(Aj)∣j=1,…,M}, and for any η≤span(A), define [A]η=⋃j=1M[Aj]η.
A continuous function γ:R0+→R0+ belongs to class K if it is strictly increasing and γ(0)=0; it belongs to class K∞ if γ∈K and γ(r)→∞ as r→∞. A continuous function β:R0+×R0+→R0+ belongs to class KL if for each fixed s, the map β(r,s) belongs to class K with respect to r and, for each fixed r=0, the map β(r,s) is decreasing with respect to s and β(r,s)→0 as s→∞. Given a measurable function f:R0+→Rn, the (essential) supremum of f is denoted by ∥f∥∞; we recall that ∥f∥∞ := (ess)sup{∥f(t)∥,t≥0}. We identify a relation R⊆A×B with the map R:A→2B defined by b∈R(a) if and only if (a,b)∈R.
2.2. Retarded Jump-Diffusion Systems
Let (Ws)s≥0 be an r-dimensional \mathdsF-Brownian motion and (Ps)s≥0 be an r~-dimensional F-Poisson process. We assume that the Poisson process and the Brownian motion are independent of each other. The Poisson process Ps:=[Ps1;…;Psr~] models r~ kinds of events whose occurrences are assumed to be independent of each other.
Definition 2.1**.**
A retarded jump-diffusion system (RJDS) is a tuple ΣR=(Rn,X,U,U,f,g,r), where:
•
Rn* is the Euclidean space;*
•
X* is a subset of C([−τ,0];Rn), for some τ∈R0+;*
•
U⊆Rm* is the bounded input set which is finite unions of boxes;*
•
U* is a subset of the set of all measurable, locally essentially bounded functions of time from R0+ to U;*
•
f:X×U→Rn, satisfies the following Lipschitz assumption: there exist constants Lf, Lu∈R+, such that
∥f(xt,u)−f(x^t,u^)∥≤Lf∥xt−x^t∥[−τ,0]+Lu∥u−u^∥ for all xt,x^t∈X and all u,u^∈U;
•
g:X→Rn×r* satisfies the following Lipschitz assumption: there exists a constant Lg∈R0+ such that ∥g(xt)−g(x^t)∥≤Lg∥xt−x^t∥[−τ,0] for all xt,x^t∈X;*
•
r:X→Rn×r~* satisfies the following Lipschitz assumption: there exists a constant Lr∈R0+ such that ∥r(xt)−r(x^t)∥≤Lr∥xt−x^t∥[−τ,0] for all xt,x^t∈X.*
An Rn-valued continuous-time process ξ is said to be a solution process for ΣR if there exists υ∈U satisfying
[TABLE]
P-almost surely (P-a.s.), where f, g, and r are the drift, diffusion, and reset terms, respectively, and ξt:={ξ(t+θ)∣−τ≤θ≤0}. We
emphasize that postulated assumptions on f, g, and r ensure the existence and uniqueness of the solution process ξ on t≥−τ [ØS05, Theorem 1.19]. Throughout the paper we use the notation ξζ,υ(t) to denote the value of a solution process starting from initial condition ζ={ξ(θ)∣−τ≤θ≤0}∈CF0b([−τ,0];Rn)P-a.s. and under the input signal υ at time t. We also use the notation ξt,ζ,υ to denote the solution process starting from initial condition ζ={ξ(θ)∣−τ≤θ≤0}∈CF0b([−τ,0];Rn)P-a.s. and under the input signal υ. Note that for any t∈R0+, ξζ,υ(t) is a random variable taking values in Rn and ξt,ζ,υ is a random variable taking values in C([−τ,0];Rn). Here, we assume that the Poisson processes Psi, for any i∈{1,2,…,r~}, have the rates of λi. Now we will introduce delayed jump-diffusion system (ΣD)(DJDS) as a special case of retarded jump-diffusion system which is given by
[TABLE]
where F:Rn×Rn×U→Rn, G:Rn×Rn→Rn×r, and R:Rn×Rn→Rn×r~ are the drift, diffusion, and reset terms, respectively. The constants τ1, τ2, and τ3 are the state delay in the drift, diffusion, and reset terms, respectively.
2.3. Incremental Stability for RJDS and DJDS
Here, we introduce a notion of incremental stability for RJDS (resp. DJDS).
Definition 2.2**.**
An RJDS ΣR (resp. DJDS ΣD) is incrementally input-to-state stable in the qth moment, where q≥1, denoted by (δ-ISS-Mq), if there exist a KL function β and a K∞ function γ such that for any t∈R0+, any two initial conditions ζ,ζ^∈CF0b([−τ,0];Rn), and any υ,υ^∈U the following condition is satisfied:
[TABLE]
One can readily verify that in the absence of delay, Definition 2.2 reduces to that of δ-ISS-Mq for stochastic control systems in [ZMEM*+*14, Definition 3.1].
For later use, we provide the infinitesimal generators (denoted by operator L) for an RJDS ΣR and a DJDS ΣD using Itô’s differentiation [JP09, equation (23)]. Let function V:Rn×Rn→R0+ be twice differentiable on Rn×Rn∖△.
The infinitesimal generator of V associated with an RJDS ΣR in (2.1) is an operator, denoted by LV, from C([−τ,0];Rn)×C([−τ,0];Rn) to R, and ∀t∈R0+,∀xt,x^t∈C([−τ,0];Rn) and ∀u,u^∈U it is given by
[TABLE]
The infinitesimal generator of V associated with a DJDS ΣD in (2.2) is an operator, denoted by LV, from R8n to R and ∀x,x^,y,y^,z,z^,p,p^∈Rn, and ∀u,u^∈U it is given by
[TABLE]
The symbols ∂x and ∂x,x^ in (2.4) and (2.5) represent first and second-order partial derivatives with respect to x (1st argument) and x^ (2nd argument), respectively. Note that we dropped the arguments of ∂xV, ∂x^V, ∂x,xV, ∂x^,xV, ∂x,x^V, and ∂x^,x^V in (2.4) and (2.5) for the sake of simplicity.
Now we describe δ-ISS-Mq in terms of existence of so-called δ-ISS-Mq Lyapunov functions for RJDS and DJDS using Razumikhin-type condition as defined next.
Definition 2.3**.**
Consider an RJDS ΣR and a continuous function V:Rn×Rn→R0+ that is twice differentiable on Rn×Rn∖△. The function V is called a δ-ISS-Mq Lyapunov function for ΣR for q≥1, if there exist K∞ functions α,α, and φ, such that:
(i)
α(resp. α) is a convex (resp. concave) function;
2. (ii)
where ω:Rn×Rn→R+ is a nonnegative function such that there exists a K∞ function ω~ satisfying ω(x,x^)≥ω~(∥x−x^∥q) and lim∥s∥→∞α(∥s∥q)ω~(∥s∥q)>0; q~:Rn×Rn→R is a function such that q~(x,x^)−V(x,x^)≥q(∥x−x^∥), where q is a K∞ function satisfying lim∥s∥→∞α(∥s∥q)q(∥s∥)>0.
Definition 2.4**.**
Consider a DJDS ΣD and a continuous function V:Rn×Rn→R0+ that is twice differentiable on Rn×Rn∖△. Function V is called a δ-ISS-Mq Lyapunov function for ΣD for q≥1, if there exist constants κ0, κ1, κ2, κ3 such that κ0≥∑i=13κi≥0, a nonnegative function ψ:Rn×Rn→R+, K∞ functions α,α,φ, and K function ω^ such that: conditions (i) and (ii) in Definition 2.3 hold and ∀x,x^,y,y^,z,z^,p,p^∈Rn and ∀u,u^∈U,
[TABLE]
and ψ(x,x^)≥ω^(∥x−x^∥q) and lim∥s∥→∞α(∥s∥q)ω^(∥s∥q)>0.
Now we provide the description of δ-ISS-Mq for an RJDS ΣR in terms of existence of δ-ISS-Mq Lyapunov functions in the following theorem.
Theorem 2.5**.**
An RJDS ΣR is δ-ISS-Mq if it admits a δ-ISS-Mq Lyapunov function as in Definition 2.3.
Proof.
The proof is inspired by the proof of Theorem 3.1 in [HM09]. Denote φ=φ(∥υ−υ^∥∞) and V0=α(\mathdsE[∥ζ−ζ^∥[−τ,0]q]), ∀υ,υ^∈U and ∀ζ,ζ^∈CF0b([−τ,0];Rn). By using Lemma 3.2 and 3.3 in [HM09], there exist a constant aq>0 and a K∞ function μω such that ∀t≥0, \mathdsE[ω(ξζ,υ(t),ξζ^,υ^(t))]≥2φ and \mathdsE[q~(ξζ,υ(t),ξζ^,υ^(t))]−\mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≥aq, whenever \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≥μω−1(2φ). Without loss of generality, assume μω−1(2φ)<α(sup−τ≤θ≤0\mathdsE[∥ζ(θ)−ζ^(θ)∥q)]≤V0. Let J be the minimal nonnegative integer such that M0=μω−1(2φ)+Jaq>V0. Let τ^=max{τ,M0/φ} and tj=jτ^ for j∈{0,1,…,J}. In order to prove the theorem, we need to show
[TABLE]
where Mj=μω−1(2φ)+(J−j)aq and j∈{0,1,…,J}.
First we show that \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤V0,∀t≥t0.
Suppose that ta:=inf{t>t0∣\mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]>V0}<∞. Since \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))] is continuous in time t≥0, there exist a pair of constants tb and tc such that t0≤tb≤ta<tc and
[TABLE]
However, by generalized Itô’s formula [Sko09] and condition (2.6) in Definition 2.3, we have
[TABLE]
fot all t∈(tb,tc], which contradicts (2.9). Thus the inequality \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤V0 must be true for all t≥t0.
Now we show that \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤M1,∀t≥t1. Let tm:=inf{t≥t0∣\mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤M1}<∞. If tm>t1, then ∀t∈[t0,t1], we have
[TABLE]
Using condition (2.6) in Definition 2.3, inequality (2.10) implies
[TABLE]
Consequently, by generalized Itô’s formula, we have \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤V0−φτ^<0, which contradicts the property of \mathdsE[V(ξζ,υ(t), ξζ^,υ^(t))]≥0,∀t≥0. Hence, we must have tm≤t1. Let
[TABLE]
Again as \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))] is continuous in t≥0, there exists constants tˉb and tˉc such that t1≤tˉb≤tˉa<tˉc and
[TABLE]
By using similar reasoning as before, generalized Itô’s formula [Sko09] and condition (2.6) in Definition 2.3, the assumption results in contradiction, thus we have (2.8) for j=1. Now define tj:=inf{t≥tj−1∣\mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤Mj}<∞ for j=2,3,…,J. By similar type of reasoning, we get \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤Mj, ∀t≥tj. Particularly, \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤MJ=μω−1(2φ),∀t≥tJ. By following Jensen’s inequality, one obtains
[TABLE]
where γ(s)=α−1(μω−1(2φ(s))) for all s∈R0+.
Now choose a KL function β such that β(V0,t)≥2V0−tJV0t, ∀t∈[0,tJ]. So we have \mathdsE[V(ξζ,υ(t),ξζ^,υ^(t))]≤β(V0,t),∀t∈[0,tJ] which implies
[TABLE]
where β(s,t)=α−1(β(α(s),t)) for any s,t∈R0+.
From (2.11) and (2.12), one can readily verify inequality (2.3) which implies that ΣR is δ-ISS-Mq.
∎
The next corollary proposes similar results as in the previous theorem but for DJDS.
Corollary 2.6**.**
*A DJDS ΣD is δ-ISS-Mq if it admits a δ-ISS-Mq Lyapunov function as in Definition 2.4.
Proof.
Let ω(x,x^)=1+κ01ψ(x,x^) for all x,x^∈Rn. Now by considering Definition 2.4, we have
[TABLE]
for all t≥0 and ϕ,ϕ^∈LFtq([−τ,0];Rn) satisfying condition (2.7) in Definition 2.3 with function q~(ϕ(0),ϕ^(0)):=V(ϕ(0),ϕ^(0))+ω(ϕ(0),ϕ^(0)). Moreover, functions ω~(s)=q(s)=1+κ0ω^(s), ∀s∈R0+, satisfy properties required in condition (iii) in Definition 2.3. Therefore, V satisfies all the conditions in Definition 2.3. Thus by following Theorem 2.5, we obtain that ΣD is δ-ISS-Mq.
∎
In the following lemma, we provide a similar result as in Corollary 2.6 but tailored to linear delayed jump-diffusion systems in which sufficient conditions boil down to a matrix inequality.
Lemma 2.7**.**
Consider a DJDS ΣD as given in (2.2), where for all x,y,z,p∈Rn and u∈U, F(x,y,u):=A1x+A2y+Bu, for some A1,A2∈Rn×n and B∈Rn×m, G(x,z):=[G1xG2x⋯Grx]+[G1zG2z⋯Grz] and R(x,p):=[R1xR2x⋯Rr~x]+[R1pR2p⋯Rr~p], for some Gi,Gi,Ri,Ri∈Rn×n. Then, system ΣD is δ-ISS-M2 if there exist constants c1,c2,c3,c4,c5∈R+ satisfying c1>∑i=24ci and
[TABLE]
*where P is a symmetric positive definite matrix and Δ=PA1+A1TP+i=1∑rˉGiTPGi+i=1∑r~λi(PRi+RiTP+RiTPRi).
*
Proof.
Consider a function V:Rn×Rn→R0+ given by
[TABLE]
where P is a symmetric positive definite matrix.
One can readily verify that the function V in (2.14) satisfies properties (i) and (ii) in Definition 2.3 with functions α(s):=21λmin(P)s and α(s):=21λmax(P)s for all s∈R0+ and q=2. By considering the infinitesimal generator in (2.5) associated with the considered linear delayed jump-diffusion system, Lipschitz assumptions, Young’s inequality, consistency of norms, and (2.13) one can obtain the following chains of inequalities
[TABLE]
Thus by following the proof of Corollary 2.6 with κ0=∑i=24ci, ψ(x,x^)=(c1−κ0)V(x,x^), ω(x,x^)=1+κ01ψ(x,x^)=κV(x,x^), where κ=1+κ0c1−κ0, ∀t≥0, ∀ζ,ζ^∈CF0b([−τ,0];Rn), and ∀ϕ,ϕ^∈LFtk([−τ,0];Rn) satisfying (2.7), one obtains
[TABLE]
By using generalized Ito’s formula, (2.15), and condition (ii) in Definition 2.3, we have
[TABLE]
which, by virtue of Gronwall’s inequality, leads to
[TABLE]
Now by using condition (ii) in Definition 2.3, one obtains
[TABLE]
and, hence,
[TABLE]
Therefore, by introducing functions β and γ as
[TABLE]
for any s,t∈R0+, inequality (2.3) is satisfied.
∎
Remark 2.8**.**
For fixed values of ci, i={1,…,5}, the inequality (2.13) boils down to a linear matrix inequality (LMI) which can be solved efficiently using semidefinite programming. One may also solve a bilinear matrix inequality (BMI) (locally) using a V−K iteration [GB94]. That is, for fixed values ci, i={1,…,5}, we find matrix P satisfying the LMI, and then for a fixed P we find constants ci, i={1,…,5}, to maximize the value of c1−∑i=24ci, and we iterate until there is no improvement in the value of c1−∑i=24ci.
In order to provide results on the construction of symbolic models and given a RJDS ΣR, we introduce the corresponding non-probabilistic retarded systems (denoted by ΣR) obtained by removing diffusion and reset terms (that is, g and r in (2.1)). From now onwards, we use notation ξζ,υ(t) to denote the value of a trajectory of ΣR in Rn and ξt,ζ,υ to denote the solution of ΣR in C([−τ,0];Rn) at time t∈R0+ started from the non-probabilistic initial condition ζ∈CF0b([−τ,0];Rn), where F0 is the trivial sigma-algebra, and under input signal υ.
Now, we provide a technical lemma which is used later to show a relation between non-probabilistic retarded systems ΣR and their symbolic models.
Lemma 2.9**.**
Consider an incrementally input-to-state stable non-probabilistic retarded systems ΣR corresponding to a δ-ISS-Mq RJDS ΣR for q≥1, that is for any t∈R0+, any ζ,ζ^∈C([−τ,0];Rn), and any υ,υ^∈U, it satisfies
[TABLE]
where β and γ are the functions appearing in (2.3).
Then there exists a KL function β~ such that the following inequality holds:
[TABLE]
where β~(s,t)=e−(t−τ)s+β(s,max{0,t−τ}).
Proof.
The proof is inspired by the proof of Theorem 3 in [PPDBT10]. From inequality (2.17), we obtain the following inequalities:
[TABLE]
and
[TABLE]
Moreover, we also have
[TABLE]
Inequalities (2.19) and (2.20) along with (2.21) yield
[TABLE]
One can rewrite the last inequality as
[TABLE]
for all t≥0, where β~(s,t):=e−(t−τ)s+β(s,max{0,t−τ}) is a KL function.
∎
3. Systems and Approximate Equivalence Relations
We recall the notion of system introduced in [Tab09] which later serves as a unified modeling framework for both retarded jump-diffusion systems and their finite abstractions.
Definition 3.1**.**
A systems is a tuple S=(X,X0,U,⟶,Y,H) where X is a set of states (possibly infinite), X0⊆X is a set of initial states, U is a set of inputs (possibly infinite), ⟶⊆X×U×X is a transition relation, Y is a set of outputs, and H:X→Y is an output map.
We denote x⟶ux′ as an alternative representation for a transition (x,u,x′)∈⟶, where state x′ is called a u-successor (or simply successor) of state x, for some input u∈U.
Moreover, a system S is said to be
•
metric, if the output set Y is equipped with a metric d:Y×Y→R0+.
•
finite (or symbolic), if X and U are finite.
•
deterministic, if there exists at most a u-successor of x, for any x∈X and u∈U.
•
nonblocking, if for any x∈X, there exists some u-successor of x, for some u∈U.
For a system S, the finite state-run generated from initial state x0∈X0 is a finite sequence of transitions:
[TABLE]
such that xi⟶uixi+1, for i∈{0,1,…,k−1}. The associated finite output-run is given by yi=H(xi), for i∈{0,1,…,k−1}. These finite runs can be directly extended to infinite runs as well.
Now, we provide the notion of approximate (bi)simulation relation between two systems, introduced in [GP07], which is later used for analyzing and synthesizing controllers for retarded jump-diffusion systems.
Definition 3.2**.**
Let S1=(X1,X10,U1,1⟶,Y1,H1) and S2=(X2,X20,U2,2⟶,Y2,H2) be two metric systems having the same output sets Y1=Y2 and metric d. For ε∈R0+, a relation R⊆X1×X2 is said to be an ε-approximate bisimulation relation between S1 and S2 if it satisfies the following conditions:
(i)
∀(x1,x2)∈R, we have d(H1(x1),H2(x2))≤ε;
2. (ii)
∀(x1,x2)∈R, x11⟶u1x1′ in S1 implies x22⟶u2x2′ in S2 satisfying (x1′,x2′)∈R;
3. (iii)
∀(x1,x2)∈R, x22⟶u2x2′ in S2 implies x11⟶u1x1′ in S1 satisfying (x1′,x2′)∈R.
If we remove condition (iii), then R⊆X1×X2 is said to be an ε-approximate simulation relation from S1 to S2.
The system S1 is ε-approximate bisimilar to S2, denoted by S1≅SεS2, if there exists an ε-approximate bisimulation relation R between S1 and S2 such that: ∀x10∈X10, ∃x20∈X20 with (x10,x20)∈R and ∀x20∈X20, ∃x10∈X10 with (x10,x20)∈R.
In order to present the main results of the paper, we need to employ the notion of system as an abstract representation of a retarded jump-diffusion system. First, we define a metric system associated with the retarded jump-diffusion system ΣR, denoted by S(ΣR)=(X,X0,U,⟶,Y,H), where
•
X is the set of all C([−τ,0];Rn)-valued random variables defined on the probability space (Ω,F,\mathdsP);
•
X0 is a subset of CF0b([−τ,0];Rn);
•
U=U;
•
ζ⟶υζ′ if ζ and ζ′ are measurable in Ft and Ft+h, respectively, for some t∈R0+ and h∈R+, and there exists a solution ξt∈LFtq([−τ,0];Rn) of ΣR satisfying ξt=ζ and ξh,ζ,υ=ζ′\mathdsP-a.s.;
•
Y=X;
•
H(ζ)=ζ.
From now on, we restrict our attention to the sampled-data system, where control signals (in ΣR) are piecewise-constant over intervals of length h∈R+, i.e.
[TABLE]
The metric systems associated with the sampled-data retarded jump-diffusion systems can be defined as Sh(ΣR)=(Xh,Xh0,Uh,h⟶,Yh,Hh), where Xh=X, Xh0=X0, Uh=Uh, Yh=Y, Hh=H, and ζhh⟶υhζh′ if ζh and ζh′ are measurable in Fih and F(i+1)h, respectively, for some i∈N0, and there exists a solution ξt∈LFtq([−τ,0];Rn) of ΣR satisfying ξt=ζh and ξh,ζh,υh=ζh′\mathdsP-a.s. In other words, a finite state-run of Sh(ΣR), represented by ζ0h⟶υ0ζ1h⟶υ1⋯h⟶υk−1ζk, where υi∈Uh and ζi+1=ξh,ζi,υi\mathdsP-a.s. for i∈{0,1,…,k−1}, captures solutions of RJDS ΣR at the sampling times t=0,h,…,kh, started from ζ0∈X0 and resulting from control input υ obtained by the concatenation of input signals υi∈Uh. Moreover, the corresponding finite output-run is {y0,y1,…,yk}. Similarly, we consider metric systems corresponding to non-probabilistic sampled-data retarded systems denoted by Sh(ΣR)=(Xh,Xh0,Uh,h⟶,Yh,Hh) where Xh=C([−τ,0];Rn), Xh0⊆Xh, Uh=Uh, Yh=Xh, Hh(ζ)=ζ, and ζhh⟶υhζh′ if ζh′=ξh,ζh,υh. For later use, we represent an Rn-valued output at the kth sampling instance starting from initial state ζ under input signal υh by ξζ,υh(kh).
4. Symbolic Models for RJDS
4.1. Finite Dimensional Abstractions
In this subsection, we introduce a finite dimensional abstraction for Sh(ΣR). Consider metric systems associated with the sampled-data retarded systems Sh(ΣR) and consider triple ρ=(h,N,ζs) of parameters, where h∈R+ is the sampling time, N∈N is a temporal horizon, and ζs∈C([−τ,0];Rn) is a source state. Let us define a metric system as
[TABLE]
where
•
Xρ=U, Xρ0=Xρ, Uρ=U, Yρ=Yh;
•
xρρ⟶uρxρ′, where xρ=(u1,u2,…,uN)∈Xρ, if and only if xρ′=(u2,…,uN,uρ);
•
Hρ(xρ)=ξNh,ζs,xρ.
Here, we abuse notation by identifying xρ=(u1,u2,…, uN)∈[U]ηN as an input curve υ:[0,Nh)→[U]η such that υ(t)=uk for any t∈[(k−1)h,kh) for k∈{1,…,N} in ξNh,ζs,xρ. We use similar notations in the rest of the paper as well. Notice that the system Sρ(ΣR) is deterministic, non-blocking, and finite dimensional (but not necessarily symbolic unless U is a finite set). Note that Hρ is the output map from non-probabilistic state xρ∈Xρ to a C([−τ,0];Rn)-valued solution process ξNh,ζs,xρ and corresponding Rn-valued solution is represented by ξζs,xρ(Nh).
The next theorem provides the results on the construction of finite dimensional abstractions which are approximately bisimilar to Sh(ΣR).
Theorem 4.1**.**
Consider a retarded system ΣR corresponding to δ-ISS-Mq RJDS ΣR for q≥1. Given any ε>0, let the sampling time h, temporal horizon N, and source state ζs be such that
[TABLE]
where Z(ζs)=u1∈Usup∥ξh,ζs,u1−ζs∥[−τ,0]q. Then, the relation
[TABLE]
is an ε-approximate bisimulation relation between Sh(ΣR) and Sρ(ΣR).
Proof.
Consider any (ζ,xρ)∈R1, where ζ∈Xh and xρ=(u1,u2,…,uN)∈Xρ. Then we have ∥Hh(ζ)−Hρ(xρ)∥[−τ,0]q≤ε. Thus condition (i) in Definition 3.2 holds. Now we show that condition (ii) in Definition 3.2 holds. Consider any υh:[0,h[→uh for some uh∈U and ζ′=ξh,ζ,υh. Consider uρ=uh and xρ′=(u2,…,uN,uρ) and let xρ=(u1,u2,…,uN,uρ) denote input sequence in UN+1. With the help of triangle inequality and \eqrefmmm1 one obtains the following chains of inequalities:
[TABLE]
Hence, (ζ′,xρ′)∈R1. Thus condition (ii) in Definition 3.2 holds. In a similar way, one can show that condition (iii) in Definition 3.2 holds which completes the proof.
∎
Note that in the above theorem, given any ε>0, one can select temporal horizon N to be sufficiently large to enforce term β~(Z(ζs),Nh) to be sufficiently small. This results in β~(ε,h)<ε which enforces a lower bound for the sampling time h.
Now we establish the results on the existence of finite dimensional abstraction Sρ(ΣR) such that Sh(ΣR)≅SεSρ(ΣR) given the result in Theorem 4.1.
For every ζ∈Xh0, there always exists xρ0∈Xρ0 such that ∥Hh(ζ)−Hρ(xρ0)∥[−τ,0]q≤ε. Hence (ζ,xρ0)∈R1. In a similar way, we can show that for every xρ0∈Xρ0 there exists ζ∈Xh0 such that (ζ,xρ0)∈R1, which completes the proof.
∎
4.2. Finite Abstractions
In this subsection, we provide a finite (a.k.a. symbolic) abstraction for Sh(ΣR) by quantizing input set U. Let us consider tuple ρ=(h,N,ζs,η), where η>0 is a quantization parameter and the quantized input set is denoted by [U]η (cf. Notation in Subsection 2.1). Now, we can define the corresponding finite systems as
[TABLE]
where
•
Xρ=[U]ηN,Xρ0=Xρ, Uρ=[U]η, Yρ=Yh;
•
xρρ⟶uρxρ′, where xρ=(u1,u2,…,uN)∈Xρ, if and only if xρ′=(u2,…,uN,uρ);
•
Hρ(xρ)=ξNh,ζs,xρ.
A finite state-run of Sρ(ΣR) is represented by xρ(0)ρ⟶υ0xρ(1)ρ⟶υ1⋯ρ⟶υk−1xρ(k), where υi∈Uρ.
For later use, we denote Y~y0,υρ:N0→Rn an Rn-valued output-run of Sρ(ΣR) starting from y0=ξζs,xρ(0)(Nh) under input signal υρ.
In order to provide an approximate bisimulation relation between sampled retarded systems and symbolic models, we need the following technical lemma.
Lemma 4.3**.**
Consider a retarded system ΣR corresponding to δ-ISS-Mq RJDS ΣR for q≥1 and a quantization parameter η such that 0<η≤span(U). Then the relation R2 given by
[TABLE]
is a γ(η)-approximate bisimulation relation between Sρ(ΣR) and Sρ(ΣR), and Sρ(ΣR)≅Sγ(η)Sρ(ΣR).
Proof.
Let (xρ,xρ)∈R2, then ∥ui−[ui]η∥≤η for i∈{1,2,…,N} implies that ∥xρ−xρ∥∞≤η. By using (2.18), one obtains
[TABLE]
Then, the first condition in Definition 3.2 holds. Now, consider any (xρ,xρ)∈R2, where xρ=(u1,u2,…,uN) and xρ=(u1,u2,…,uN). Let u∈Uρ and consider xρρ⟶uxρ′:=(u2,…,uN,u) in Sρ(ΣR). Choose u=[u]η and consider xρρ⟶uxρ′:=(u2,…,uN,u) in Sρ(ΣR). It is obvious that (xρ′,xρ′)∈R2 and, hence, condition (ii) in Definition 3.2 holds. Similarly, condition (iii) in Definition 3.2 holds which shows R2 is a γ(η)-approximate bisimulation relation between Sρ(ΣR) and Sρ(ΣR). For any xρ0:=(u1,u2,…,uN)∈Xρ0, there always exists xρ0:=([u1]η,[u2]η,…,[uN]η)∈Xρ0 and, hence, (xρ0,xρ0)∈R2. Note that the existence of such xρ0 is guaranteed by U being a finite union of boxes and by the inequality η≤span(U). Moreover, for any xρ0∈Xρ0 and by choosing xρ0=xρ0, one readily gets (xρ0,xρ0)∈R2 and, hence, Sρ(ΣR)≅Sγ(η)Sρ(ΣR).
∎
Now we provide the main results of this subsection on establishing an approximate bisimulation relation between Sh(ΣR) and Sρ(ΣR), which is an immediate consequence of the transitivity property of approximate bisimulation relations [GP07, Proposition 4] as recalled next.
Proposition 4.4**.**
Consider metric systems S1, S2, and S3 such that S1≅SδS2 and S2≅Sδ^S3, for some δ,δ^∈R0+. Then, we have S1≅Sδ+δ^S3.
Now we provide the first main result of this section.
Theorem 4.5**.**
Consider a retarded system ΣR corresponding to δ-ISS-Mq RJDS ΣR for q≥1. Given any ε>0 and a quantization parameter 0<η≤span(U), consider the results in Theorem 4.1 and Lemma 4.3. Then, the relation R given by
[TABLE]
is an (ε+γ(η))-approximate bisimulation relation between Sh(ΣR) and Sρ(ΣR).
Note that relations R1 and R2 in Theorem 4.5 have been defined in Theorem 4.1 and Lemma 4.3, respectively.
Having the result in Theorem 4.5, now we provide one of the main results of the paper that quantifies the closeness of the output trajectories of sampled retarded jump-diffusion systems Sh(ΣR) and those of symbolic models Sρ(ΣR).
In order to prove this result, we raise a supplementary assumption on δ-ISS-Mq Lyapunov functions V.
Assumption 4.6**.**
For a δ-ISS-Mq Lyapunov function V as in Definition 2.3, following conditions hold:
(i)
for function ω in Definition 2.3 there exists κ∈R+ such that ω(x,x^)≥κV(x,x^) for all x,x^∈Rn;
(ii)
there exists some constant α∈R0+ such that
[TABLE]
for all ζ,ζ^∈C([−τ,0];Rn), where H(V)(x,x^) denotes the Hessian matrix of V at x,x^∈Rn.
Note that condition (ii) in Assumption 4.6 is not restrictive, provided that V is restricted to a compact subset of Rn×Rn, the Hessian matrix H(V)(x,x^) of V is a positive semidefinite matrix in R2n×2n, and ∂x,xV(x,x^)≤P for some P a positive semidefinite matrix in Rn×n. With these conditions and Lipschitz assumptions on g(⋅) and r(⋅), one can always find α∈R0+ satisfying ((ii)).
Theorem 4.7**.**
Consider an RJDS ΣR admitting a δ-ISS-Mq Lyapunov function V for q≥1 as in Definition 2.3 and satisfying conditions in Assumption 4.6. For any abstraction parameters ρ=(h,N,ζs,η) and any ε∈R+ satisfying (4.1), consider finite abstraction Sρ(ΣR) and results in Theorem 4.5. For any Rn-valued output-run ξζ,υh of Sh(ΣR), there exists an Rn-valued output-run Y~ξζs,xρ(0)(Nh),υρ of Sρ(ΣR), and vice versa, such that following inequalities hold
[TABLE]
[TABLE]
Proof.
From the result in Theorem 4.5, we have Sh(ΣR)≅Sε+γ(η)Sρ(ΣR) which implies ∥ξh,ζ,υh−ξNh,ζs,xρ∥[−τ,0]q≤ε+γ(η) and consequently
[TABLE]
Hence, one obtains the following chain of (in)equalities:
[TABLE]
From the properties of V in Definition 2.3 and Assumption 4.6, one has
[TABLE]
for any t∈R0+, υh∈Uh and ζ∈CF0b([−τ,0];Rn). Using inequalities (4.2), (4.2), and the result in [Kus67, Theorem 1, pp. 79], one obtains the relations (4.4) and (4.5). In a similar way, we can prove the other direction.
∎
Inequalities (4.4) and (4.5) lower bound the probability such that the distance between output trajectories of the finite abstraction and those of the corresponding sampled retarded jump-diffusion system remains close over finite time horizon. One can leverage the result in Theorem 4.7 to synthesize control policies for finite abstractions and refine them to the original systems while providing guarantee on the probability of satisfaction. Similar relations were established in [JP09], [ZMEM*+*14], and [LSZ19] for stochastic hybrid systems, stochastic control systems, and interconnected stochastic control systems and their (in)finite abstractions. For a detailed discussion on how inequalities (4.4) and (4.5) can be used to provide a lower bound on the probability of satisfying the specification for the original systems, we kindly refer the interested readers to [LSZ19, Section 6].
5. An Example
To show the effectiveness of the proposed results, we consider a simple thermal model of ten-room building as shown schematically in Figure 1. The model used here is similar to that used in [ZAG15]. In addition, we modified arrangement of the rooms to increase state-space dimensions and considered that the dynamic is affected by delays in states and by jumps (modeling door and window opening). The dynamic of the considered delayed jump-diffusion system ΣD is given by the following delayed stochastic differential equations:
[TABLE]
where the terms Wti and Pti, i∈{1,2,…,10}, denote the standard Brownian motion and Poisson process with rate λi=0.1, respectively; ξi, i∈{1,2,…,10}, denote the temperature in each room; Te=15 (degree Celsius) is the external temperature; TH1=TH2=100 are the temperatures of two heaters; τ1=10 time units and τ2=5 time units are state delays in drift and diffusion terms, respectively; and the control inputs υ1, υ2 are amounted to 1 if corresponding heaters are on and to [math] if corresponding heaters are off. Here, we assume that at most one heater is on at each time instance which results in the finite input set U={(1,0),(0,1),(0,0)}. The system parameters are chosen as αij=5×10−2 for all i=j and i,j∈{1,2,…,10}, αe2=αe5=8×10−3, αe1=αe3=αe4=αe6=αe7=αe8=αe9=αe10=5×10−3, αH1=αH2=3.6×10−3, αiτ1=1×10−4, gi=2×10−3, giτ2=1×10−4, and ri=1×10−5, for i∈{1,2,…,10}. In this example, we work on the subset D=[17,22]10 of state space of ΣD.
Using Lyapunov function V(x,x^)=(x−x^)TP(x−x^), for all x,x^∈D, where
[TABLE]
one can readily obtain the functions α(s)=0.5029s, α(s)=0.8197s, for q=2 satisfying conditions (i) and (ii) in Definition 2.3. Using the results in Lemma 2.7, one obtains function β(s,t)=e−κts, ∀s∈R0+, with κ=0.6667.
By considering q=2, a constant function ζs≡[17,17,17,17,17,17,17,17,17,17]T∈C([−τ,0];Rn), where τ=10, a precision ε=0.05 and by fixing sampling time h=15 time units, one can obtain temporal horizon N=9 for Sρ(ΣD), satisfying inequality (4.1) in Theorem 4.1. Therefore, the resulting cardinality of the set of states of Sρ(ΣD) is 39=19683 and number of transitions are 310=59049. The CPU time taken for computing finite abstraction with N=9 is accounted to 0.015. From inequalities (4.4) and (4.5), one can observe that higher precision of the abstraction helps to improve the bound on the closeness of the trajectories. To get higher precision, one can increasing the value of N. For example, for N=11 one gets an abstraction with precision ε=0.008. However, it increases the size of the abstraction and the computation time which are 531441 and 0.413 seconds, respectively. Remark that since the input set is finite, the finite dimensional abstraction Sρ(ΣD) is also symbolic.
Now consider the objective to design a controller enforcing the trajectories of ΣD to stay within comfort zone W=[18,21]10. This corresponds to the LTL specification □W. The computation of the symbolic model Sρ(ΣD) and the controller synthesis have been performed using tool QUEST [JZst] on a computer with CPU 3.5GHz Intel Core i7. The CPU time taken to synthesize controller for N=9 and N=11 is accounted to 2.07 and 2.56 seconds, respectively.
For the given dynamics and δ-ISS-M2 Lyapunov function V on D, we have
α=0.0012 in Assumption 4.6. For ϵ=0.8, using the result in Theorem 4.7, we guarantee that the the distance between output of Sh(ΣD) and that of Sρ(ΣD) will not exceed ϵ+ε=0.85 over the discrete time horizon {0,15,…,150} with probability at least 75.8%. To get a better lower bound for the aforementioned probability, one can reduce the time horizon or increase ϵ. For example, if we consider discrete time horizon {0,15,…,75} and ϵ=1, the lower bound on the probability will be 89.5%.
Figure 2 shows a few realizations of the closed-loop solution process ξζ,υ starting form initial condition ζ≡[19,19,19,19,19,19,19,19,19,19]T∈Xh0. The synthesized control policies υ1 and υ2 are shown in Figure 3. The obtained probability (i.e., at least 75.8%) is also empirically verified by computing distance between output trajectories of Sh(ΣR) and of Sρ(ΣR) (i.e., ∥ξζ,υh(kh)−Y~ξζs,xρ(0)(Nh),υρ(k)∥2) using 5000 runs. Several realizations are shown in Figure 4.
Bibliography25
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[BSP 08] M. Bandyopadhyay, T. Saha, and R. Pal. Deterministic and stochastic analysis of a delayed allelopathic phytoplankton model within fluctuating environment. Nonlinear Analysis: Hybrid systems , 2(3):958–970, May 2008.
2[CGG 13] E. Le Corronc, A. Girard, and G. Goessler. Mode sequences as symbolic states in abstractions of incrementally stable switched systems. In 52nd IEEE Conference on Decision and Control , pages 3225–3230, December 2013.
3[GB 94] L. El Ghaoui and V. Balakrishnan. Synthesis of fixed-structure controllers via numerical optimization. In Proceedings of 1994 33rd IEEE Conference on Decision and Control , volume 3, pages 2678–2683, December 1994.
4[Gir 14] A. Girard. Approximately bisimilar abstractions of incrementally stable finite or infinite dimensional systems. In 53rd IEEE Conference on Decision and Control , pages 824–829, December 2014.
5[GP 07] A. Girard and G. J. Pappas. Approximation metrics for discrete and continuous systems. IEEE Transactions on Automatic Control , 52(5):782–798, May 2007.
6[GPT 10] A. Girard, G. Pola, and P. Tabuada. Approximately bisimilar symbolic models for incrementally stable switched systems. IEEE Transactions on Automatic Control , 55(1):116–126, January 2010.
7[HM 09] L. Huang and X. Mao. On input-to-state stability of stochastic retarded systems with Markovian switching. IEEE Transactions on Automatic Control , 54(8):1898–1902, August 2009.
8[JP 09] A. A. Julius and G. J. Pappas. Approximations of stochastic hybrid systems. IEEE Transaction on Automatic Control , 54(6):1193–1203, June 2009.