This paper develops criteria and algorithms for identifying nonlinear control systems from experiments, ensuring practical applicability with known convergence rates.
Contribution
It introduces new identifiability criteria and an implementable algorithm for nonlinear control system identification, with explicit convergence rate analysis.
Findings
01
Established several identifiability criteria.
02
Proposed an implementable identification algorithm.
03
Explicitly computed convergence rates.
Abstract
This paper studies the identification of nonlinearly parameterized control systems in given experiments. Several identifiability criteria are established and an implementable algorithm is proposed for practicality with the convergence rate explicitly computed.
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl Systems and Identification · Advanced Control Systems Optimization · Iterative Learning Control Systems
Full text
Identification of Disturbed Control Systems††thanks: This work was supported in part by the National Natural Science Foundation of China under Grants 61422308 and 11688101.
Chanying Li
C. Li is with the Key Laboratory of Systems and Control, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, P. R. China and the School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing 100049, P. R. China.
Abstract
This paper studies the identification of nonlinearly parameterized control systems in given experiments. Several identifiability criteria are established and an implementable algorithm is proposed for practicality with the convergence rate explicitly computed.
Consider the nonlinearly parameterized control system
[TABLE]
where xt,yt,ut and (wt,vt) represent the p×1 state vector, q×1 output vector, r×1 input vector and (p+q)×1 noise vector, respectively. Denote
χt≜(xt,…,xt−m+1) as the state regressor. Unknown parameter θ is non-random and belongs to a known nondegenerate compact hyperrectangle Θ⊂Rn. Moreover,
f:Rn×Rr×Rpm→Rp and h:Rn×Rp→Rq are two known functions. Let
h−1:Rn×Rq→2Rp be a set-valued function that
h−1(x,y)≜{z:h(x,z)=y}, then assume
A1
The noises {wt} and {vt} are two i.i.d sequences satisfying:
(i) {wt} is independent of {vt};
(ii) for each t≥1, (wt,vt) is independent of χ0 and {ui}0≤i≤t−1;
(iii) ∥w1∥≤Cw and ∥v1∥≤Cv for some Cw>0 and Cv≥0. In addition,
[TABLE]
where W≜B(0,Cw)⊂Rp and V≜B(0,Cv)⊂Rq.
A2
f and h are continuous; h−1 is bounded-valued and
upper semicontinuous*** A set-valued function ζ:X→2Y is bounded-valued if for ∀x∈X, ζ(x) is bounded. ζ
is said to be upper semicontinuous if for any x∈X with ζ(x)=∅ and any neighborhood U of ζ(x), there is a dx>0 such that ζ(B(x,dx))⊂U. .
An important issue in system identification is to solve the identifiability of system (1) in an
experiment (χ0,{ut})∈E, where E is the set of all admissible experiments defined by
[TABLE]
This direction arises from numerous engineering applications where identification has to be performed in control processes, especially with feedbacks inherent [1], [2], [3], [6], [10], [11], [14]. Unlike identification operating in open loop, a prominent feature of closed-loop identification is that there is no design level on data in parameter estimation, once a feedback law is chosen. In this paper, we assume that the experiment is designed in advance for control purposes. Then, outputs yt will be produced by control system (1) automatically. We aim to identify parameter θ in the running process of the control system.
Historically, identification of noise-free systems from input-output data has been well addressed. Literatures on this topic have also shed some light on the determining factor of identifiablity for disturbed control systems. As stated by [7], parameter identification is in nature a procedure of distinguishing output trajectories of different parameters. From this viewpoint, the critical criterion, in some sense, on linear system structure was deduced by [7]. Nonlinear systems with noises absent were treated therein as well. Considering noises, however, different observations might be produced by the same parameter. We thus introduce the definition of identifiability for disturbed control systems as following.
Definition 1**.**
System (1) is identifiable under experiment (χ0,{ut})∈E, if there is an estimator such that the unknown parameter θ in Θ can be uniquely determined by the data set Z∞≜{yt+1,ut}t≥0 with probability 1.
Examining output trajectories to check identifiability is not straightforward in most circumstances. So, interesting move to derive some simple identifiability criteria. This is exactly the first part of the paper, where it is argued in Section 2 that the excitation points of control system (1) are crucial for identifiability. In fact, given any experiment in E, the identifiability of system (1) is ensured
if the excitation point set is sufficiently dense. A lower bound of the required density is computed accordingly. On the other hand, if the density of the excitation points is smaller than the lower bound, the identification may possibly fail. Generally speaking, this structure condition for identifiability is weaker than that for the noise-free case. This is because noises {wt} in the state equation are advantageous
in identification, as suggested by the results.
Since the estimator studied for the proofs of Theorems 4 and 5 is only of theoretical interest,
the second part of the paper is intended to introduce an implementable algorithm
for the sake of practicality. The proposed estimator is called the grid searching (GS) estimator and has its origins in the nonlinear least-squares (NLS) method, whose asymptotic behaviours and approximation algorithms have been explored for decades [4], [5], [12], [13], [16]. By modifying the NLS method in Section 3,
the GS estimator is proved to be strong consistent for a basic class of disturbed control systems under some appropriate conditions. This estimator
can also cope with the situation where the noise variances are unknown.
2 Identifiability for Control Systems
We shall establish some identifiability criteria for system (1) on the basis of experiment data.
2.1 Notations
Throughout this paper, we consider the probability measure space (Ω,F,P). The notations and definitions used in this section are introduced here. Let diam(x,A)≜supx′∈Ad(x,x′), where d(⋅,⋅) denotes the distance between two points.
Denote φ≜(z1,…,zm), ψ≜(z1′,…,zm′) and χ≜(z1′′,…,zm′′) with zj,zj′∈Rq,zj′′∈Rp,j∈[1,m]. Then, for x∈Rn,w∈Rp,v∈Rq, define
[TABLE]
So, hˉ−1 and h^ are set-valued functions. Denote the images of hˉ, h′ and h^ at fixed points (x,χ), (x,u,χ)
and (x,u,φ), respectively,
by \mboxIm(hˉx,χ)≜hˉ(x,χ,Vm),
[TABLE]
Now, let k,l∈N+. View set Z⊂Rk as a point zˇ∈2Rk and by a slight abuse of notation, we write zˇ=Z.
Now, for function (respectively, set-valued function) ζ:Rk→Rl (respectively, 2Rl), define Bζ:2Rk→2Rl by Bζ(zˇ)=ζ(Z).
Let ζi be two functions and zˇi=Zi,i=1,2. We say
[TABLE]
Given ϵ>0, for any z∈Rk, denote zˇϵ=B(z,ϵ)∈2Rk. Then,
let Vˇϵm≜⋃ψ∈Vmψˇϵ and Πˇϵ≜⋃π∈W×Vπˇϵ.
Define the images of Bhˉ, Bh′ and Bh^ at points (xˇ,χˇ)∈2Rn×2Rpm, (xˇ,uˇ,χˇ)∈2Rn×2Rr×2Rpm
and (xˇ,uˇ,φˇ)∈2Rn×2Rr×2Rqm by
[TABLE]
2.2 Motivations and Excitation Points
Let us first look at a simple system
[TABLE]
where φt≜(yt,…,yt−m+1) is an observable pm×1 vector. The experiment thus becomes (φ0,{ut}) in E
and Assumptions A1–A2 degenerate to
A1’
{wt} is an i.i.d sequence satisfying
(i) for each t≥1, wt is independent of χ0 and {ui}0≤i≤t−1;
(ii) ∥w1∥≤Cw for some finite Cw>0 and
[TABLE]
A2’
f and f′ are continuous.
The most familiar experiments are the ones that casue ∥φt∥≤C, ∀t≥1 almost surely for some C>0. Apparently, if ∥φt∥≤C, by (1), (3) and Assumption A2’, it is easy to compute a C0>0 that
[TABLE]
and hence
(ft,…,ft+m−1)∈S≜∏i=1mB(0,C0)⊂Rpm. So, the following result is not suprising.
Theorem 2**.**
Under Assumptions A1’–A2’, let (φ0,{ut})∈E be an experiment such that P{∥φt∥≤C,i.o.}=1 for some C>0.
Then, control
system (13) is identifiable if for each pair x,x′∈Θ with x=x′, there are sufficiently dense points β∈S such that
f(x,β)=f(x′,β).
This theorem is a direct consequence of Theorem 4 appearing in a later section.
The observation of the above theorem enlightens us to introduce set
[TABLE]
where u is restricted to B(0,Cu) and
[TABLE]
We call η∈Pα an excitation point of α∈A0 for system (1). If a system (f,h) has sufficiently dense excitation points of α, then states χt are very likely to fall in Pα. This means it is relatively easy to distinguish x and x′.
Heuristically, Pα={β:f(x,β)=f(x′,β)} is composed of the points where different parameters give rise to different values of f.
Theorem 2 suggests that the identifiability of a control system depends on the density of Pα,α∈A0.
More precisely, for two sets Z,Z′∈Rl,l≥1, we define the lower density of Z′ in Z by
[TABLE]
Further, when Z=∏j=1mZj,Zj⊂Rl,l≥1 and Z′⊂Rlm, the m-symmetric lower density of Z′ in Z is defined by
[TABLE]
Clearly, d1(Z′∣Z)=d(Z′∣Z). To identify parameter θ, the density of Pα for control system (1) is deduced in the next subsection.
2.3 Identifiability Criteria
The criteria are presented in two cases.
2.3.1 Criterion for C-Recurrence
System states are usually constrained in a bounded area
in practice. It is a special case of C-recurrence defined below:
Definition 3**.**
*An experiment (χ0,{ut})∈E is said to be C-recurrent for some C>0, if the corresponding states satisfy
P{∥χt∥≤C,i.o.}=1.
The main result of this section is stated as follows.
Theorem 4**.**
Under Assumptions A1–A2, control system (1)
is identifiable for any C-recurrent experiment (χ0,{ut})∈E if dm(Pα∣S)>1/Cw for each α∈A0.
Remark 2.1**.**
General speaking, the lower bound 1/Cw in Theorem 4 cannot be further relaxed.
For example, consider system (13) with
[1,2]⊂Θ⊂R, m=2,p=1. Assume f′(u,y1,y2)=uy2 and
[TABLE]
It is evident that d2(P(1,2)∣S)=1/Cw. Moreover,
θ cannot be identified in experiment
((0,0),{0}), which is C-recurrent for any given C>0.
Remark 2.2**.**
To some extent, noises {wt} in the state equation are advantageous
in the closed-loop identification, whereas {vt} in the observation equation play an opposite role. This observation becomes clear during the proof of Theorem 4.
2.3.2 Criterion for General Case
Generally, given an α∈A0,
the excitation points
of α are expected in the following set for some ϵ>0:
[TABLE]
where u is only need to be considered in B(0,Cu).
Theorem 5**.**
Under Assumptions A1–A2, control system (1)
is identifiable for any experiment (χ0,{ut})∈E if for each α∈A0, there exists some ϵ>0 such that dm(Pα(ϵ)∣Rpm)>1/Cw.
We have thus far solved the identifiability issue. Later,
an implementable algorithm will be provided in Section 3 with the convergence rates explicitly computed.
The proofs of Theorems 4 and 5 are similar, so we only give the detailed proof of Theorem 4.
2.4.1 Theoretical Nonlinear Estimator
To design an estimator competent for the identification task, we need a simple result on functions f and h. For this, let
{Θk⊂Rn,k≥0} be a series of sets with Θ0≜Θ and Θk⊂Θk−1.
Define
[TABLE]
where ck≤k1 is properly small such that ({x}×Θk−1)∩Ak=∅ for all x∈Θk−1.
Lemma 6**.**
*Let Assumption A2 hold and dm(Pα∣S)>1/Cw,∀α∈A0. Then, for each k≥1, a finite covering of Ak in the form
{Ni,k×Ni,k′}nk−1≤i<nk,n0=1
exists. Moreover, for every i∈[nk−1,nk), the following two statements hold:
(i) Ni,k∩Nj,k=∅,Ni,k′∩Nj,k′=∅ for j=i and (Ni,k×Ni,k′)∩Ak=∅;
(ii) there exists a finite set of points Δik=∏j=1mEjik⊂Pα for some α∈A0 with*
[TABLE]
a sequence
{ψis∈Vm,(wis,l∗,vis,l∗)∈W×V,Ulis⊂Rr}1≤s≤n^k,1≤l≤n^is with {Ulis} being some mutually disjoint sets that B(0,Cu)=∑l=1n^isUlis and a number dk∈(0,mσk), where n^k, σk, dk depend only on k and n^is depends on i,s, such that for every s∈[1,n^k], if (x,χ,ψ,w,v,u)∈Ni,k×B(ηis,dk)×B(ψis,dk)×B((wis,l∗,vis,l∗),dk)×Ulis, l∈[1,n^is], then
[TABLE]
Proof.
Note that dm(Pα∣S)>1/Cw
for each α∈A0. Take a σα∈(0,Cw) and a sequence {Eˉjα}1≤j≤m such that ∏j=1mEˉjα⊂Pα and
minj∈[1,m]d(Eˉjα∣B(0,C0))>1/(Cw−2σα). Then, for every j∈[1,m] and z∈B(0,C0), z∈B(ejα,Cw−2σα) for some ejα∈Eˉjα.
Consequently, by the compactness of B(0,C0), there exists a finite set Ejα={ejα∈Eˉjα} such that
[TABLE]
So, for every α∈A0 and j∈[1,m],
[TABLE]
Clearly, Δα≜∏j=1mEjα⊂Pα and n^α≜∣Δα∣ is finite as well.
Now, fix k≥1. Given
α=(x,x′)∈Ak,
(16) shows that for any ηαs∈Δα and u∈B(0,Cu), there are some ψαs∈Vm and (wαs,u∗,vαs,u∗)∈W×V such that
[TABLE]
In view of Assumption A2, h′,hˉ are continuous and h^ is upper semicontinuous and bounded-valued.
By the compactness of W and V, \mboxIm(h^x,u,φ) is upper semicontinuous and bounded-valued at (x,u,φ) as well. Because B(0,Cu) is compact, for each s∈[1,n^α], there exist some mutually disjoint sets {Ulαs}1≤l≤n^αs with B(0,Cu)=∑l=1n^αsUlαs, some points {(wαs,l∗,vαs,l∗)∈W×V}1≤l≤n^αs, some neighbourhoods Bxα and Bx′α
of x and x′ respectively and a number εα>0,
such that
if u∈Ulαs, then
[TABLE]
where Dˉαs,l≜B(ηαs,εα)×B((wαs,l∗,vαs,l∗),εα). Note that Bxα,Bx′α and εα are taken independent of s∈[1,n^α].
In addition, as long as Bxα is sufficiently small, there is a dα∈(0,εα) satisfying
[TABLE]
Now, for any (x,x′)∈Ak, we find an open set Bxα×Bx′α fulfilling
(24) and (25) for all ηαs∈Δα,s∈[1,n^α].
Therefore, the compact set Ak can be covered by some finite open sets {Bi,k×Bi,k′,nˉk−1≤i<nˉk} (nˉ0=1), where for each i∈[nˉk−1,nˉk), it corresponds to a set Δik=∏j=1mEjα=Δα for some α∈Ak, a sequence {(ψis,wis,l∗,vis,l∗)∈Vm×W×V,Ulis⊂Rr}1≤s≤n^k,1≤l≤n^is with B(0,Cu)=∑l=1n^isUlis and some numbers σk,dk,εk with 0<dk<min{εk,mσk}, such that for any (x,φ)∈Bi,k×B(βis,εk) with
βis=hˉ(x,ηis,ψis) and ηis∈Δik,s∈[1,n^k],
[TABLE]
and when u∈Ulis, 1≤l≤n^is,
[TABLE]
So, if for some s∈[1,n^k],l∈[1,n^is], φ=hˉ(x,χ,ψ), u∈Ulis and
[TABLE]
then by (26)–(27),
h′(x,u,χ,w,v)∈/(⋃z∈Bi,k′\mboxIm(hˉz,u,φ)).
Finally,
let {Ni,k} and {Ni,k′} be a series of refined sets of {Bj,k} and {Bj,k′}, respectively, such that Ni,k∩Ni′,k=∅ and Ni,k′∩Ni′,k′=∅,i′=i. Clearly, {Ni,k×Ni,k′}i∈[nk−1,nk) is a finite covering of Ak. Without loss of generality, let (Ni×Ni′)∩Ak=∅,i∈[nk−1,nk). Since every Ni,k×Ni,k′⊂Bj,k×Bj,k′ for some j∈[nˉk−1,nˉk), (21) follows immediately. Besides, (20) holds by (22).
□
We now provide a theoretical estimator to identify parameter θ.
Rewrite the finite covering of Ak,k≥1 in Lemma 6 by
[TABLE]
So, Nˉi,k∩Nˉj,k=∅,∀i=j and ∑i=mk−1mk−1mk,i=nk−nk−1. Let θ0 be the center of Θ.
Algorithm:
Step 1
Let t0=0, θ^0=θ0 and Θ^i,0=Nˉi,1×Θ0 for all i=1,…,m1−1.
Step 2
For t>tk−1,k≥1, if Θ^i,t−1=(Θ^i,tk−1∖Ak) for all i∈[mk−1,mk), denote
[TABLE]
Let
Θ^i,t=Θ^i,t−1∖((Nˉi,k×⋃j∈JitkNˉij,k′)∩Ak), where i∈[mk−1,mk).
If for all i∈[mk−1,mk), Θ^i,t=(Θ^i,tk−1∖Ak), set
[TABLE]
Step 3
For t>tk−1,k≥1, if Θ^i,t=(Θ^i,tk−1∖Ak) for some i∈[mk−1,mk), take a point (x,x)∈Θ^i,t and set
[TABLE]
Set Θk=B(θ^t,ck), Θ^i,t=Nˉi,k+1×Θk for i=mk,…,mk+1−1, and tk=t.
Remark 2.3**.**
If tk−1<∞ for some k≥1, then the algorithm implies that Θk−1, Ak and {Nˉi,k×Nˉij,k}i∈[1,mk),j∈[1,mk,i] are well defined. As a result, in Lemma 6, {(ηis,ψis,wis,l∗,vis,l∗),Ulis}i∈[1,nk),s∈[1,n^k],l∈[1,n^is] are also well defined.
For each k≥1 and i∈[nk−1,nk), denote Γik≜⋃s=1n^k⋃l=1n^isDis,lk with
[TABLE]
where ηis,ψis,wis,l∗,vis,l∗,Ulis and dk>0 are defined in Lemma 6.
Lemma 7**.**
Let (φ0,{ut})∈E be an experiment designed that for each k≥1, if tk−1<∞ almost surely and tk=∞ on a set D with P(D)>0, then
[TABLE]
*will hold almost surely on D for all i∈[nk−1,nk) satisfying θ∈Ni,k, where ψt−1≜(vt−1,…,vt−m)T. Then, under the conditions of
Lemma 6, the nonlinear estimator constructed by (29)–(30) satisfies
limt→∞θ^t=θ almost surely.
*
Proof.
We first show that under an experiment (φ0,{ut})∈E designed in this lemma, the nonlinear algorithm will fulfill tk<∞ and θ∈Θk for all k≥0 almost surely (this also means Θk are well defined for all k almost surely). Since t0=0 and Θ0=Θ,
suppose for some k≥1, ti<∞ and θ∈Θi for all i∈[0,k−1] almost surely. We claim that tk<∞ a.s. for this k. Otherwise, there is a set D with P(D)>0 such that tk=∞ on D. Now, tk−1<∞, by Remark 2.3, Θk−1 and Ak are well defined.
Note that ({θ}×Θk−1)∩Ak=∅ and hence
θ∈Nˉς,k for some ς∈[mk−1,mk). Let
[TABLE]
So, ∣Ik∣=mk,ς>0.
The experiment ensures
Tik=∅ for all i∈Ik on D almost surely. Consequently, for each j∈[1,mk,ς] which corresponds to an integer i(j)∈Ik, there exist some random integers t(j),s(j),l(j) taking values in Ti(j)k,[1,n^k] and [1,n^i(j)s(j)] respectively such that
[TABLE]
Considering θ∈Nˉς,k=Ni(j),k,
by statement (ii) of Lemma 6,
[TABLE]
holds almost surely on D, where φt(j)−1=hˉ(θ,χt(j)−1,ψt(j)−1).
Now, by Step 3 of the algorithm, it is clear that for each i∈[mk−1,mk−1],
[TABLE]
Since tk=∞ on D, Θ^ς,t=(Θ^ς,tk−1∖Ak) for all t≥tk−1 on D.
Denote tˉk−1≜max1≤j≤mk,ςt(j), then (33) yields Jςtˉk−1k={1,…,mk,ς} a.s. on D. So, by Step 2,
[TABLE]
on D almost surely, which leads to a contradiction. Therefore, tk<∞ almost surely. Moreover, Step 3 implies that Θk is well defined almost surely.
The remainder is devoted to verifying θ∈Θk on {tk<∞}. Take a trajectory on which tk<∞. The follow-up arguments are restricted on this trajectory. Denote
[TABLE]
which means for each i∈Ik′,
there is a point x∈Nˉi,k such that d(x,θ)>ck.
Recall that θ∈Θk−1, then (x,θ)∈Ak and thus
θ∈Nˉij,k′ for some j∈[1,mk,i] due to
[TABLE]
So, for all t>tk−1,
yt∈⋃z∈Nˉij,k′\mboxIm(h^z,ut−1,φt−1),
which implies ((Nˉi,k×Nˉij,k′)∩Ak)=∅ belongs to Θ^i,t. Consequently, Θ^i,t=(Θ^i,tk−1∖Ak) for all t>tk−1 whenever i∈Ik′. Now, tk<∞, so any index ς causes Θ^ς,tk=(Θ^ς,tk−1∖Ak) at Step 3 must satisfy ς∈[mk−1,mk)\Ik′. Hence, diam(θ,Nˉς,k)≤ck.
Moreover, because of (34), θ^tk in (30) is well defined at Step 3 and θ^tk∈Nˉς,k. As a result,
[TABLE]
which immediately yields that θ∈Θk=B(θ^tk,ck) on the fixed trajectory.
Therefore, we have verified that tk<∞ and θ∈Θk for all k≥0 almost surely and hence (35) holds for all k≥1 accordingly.
Since Step 2 in the algorithm implies that for each k≥1,
[TABLE]
the lemma is thus proved by letting k→∞.
□
2.4.2 Proofs of the Theorems
Some notations are needed in the sequel. For each t≥0, denote ft≜f(θ,ut,χt) and Ωt≜{∥χt∥≤C}. Let
[TABLE]
Write Ejik={es,jik}1≤s≤∣Ejik∣,j∈[1,m] in Lemma 6. Clearly, ∏j=1m∣Ejik∣=n^k.
In addition, by Assumption A2,
[TABLE]
Lemma 8**.**
*Let tk−1<∞,k≥1 and i∈[nk−1,nk). If dm(Pα∣S)>1/Cw for all α∈A0 and Assumption A2 holds, then for each t≥m, there are some random integers {stj∈Ft−j}j∈[1,m] taking values in Nk,j={1,…,∣Ejik∣} on Ωt−m such that *
[TABLE]
where Bj,tik≜B(estj,jik,Cw−σk),j∈[1,m] and σk is defined in Lemma 6.
Proof.
Since tk−1<∞,k≥1, by the algorithm and Lemma 6, all the quantities appearing in the lemma are well defined. Fix
i∈[nk−1,nk). Note that by (39), (ft−1,…,ft−m)∈S on Ωt−m, then
for j=1,…,m, define
[TABLE]
Random sequence {stj}j∈[1,m] is well defined on Ωt−m since dm(Δik∣S)>1/(Cw−σk) by Lemma 6. So, stj∈Ft−j and (40) follows immediately.
□
Lemma 9**.**
Let (χ0,{ut})∈E be a C-recurrent experiment and dm(Pα∣S)>1/Cw for all α∈A0. Then, under Assumptions A1–A2, for each k≥1,
(32)
holds for all i∈[nk−1,nk) a.s. whenever tk−1<∞ a.s..
Proof.
Fix a k≥1 that tk−1<∞ a.s. and take an integer i∈[nk−1,nk).
Let
[TABLE]
and Dis,lk(2)≜B((wis,l∗,vis,l∗),dk)×Ulis. Recall that {Ulis} are mutually disjoint,
then
{Disk(1)×Dis,lk(2)}s∈[1,n^k],l∈[1,n^is]
are mutually disjoint as well.
As a result, for t≥m,
[TABLE]
By Assumption A1, for each s∈[1,n^k] and l∈[1,n^is], there is a ρk,1>0 such that
[TABLE]
and hence, by the independence of χt and ψt, (2) indicates that for some ρk,2>0,
[TABLE]
where Ftχ≜σ{Ft−m∪σ{χt}},t≥m. So, Ft−m⊂Ftχ.
Now, at time t≥m,
take {stj}j∈[1,m] in Lemma 8, which corresponds to some random index st and
point ηistk=(est1,1ik,…,estm,mik)T taking values in {1,…,n^k} and Δik on set Ωt−m, respectively. Let dˉk=dk/m<σk and
[TABLE]
According to Lemma 8, Ωt,jik is Ft−j measurable, j∈[1,m].
So, by Assumption A1 and Lemma 8, for any t≥m, there is a ρk,3>0 such that
[TABLE]
where the third inequality follows from (40).
So, in view of (47), for each t≥m,
[TABLE]
Now, for t≥1 and l∈[0,m], denote ζt,l≜χ(m+1)(t−1)+l and
[TABLE]
Clearly, {(ζt,l,ζt,l′)}t≥1
is adapted to the filtration {Ft,l′}t≥1 with Ft,l′≜F(m+1)t+l.
Since the experiment is C-recurrent,
∑t=m∞IΩt−m=∞ almost surely.
So, by (51),
[TABLE]
which means there at least exists some l∈[0,m] such that Pl=∞ a.s..
According to the Borel-Cantelli-Leˊvy theorem,
[TABLE]
Since tk−1<∞ almost surely, it is obvious that
for every i∈[nk−1,nk),
[TABLE]
The result follows immediately.
□
Proof of Theorem 4: It is a direct result of Lemmas 7 and 9.
Proof of Theorem 5: Given
α∈A0, since dm(Pα(ϵ)∣Rpm)>1/Cw for some ϵ>0, a countable set Δα=∏j=1mEjα⊂Pα(ϵ) exists (∣Δα∣=ℵ0) and
d(Ejα∣Rp)>1/(Cw−σα),σα∈(0,Cw).
If η∈Δα, by (19), for any u∈B(0,Cu), there are some ψ∈Vm and π(u)∈W×V such that
Moreover, \mboxIm(h^xˇϵ′,uˇϵ,βˇ)⊂⋃ψ∈Vm,π∈W×Vh^(xˇϵ′,uˇϵ,βˇ,ψˇϵ,πˇϵ), then it yields
[TABLE]
So, a similar proof of Lemma 6 shows that
Lemma 6 holds with Pα replaced by Pα(ϵ), n^k=ℵ0 and S=Rmp (C0=∞). Now, since any (χ0,{ut}) can be viewed as a C-recurrent experiment with C=∞ and Lemmas 8–9 are still true for C=∞, the result follows from Lemma 7.
3 Implementable Algorithm
The estimator in Section 2.4.1 is only theoretical valid, so we are going to develop an implementable nonlinear estimator here. For simplicity,
study the following basic control system
[TABLE]
in an experiment (χ0,{ut})∈E, where E is defined by (3),
θ∈Θ⊂Rn, ut,yt,wt are scalars and
φt=(yt,…,yt−m+1)T. Moreover,
f(x,z):Rn×Rm→R is known and ∂x∂f(x,z) exists. Both the above two functions are continuous. Assume
B1
{wt} is an i.i.d sequence with Ew1=0 and E∣w1∣κ<∞,κ>4. In addition,
Assumption B1 includes a large class of familiar distributions, such as uniform distribution U(−Cw,Cw) for finite Cw, as well as
Gaussian distributions and t-distributions for Cw=∞.
3.1 Grid Searching Estimator
Assumption B1 implies that Ew12 exists. Denote σw2≜Ew12 and σˉw2≜E(w12−σw2)2. Recall that Ωi={∥φi∥≤C} for some given C>0 (C can be taken ∞). Let γ>0 and define
[TABLE]
Let ηt(γ)≜∑i=1tΩi(γ,C).
At time t≥2, the grid searching estimator is designed according to function G^t:Rn×R→R defined below:
[TABLE]
Moreover, we remark that the knowledge of σω2 can be described by one of the following three scenarios:
(i) σω2 is known. Let Σt0≡{σw2}, t≥1.
(ii) σω2 is unknown without any prior information. Let Σt0=[0,t], t≥1.
(iii) σω2 is unknown but bounded by a known constant σ>0, i.e., σw2≤σ. Let Σt0≡[0,σ], t≥1.
Let λ,γ,C>0 be some adjustable parameters and let
[TABLE]
Algorithm
Step 1: At time t=0, denote o0 and σ02 as the center points of sets Θ and Σ00, respectively. Set
[TABLE]
Step 2: At time t≥1, equally divide Θ and Σt0 into two finite sequences of small boxes {Θti} and {Σtj} that Θ=⋃iΘti and Σt0=⋃jΣtj, where
the side lengthes of Θti and Σtj are less than 1/(t41−2κ1−λ) and 1/(t21−κ1−λ), respectively. Let oti and σtj2 be the center points of Θti and Σtj. If
[TABLE]
set
[TABLE]
Otherwise, for Jt=∅, take an arbitrary (i∗,j∗)∈Jt satisfying
[TABLE]
Set
[TABLE]
3.2 Strong Consistency
For 1≤k≤n, let x(k),xˉ(k)∈R2k−1n,
y(k),yˉ(k)∈R2k−1m and
z(k)=\mboxcol{x(k),y(k)}. Write x(1)=(x1(1),…,xn(1)). Now,
recursively define a sequence of functions {gj(k),1≤k≤j≤n} for system (52) as follows:
[TABLE]
Example 3.1**.**
In system (52) with n=1, g11(x,y)=∂x∂f(x,y). For n=2,
[TABLE]
The convergence of estimates θ^t is related to the density of set
[TABLE]
in R2n−1m for Cw=∞ or
in S=B(0,C0)⊂R2n−1m for Cw<∞, where C0 is defined similarly as that in (15).
This claim is verified for the case where
the closed-loop system is stable, i.e.,
[TABLE]
Theorem 10**.**
Under Assumption B1, let the closed-loop system (52) be stable.
If for each x∈Θ, either
dm(P′∣S2n−1m)>1/Cw for Cw<∞ or
P′=∅ for Cw=∞, then by choosing parameter γ sufficiently large and parameter λ∈(0,41−2κ1), the grid searching estimator satisfies
[TABLE]
Example 3.2**.**
Let us consider system (52) with f(x1,x2,y)=x1yb1+x2yb2, where x1,x2,y∈R and b1=b2. By Example 3.1, g22(x1,x2,xˉ1,xˉ2;y,yˉ)=yb1yˉb2−yˉb1yb2, which causes P′ dense in R2.
Example 3.3**.**
If Cw=∞, the only requirement on P′ for parameter identifiability is P′=∅. This applies to a lot of control systems. For instance, in system (52), let f(x,y)=\mboxsin(xy) for x,y∈R and Θ=[0,2π]. Example 3.1 shows gnn(x,y)=cos(xy). If y=1/8, then cos(xy)∈[2/2,1] for all x∈[0,2π]. Thus, 1/8∈P′.
We first introduce some notations. For two vectors p=(pi)i=1l,q=(qi)i=1l,l≥1, we say p≺q if there is an index j∈[1,l) such that pi=qi,1≤i≤j and pj+1<qj+1.
Define a series of sets {Hkt} by
[TABLE]
Lemma 11**.**
Let αi≜(ai,1,…,ai,n)T, i∈[1,t] for some fixed t≥1 and n≥1. Denote M(k) as the kth order leading principal minor of det(∑i=1tαiαiT) for k∈[1,n] and M′(k,k) as the k,k cofactor of M(k+1) for k∈[1,n−1]. If ∑i=1tai,j2=0 for all j∈[1,n], then there is a sequence {μh(k),νh(k),h∈Hkt,k∈[1,n]} such that
each M(k) and M′(k,k) can be written as †††∏j=10(∑h∈Hjtμh2(j))−1≜1.
[TABLE]
where, for each h=(p,q)∈Hkt,k∈[2,n],
[TABLE]
and there is a function ζh,k(⋅):Rtk→R independent of αi,i∈[1,t] such that
[TABLE]
Proof.
Let n=2.
Clearly, M(1)=∑i=1tai,12,M′(1,1)=∑i=1tai,22 and
[TABLE]
Similarly, M′(2,2)=∑(p,q)∈H2t(ap,1aq,3−aq,1ap,3)2.
Hence,
the lemma is true when n=2 with μh(1)=ah,1,νh(1)=ah,2,h∈H1t and
[TABLE]
Now, let n≥3. Suppose for some integer l∈[2,n−1], there is a sequence {μh(k),νh(k),h∈Hkt,k∈[1,l]} satisfying (65)–(67),
then we will show the existence of {μh(k),νh(k),h∈Hkt,k∈[1,l+1]} such that (65)–(67) hold.
For k=l+1, write M(k) as a block matrix by
[TABLE]
where
[TABLE]
Since ∑i=1tai,12=0, then
[TABLE]
Note that the (j,s) entry of M2(k)(∑i=1tai,12)−M1(k)M1T(k) is
[TABLE]
Let αp,q′(k)≜(ap,1aq,2−aq,1ap,2,…,ap,1aq,k−aq,1ap,k)T,
then
[TABLE]
Observe that matrix ∑(p,q)∈H2tαp,q′(l+1)(αp,q′(l+1))T has the same form of M(l), which is of dimension l. Moreover, ai,j,j∈[1,l] can be taken any values in M(l), so
by the assumption and (68),
[TABLE]
holds for some {μh′(k),νh′(k),h∈Hk+1t,k∈[1,l]} satisfying
[TABLE]
In addition, there is a sequence of {ζh,k′(⋅),h∈Hk+1t,k∈[1,l]} such that
Note that ζh,s+2 and ζh,s+1′ are independent of the values of αi,i∈[1,t], then
[TABLE]
Or equivalently, νh′(s+1)=νh(s+2). Therefore, μh′(k)=μh(k+1) and νh′(k)=νh(k+1) for all k∈[1,l−1].
Define μh(l+1)≜μh′(l) for all h∈Hl+1t, then
[TABLE]
Since ∑i=1tai,12=∑h∈H1tμh2(1), combining (72) and (73) leads to the first formula of (65) immediately for k=l+1.
If l<n−1, also let νh(l+1)≜νh′(l),h∈Hl+1t. Note that
[TABLE]
Finally, for each h∈Hl+1t, there is a ζh,l+1 independent of αi,i∈[1,t] such that
[TABLE]
So, with {μh(l+1),νh(l+1),h∈Hl+1t} defined above, (65)–(67) hold for k=l+1,
which completes the proof by induction.
□
Lemma 12**.**
Let the conditions of Lemma 11 hold and
denote λmin(∑i=1tαiαiT) as the minimal eigenvalue of matrix ∑i=1tαiαiT.
If there is a number ϵ>0 such that for each k∈[1,n−1] and s∈[k+1,n],
[TABLE]
where νh,s(k)≜ζh,k(ai,j,i=1,…,t;j=1,…,k−1,s),h∈Hkt,s∈[k,n]‡‡‡νh,k(k)=μh(k), then
[TABLE]
Proof.
Let π(n−1) be the set of the (n−1)-permutations of {1,2,…,n}. For p=(i1,…,in−1)∈π(n−1), define αi,p≜(ai,i1,…,ai,in−1)T and denote the n eigenvalues of ∑i=1tαiαiT by λi,1≤i≤n with λi≥λi+1,1≤i≤n−1. According to the Vieta’s formulas, one has
[TABLE]
Note that reordering the n elements ai,1,…,ai,n of vector αi,i∈[1,t] does not change the minimal eigenvalue of
∑i=1tαiαiT. So,
without loss of generality, for p1=(1,2,…,n−1), assume
which, by (85), leads to
λn≥nϵn−1(∑i=1tai,n2).
The lemma thus follows.
□
Lemma 13**.**
*Assume either
dm(P′∣S2n−1m)>1/Cw for Cw<∞ or
P′=∅ for Cw=∞.
Then, the following two statements hold:
(i) there is a sequence of sets Bjl≜{bjl,s}s∈[1,Njl],j∈[1,2n−1],l∈[1,m] with integers Njl≥1 such that
∏j=12n−1∏l=1mBjl⊂P′, and if Cw<∞,*
[TABLE]
(ii) there is a number d>0 such that
[TABLE]
where D≜∏j=12n−1Dj and
[TABLE]
Proof.
Since either dm(P′∣S2n−1m)>1/Cw for Cw<∞ or
P′=∅ for Cw=∞, statement (i) is straightforward (Njl≡1 for Cw=∞).
To show statement (ii), note that gnn(x,y) is continuous, then for each x∈Θ2n−1, there is a number dx>0 and a neighbourhood Bx of x such that for
Dj(x)≜∏l=1m(⋃s∈[1,Njl][bjl,s−dx,bjl,s+dx]),
[TABLE]
Now, Θ2n−1 is compact, by the finite covering theorem, there is a sequence {x(i)∈Θ2n−1}i∈[1,N] for some N∈N+ such that
Θ2n−1⊂⋃i∈[1,N]Bx(i). So, (88) holds by letting d=min1≤i≤Ndx(i).
□
To state the next lemma, denote ηt≜∑i=1tIi−m, Ωη≜{ω:limt→∞ηt=∞} and let Dj,j∈[1,2n−1] be defined by (89).
Lemma 14**.**
Under the conditions of Theorem 10, for all sufficiently large t,
[TABLE]
where CD>0 is a number independent of t.
Proof.
Let filtration {Fh} be defined by (38). Fix j∈[1,2n−1]. Observe that for each l∈[0,m−1], {I{φhm+l∈Dj}−P(φhm+l∈Dj∣F(h−1)m+l),Fhm+l}h≥0 is a martingale difference sequence, then for all sufficiently large t,
[TABLE]
For h≥m, we compute P(φh∈Dj∣Fh−m)IΩh−m by the following two cases:
(i) Cw<∞. In this case, fh−l=(f(θ,φh−l)+uh−l) falls in S for all l∈[1,m] on set Ωh−m. So, (87) yields
[TABLE]
For h≥m and l∈[1,m], denote
[TABLE]
So, by Assumption B1 and (92),
there is a Cd>0 such that
(ii) Cw=∞. Note that {Dj}j∈[1,2n−1] are bounded, then there is a Cf>0 such that
[TABLE]
Since Njl≡1, by Assumption B1, for any h≥m and l∈[1,m],
[TABLE]
where Cd is a positive number. So, (94) also holds for this case.
Combined with (91), both the two cases indicate that for all sufficiently large t,
[TABLE]
Then, (90) follows from (95) by noting that j is finite.
□
Now, at time t≥1, for any k∈[1,n] and h=(h1,h2,…,h2k−1)∈Hkt, denote
[TABLE]
Take γ sufficiently large that for each h≥1,
[TABLE]
For t≥1, let {θt,h}h∈[1,t] be a sequence of random variables taking values in Θ and
define ϑt,h,h∈Hkt,k∈[1,n] by ϑt,h≜\mboxcol{θt,h1,θt,h2,…,θt,h2k−1}.
Lemma 15**.**
Under the conditions of Theorem 10, there are some Cg,Cg,η>0 such that for all k∈[1,n],s∈[k,n] and all sufficiently large t,
[TABLE]
Proof.
First,
in view of (90) and (98), for all sufficiently large t,
[TABLE]
Moreover, considering Lemma 13, let Cng≜minx∈Θ2n−1miny∈D∣gnn(x,y)∣>0, then
[TABLE]
whenever t is sufficiently large.
Now, recursively define
Ck−1g≜Ckg/(2Cˉg),k=n,…,2, where
for B(0,γ)⊂Rm,
[TABLE]
Because of (100), suppose there is an integer k∈[2,n] such that for all s∈[k,n] and all sufficiently large t,
[TABLE]
Let s∈[k,n] and h=(p,q) with p=(pj)j=12k−2,q=(qj)j=12k−2∈Hk−1t. By (61), on set (⋂j=12k−2Ωpj(γ,C))∩(⋂j=12k−2Ωqj(γ,C)), it is evident that for r=k−1 and s,
[TABLE]
As a result, both r=k−1 and s lead to
[TABLE]
or equivalently, by (101) and ∑q∈Hk−1tIΩqj(γ,C)≤ηt2k−2,
[TABLE]
This implies that (101) is also true for k−1. The lemma is thus proved by taking Cg=min1≤i≤nCig and Cg,η=2nCD2n−1.
□
Lemma 16**.**
For t≥1, let {θt,h}h∈[1,t] be a sequence of random variables taking values in Θ. In Lemmas 11 and 12,
set
[TABLE]
where γ is a positive number. Then, under the conditions of Theorem 10, there is a CP>0 independent of t such that for all be sufficiently large t,
[TABLE]
*In addition, taking γ appropriately large, there is a number ϵ>0 such that for all be sufficiently large t, (81) holds a.s. on Ωη for each k∈[1,n−1] and s∈[k+1,n].
*
Proof.
First,
by (61), (66), (86) and (102), it is easy to verify that for each k∈[1,n−1], h=(p,q), p,q∈Hkt with p=(pj)j=12k−1,q=(qj)j=12k−1 and s∈[k+1,n],
[TABLE]
As a consequence, by Lemma 15 and (107), for each s∈[1,n],
[TABLE]
Hence, (103) holds by letting CP=Cg2Cg,η. Furthermore, for each k∈[1,n−1],
[TABLE]
On the other hand, for every s∈[k+1,n],
[TABLE]
where
[TABLE]
This with (108)
completes the proof by letting ϵ=(Cg2Cg,η)/(2Cg′) in (81).
□
Let t≥1. For each i∈[1,t], denote ϕi(x)≜∂x∂f(x,φi) and define
[TABLE]
where x=\mboxcol{x1,…,xt},xi∈Θ. The next lemma is straightforward.
Lemma 17**.**
*Under the conditions of Theorem 10, let θt≜\mboxcol{θt,1,…,θt,t}, where {θt,h,h∈[1,t]} is a sequence of random variables taking values in Θ. Then,
(i) for all sufficiently large t and
γ, there is a random positive number
C1 such that*
[TABLE]
(ii)
let C2=maxx∈Θ,∥z∥≤γ∥∂x∂f(x,z)∥2 with γ>0 sufficiently large, then
[TABLE]
Proof.
Note that Pt+1−1(θt)≥∑i=1tαiαiT, where αi is defined by (102). In view of Lemma 16, there is a number ϵ>0 such that (81) holds almost surely on Ωη for each k∈[1,n−1] and s∈[k+1,n], and hence Lemma 12 yields
[TABLE]
where (110) follows directly from (103) in Lemma 16.
Next, we show (111).
By (53), if Cw=∞, it is clear that
[TABLE]
When Cw<∞, without loss of generality, assume supi≥1∥φi∥IΩi−m<γ. Hence (111) follows as well.
□
Lemma 18**.**
Under Assumption B1, for any ε∈(0,1),
[TABLE]
Proof.
By Assumption B1, m′≜E∣w1∣τ exists for any τ∈(2,κ]. Observe that τ2∈(1,κ], employing the Minkowski inequality and the Lyapunov inequality yields
[TABLE]
Since {∣wi∣τ−m′,Fi} is a martingale difference sequence with
Under Assumption B1, if (110) and (111) hold for every θt defined in Lemma 17, then for any λ∈(0,41−2κ1),
[TABLE]
where Ωη′≜{t/ηt=O(1)}.
Proof.
Fix λ∈(0,41−2κ1) in the algorithm and let θ be the true value of the parameter. It is clear that for every sufficiently large t,
(θ,σw2)∈Θ×Σt.
Therefore, there are two points otit∈Θ and σtjt2∈Σt such that
Proof of Theorem 10: When the closed-loop system is stable, P(Ωη′)=1 and hence the theorem is a direct result of Lemmas 11–19.
Bibliography16
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] K. J. Astrom and B. Wittemnark, Problems of identification and control , J. Math. Anal. Applic., 34 (1971), pp. 90–113.
2[2] U. Forssell and L. Ljung, Closed-loop identification revisited , Automatica, 35 (1999), pp. 1215–1241.
3[3] M. Gevers and L. Ljung, Optimal experiment design with respect to the intended model application , Automatica, 22 (1986), pp. 543–554.
4[4] P. E. Gill and W. Murray, Algorithms for the solution of the nonlinear least-squares problem , SIAM J. Numer. Anal., 15 (1978), pp. 977–992.
5[5] N. I. M. Gould, S. Leyffer, and P. L. Toint, A multidimensional filter algorithm for nonlinear equations and nonlinear least-squares , SIAM J. Optim., 15 (2004), pp. 17–38.
6[6] M. S. Grewal, G. A. Bekey, and H. J. Payne, Parameter identifiability of dynamcal system , in Proc. IEEE Conf. Decision and Conrrol, 1974, pp. 446–448.
7[7] M. S. Grewal and K. Glover, Identifiability of linear and nonlinear dynamical systems , IEEE Trans. Automat. Control, 21 (1976), pp. 833–837.
8[8] D. W. Huang and L. Guo, “Estimation of nonstationary ARMAX models based on Hannan-Rissanen method”, The Annals of Statistics , 18 (1990), pp. 1729–1756.