Classical Discrete-Time Adaptive Control Revisited: Exponential   Stabilization

Daniel E. Miller

arXiv:1705.01494·math.OC·November 28, 2017·CCTA

Classical Discrete-Time Adaptive Control Revisited: Exponential Stabilization

Daniel E. Miller

PDF

TL;DR

This paper proves that classical pole placement adaptive controllers with projection achieve exponential stability and bounded noise gain without persistent excitation, improving robustness and performance guarantees.

Contribution

It demonstrates exponential stabilization and bounded noise gain for classical adaptive controllers using the original projection algorithm without persistent excitation.

Findings

01

Exponential stability of the closed-loop system.

02

Bounded noise gain in the presence of disturbances.

03

Tolerance to unmodelled dynamics and parameter variations.

Abstract

Classical discrete-time adaptive controllers provide asymptotic stabilization. While the original adaptive controllers did not handle noise or unmodelled dynamics well, redesigned versions were proven to have some tolerance; however, exponential stabilization and a bounded gain on the noise was rarely proven. Here we consider a classical pole placement adaptive controller using the original projection algorithm rather than the commonly modifed version; we impose the assumption that the plant parameters lie in a convex, compact set and that the parameter estimates are projected onto that set at every step. We demonstrate that the closed-loop system exhibits very desireable closed-loop behaviour: there are linear-like convolution bounds on the closed loop behaviour, which implies exponential stability and a bounded noise gain, as well an easily proven tolerance to unmodelled dynamics and…

Equations421

(P_{T}x)(t)=\left\{\begin{array}[]{ll}x(t)&\;\;{t\leq T}\\ 0&\;\;t>T.\end{array}\right.

(P_{T}x)(t)=\left\{\begin{array}[]{ll}x(t)&\;\;{t\leq T}\\ 0&\;\;t>T.\end{array}\right.

y (t + 1)

y (t + 1)

=

=

A (z^{- 1}) := 1 + a_{1} z^{- 1} + \dots + a_{n} z^{- n},

A (z^{- 1}) := 1 + a_{1} z^{- 1} + \dots + a_{n} z^{- n},

B (z^{- 1}) := b_{1} z^{- 1} + \dots + b_{n} z^{- n}

B (z^{- 1}) := b_{1} z^{- 1} + \dots + b_{n} z^{- n}

y (t + 1) = ϕ (t)^{T} θ^{*} + d (t) .

y (t + 1) = ϕ (t)^{T} θ^{*} + d (t) .

e (t + 1) := y (t + 1) - ϕ (t)^{T} \hat{θ} (t);

e (t + 1) := y (t + 1) - ϕ (t)^{T} \hat{θ} (t);

a r g mi n_{θ} {∥ θ - \hat{θ} (t) ∥ : y (t + 1) = ϕ (t)^{T} θ},

a r g mi n_{θ} {∥ θ - \hat{θ} (t) ∥ : y (t + 1) = ϕ (t)^{T} θ},

\hat{\theta}(t+1)=\left\{\begin{array}[]{ll}\hat{\theta}(t)&\mbox{ if $\phi(t)=0$}\\ \hat{\theta}(t)+\frac{\phi(t)}{\phi(t)^{T}\phi(t)}\,e(t+1)&\mbox{ otherwise.}\end{array}\right.

\hat{\theta}(t+1)=\left\{\begin{array}[]{ll}\hat{\theta}(t)&\mbox{ if $\phi(t)=0$}\\ \hat{\theta}(t)+\frac{\phi(t)}{\phi(t)^{T}\phi(t)}\,e(t+1)&\mbox{ otherwise.}\end{array}\right.

\hat{θ} (t + 1) = \hat{θ} (t) + \frac{α ϕ ( t )}{β + ϕ ( t ) ^{T} ϕ ( t )} e (t + 1) .

\hat{θ} (t + 1) = \hat{θ} (t) + \frac{α ϕ ( t )}{β + ϕ ( t ) ^{T} ϕ ( t )} e (t + 1) .

y (t + 1) = - a_{1} y (t) + b_{1} u (t) + d (t)

y (t + 1) = - a_{1} y (t) + b_{1} u (t) + d (t)

y (0) = y_{0} = ε \in (0, 1),

y (0) = y_{0} = ε \in (0, 1),

\hat{\theta}(0)=\left[\begin{array}[]{c}-\hat{a}_{1}(0)\\ \hat{b}_{1}(0)\end{array}\right]=\left[\begin{array}[]{c}1\\ 2\end{array}\right],\;\theta^{*}=\left[\begin{array}[]{c}2\\ 1\end{array}\right]

\hat{\theta}(0)=\left[\begin{array}[]{c}-\hat{a}_{1}(0)\\ \hat{b}_{1}(0)\end{array}\right]=\left[\begin{array}[]{c}1\\ 2\end{array}\right],\;\theta^{*}=\left[\begin{array}[]{c}2\\ 1\end{array}\right]

∣ y (t) ∣ \leq ε^{1/2}, t \in [0, N (ε)] .

∣ y (t) ∣ \leq ε^{1/2}, t \in [0, N (ε)] .

∥ \hat{θ} (t) - θ_{0} ∥ \leq 10 (2)^{1/2} ε, t \in [0, N (ε)] .

∥ \hat{θ} (t) - θ_{0} ∥ \leq 10 (2)^{1/2} ε, t \in [0, N (ε)] .

∣ y (N (ε)) ∣ \geq (1.25)^{N (ε)} ε \Rightarrow \frac{y ( N ( ε ))}{ε} \geq (1.25)^{N (ε)};

∣ y (N (ε)) ∣ \geq (1.25)^{N (ε)} ε \Rightarrow \frac{y ( N ( ε ))}{ε} \geq (1.25)^{N (ε)};

\check{\theta}(t+1)=\left\{\begin{array}[]{ll}\hat{\theta}(t)&\;\;\mbox{ if $\phi(t)=0$}\\ \hat{\theta}(t)+\frac{\phi(t)}{\phi(t)^{T}\phi(t)}\,e(t+1)&\;\;\mbox{otherwise,}\end{array}\right.

\check{\theta}(t+1)=\left\{\begin{array}[]{ll}\hat{\theta}(t)&\;\;\mbox{ if $\phi(t)=0$}\\ \hat{\theta}(t)+\frac{\phi(t)}{\phi(t)^{T}\phi(t)}\,e(t+1)&\;\;\mbox{otherwise,}\end{array}\right.

\hat{θ} (t + 1) := π_{S} (\overset{ˇ}{θ} (t + 1)) .

\hat{θ} (t + 1) := π_{S} (\overset{ˇ}{θ} (t + 1)) .

∥ π_{S} (θ) - θ^{*} ∥ \leq ∥ θ - θ^{*} ∥,

∥ π_{S} (θ) - θ^{*} ∥ \leq ∥ θ - θ^{*} ∥,

e (t + 1) = - ϕ (t)^{T} [\hat{θ} (t) - θ^{*}] + d (t),

e (t + 1) = - ϕ (t)^{T} [\hat{θ} (t) - θ^{*}] + d (t),

∣ e (t + 1) ∣ \leq 2∥ S ∥ \times ∥ ϕ (t) ∥ + ∣ d (t) ∣.

∣ e (t + 1) ∣ \leq 2∥ S ∥ \times ∥ ϕ (t) ∥ + ∣ d (t) ∣.

∣ e (t + 1) ∣ > 2∥ S ∥ \times ∥ ϕ (t) ∥,

∣ e (t + 1) ∣ > 2∥ S ∥ \times ∥ ϕ (t) ∥,

\check{\theta}(t+1)=\left\{\begin{array}[]{l}\hat{\theta}(t)+\frac{\phi(t)}{\phi(t)^{T}\phi(t)}\,e(t+1)\\ \;\;\;\;\;\mbox{ if $|e(t+1)|<(2\|{\cal S}\|+\delta)\|\phi(t)\|$}\\ \hat{\theta}(t)\\ \;\;\;\;\;\mbox{otherwise;}\end{array}\right.

\check{\theta}(t+1)=\left\{\begin{array}[]{l}\hat{\theta}(t)+\frac{\phi(t)}{\phi(t)^{T}\phi(t)}\,e(t+1)\\ \;\;\;\;\;\mbox{ if $|e(t+1)|<(2\|{\cal S}\|+\delta)\|\phi(t)\|$}\\ \hat{\theta}(t)\\ \;\;\;\;\;\mbox{otherwise;}\end{array}\right.

ρ_{δ} (ϕ (t), e (t + 1)) :=

ρ_{δ} (ϕ (t), e (t + 1)) :=

\left\{\begin{array}[]{ll}1&\;\;\mbox{ if }|e(t+1)|<(2\|{\cal S}\|+\delta)\|\phi(t)\|\\ 0&\;\;\mbox{otherwise, }\end{array}\right.

\left\{\begin{array}[]{ll}1&\;\;\mbox{ if }|e(t+1)|<(2\|{\cal S}\|+\delta)\|\phi(t)\|\\ 0&\;\;\mbox{otherwise, }\end{array}\right.

\overset{ˇ}{θ} (t + 1) = \hat{θ} (t) + ρ_{δ} (ϕ (t), e (t + 1)) \frac{ϕ ( t )}{ϕ ( t ) ^{T} ϕ ( t )} e (t + 1);

\overset{ˇ}{θ} (t + 1) = \hat{θ} (t) + ρ_{δ} (ϕ (t), e (t + 1)) \frac{ϕ ( t )}{ϕ ( t ) ^{T} ϕ ( t )} e (t + 1);

\hat{θ} (t + 1) := π_{S} (\overset{ˇ}{θ} (t + 1)) .

\hat{θ} (t + 1) := π_{S} (\overset{ˇ}{θ} (t + 1)) .

\tilde{θ} (t) := \hat{θ} (t) - θ^{*},

\tilde{θ} (t) := \hat{θ} (t) - θ^{*},

∥ \hat{θ} (t + 1) - \hat{θ} (t) ∥ \leq ρ_{δ} (ϕ (t), e (t + 1)) \frac{∣ e ( t + 1 ) ∣}{∥ ϕ ( t ) ∥}, t \geq t_{0},

∥ \hat{θ} (t + 1) - \hat{θ} (t) ∥ \leq ρ_{δ} (ϕ (t), e (t + 1)) \frac{∣ e ( t + 1 ) ∣}{∥ ϕ ( t ) ∥}, t \geq t_{0},

V (t)

V (t)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Classical Pole Placement Adaptive Control Revisited:

Exponential Stabilization

** Daniel E. Miller111This research was supported by a grant from the Natural Sciences Research Council of Canada. **

**Dept. of Elect. and Comp. Eng. **

**University of Waterloo, Waterloo, ON **

**Canada N2L 3G1 **

([email protected])

Abstract

While the original classical parameter adaptive controllers did not handle noise or unmodelled dynamics well, redesigned versions were proven to have some tolerance; however, exponential stabilization and a bounded gain on the noise was rarely proven. Here we consider a classical pole placement adaptive controller using the original projection algorithm rather than the commonly modified version; we impose the assumption that the plant parameters lie in a convex, compact set. We demonstrate that the closed-loop system exhibits very desireable closed-loop behaviour: there are linear-like convolution bounds on the closed loop behaviour, which confers exponential stability and a bounded noise gain, and can be leveraged to prove tolerance to unmodelled dynamics and plant parameter variation. We emphasize that there is no persistent excitation requirement of any sort; the improved performance arises from the vigilant nature of the parameter estimator.

Keywords: Adaptive control, Projection algorithm, Exponential stability, Bounded gain.

I Introduction

Adaptive control is an approach used to deal with systems with uncertain or time-varying parameters. The classical adaptive controller consists of a linear time-invariant (LTI) compensator together with a tuning mechanism to adjust the compensator parameters to match the plant. The first general proofs that adaptive controllers could work came around 1980, e.g. see [2], [19], [4], [25], and [26]. However, such controllers were typically not robust to unmodelled dynamics, did not tolerate time-variations well, and did not handle noise or disturbances well, e.g. see [27]. During the following two decades a great deal of effort was made to address these shortcomings. The most common approach was to make small controller design changes, such as the use of signal normalization, deadzones, and $\sigma-$ modification, to ameliorate these issues, e.g. see [13], [12], [28], [11], [8]. Indeed, simply using projection (onto a convex set of admissible parameters) has proved quite powerful, and the resulting controllers typically provide a bounded-noise bounded-state property, as well as tolerance of some degree of unmodelled dynamics and/or time-variations, e.g. see [34], [35], [22], [33], [32] and [9]. Of course, it is clearly desireable that the closed-loop system exhibit LTI-like system properties, such as a bounded gain and exponential stability. As far as the author is aware, in the classical approach to adaptive control a bounded gain on the noise 222Since the closed-loop system is nonlinear, a bounded-noise bounded-state property does not automatically imply a bounded gain on the noise. is proven only in [35]; however, a crisp exponential bound on the effect of the initial condition is not provided, and a minimum phase assumption is imposed. While it is possible to prove a form of exponential stability if the reference input is sufficiently persistently exciting, e.g. see [23], this places a stringent requirement on an exogenous input.

There are several non-classical approaches to adaptive control which provide LTI-like system properties. First of all, in [3] and [18] a logic-based switching approach was used to switch between a predefined list of candidate controllers; while exponential stability is proven, the transient behaviour can be quite poor and a bounded gain on the noise is not proven. A more sophisticated logic-based approach, labelled Supervisory Control, was proposed by Morse; here a supervisor switches in an efficient way between candidate controllers - see [20], [21], [6], [30] and [7]. In certain circumstances a bounded gain on the noise can be proven - see [31] and the Concluding Remarks section of [21]. A related approach, called localization-based switching adaptive control, uses a falsification approach to prove exponential stability as well as a degree of tolerance of disturbances, e.g. see [36].

Another non-classical approach, proposed by the author, is based on periodic estimation and control: rather than estimate the plant or controller parameters, the goal is to estimate what the control signal would be if the plant parameters and plant state were known and the ‘optimal controller’ were applied. Exponential stability and a bounded gain on the noise is achieved, as well as near optimal performance, e.g. see [14], [15], and [29]; a degree of unmodelled dynamics and time variations can be allowed. The cost of these desireable features is that the noise gain increases dramatically the closer that one gets to optimality.

In this paper we consider the discrete-time setting and we propose an alternative approach to obtaining LTI-like system properties. We return to a common approach in classical adaptive control - the use of the projection algorithm together with the Certainty Equivalence Principle. In the literature it is the norm to use a modified version of the ideal Projection Algorithm in order to avoid division by zero; 333 An exception is the work of Ydstie [34], [35], who considers the ideal Projection Algorithm as a special case; however, a crisp bound on the effect of the initial condition is not proven and a minimum phase assumption is imposed. it turns out that an unexpected consequence of this minor adjustment is that some inherent properties of the scheme are destroyed. Here we use the original version of the Projection Algorithm coupled with a pole placement Certainty Equivalence based controller. We obtain linear-like convolution bounds on the closed-loop behaviour, which immediately confers exponential stability and a bounded gain on the noise; such convolution bounds are, as far as the author is aware, a first in adaptive control, and it allows us to use a modular approach to analyse robustness and tolerance to time-varying parameters. To this end, the results will be presented in a very pedagogically desireable fashion: we first deal with the ideal plant (with disturbances); we then leverage that result to prove that a large degree of time-variations is tolerated; we then demonstrate that the approach tolerates a degree of unmodelled dynamics, in a way familiar to those versed in the analysis of LTI systems.

In a recent short paper we consider the first order case [16]. Here we consider the general case, which requires much more sophisticated analysis and proofs. Furthermore, in comparison to [16], here we (i) present a more general estimation algorithm, which alleviates the classical concern about dividing by zero, (ii) prove that the controller achieves the objective in the presence of a more general class of time-variations, and (iii) prove robustness to unmodelled dynamics. An early version of this paper has been submitted to a conference [17].

Before proceeding we present some mathematical preliminaries. Let ${\bf Z}$ denote the set of integers, ${\bf Z}^{+}$ the set of non-negative integers, ${\bf N}$ the set of natural numbers, ${\bf R}$ the set of real numbers, and ${\bf R}^{+}$ the set of non-negative real numbers. We let ${{\bf D}}^{0}$ denote the open unit disk of the complex plane. We use the Euclidean $2$ -norm for vectors and the corresponding induced norm for matrices, and denote the norm of a vector or matrix by $\|\cdot\|$ . We let ${l_{\infty}}({\bf R}^{n})$ denote the set of ${\bf R}^{n}$ -valued bounded sequences; we define the norm of $u\in{l_{\infty}}({\bf R}^{n})$ by $\|u\|_{\infty}:=\sup_{k\in{\bf Z}}\|u(k)\|$ . Occasionally we will deal with a map $F:{l_{\infty}}({\bf R}^{n})\rightarrow{l_{\infty}}({\bf R}^{n})$ ; the gain is given by $\sup_{u\neq 0}\frac{\|Fu\|_{\infty}}{\|u\|_{\infty}}$ and denoted by $\|F\|$ . With $T\in{\bf Z}$ , the truncation operator $P_{T}:{l_{\infty}}({\bf R}^{n})\rightarrow{l_{\infty}}({\bf R}^{n})$ is defined by

[TABLE]

We say that the map $F:{l_{\infty}}({\bf R}^{n})\rightarrow{l_{\infty}}({\bf R}^{n})$ is causal if $P_{T}FP_{T}=P_{T}F$ for every $T\in{\bf Z}$ .

If ${\cal S}\subset{\bf R}^{p}$ is a convex and compact set, we define $\|{\cal S}\|:=\max_{x\in{\cal S}}\|x\|$ and the function $\pi_{\cal S}:{\bf R}^{p}\rightarrow{\cal S}$ denotes the projection onto ${\cal S}$ ; it is well-known that $\pi_{\cal S}$ is well-defined.

II The Setup

In this paper we start with an $n^{th}$ order linear time-invariant discrete-time plant given by

[TABLE]

with $y(t)\in{\bf R}$ the measured output, $u(t)\in{\bf R}$ the control input, and $d(t)\in{\bf R}$ the disturbance (or noise) input. We assume that $\theta^{*}$ is unknown but belongs to a known set ${\cal S}\subset{\bf R}^{2n}$ . Associated with this plant model are the polynomials

[TABLE]

and the transfer function $\frac{B(z^{-1})}{A(z^{-1})}$ .

Remark 1

*It is straight-forward to verify that if the system has a disturbance at both the input and output, then it can be converted to a system of the above form. *

We impose an assumption on the set of admissible plant parameters.

Assumption 1: ${\cal S}$ is convex and compact, and for each $\theta^{*}\in{\cal S}$ , the corresponding pair of polynomials $A(z^{-1})$ and $B(z^{-1})$ are coprime.

The convexity part of the above assumption is common in a branch of the adaptive control literature - it is used to facilitate parameter projection, e.g. see [5]. The boundedness part is less common, but it is quite reasonable in practical situations; it is used here to ensure that we can prove uniform bounds and decay rates on the closed-loop behaviour.

The main goal here is to prove a form of stability, with a secondary goal that of asymptotic tracking of an exogenous reference signal $y^{*}(t)$ ; since the plant may be non-minimum phase, there are limits on how well the plant can be made to track $y^{*}(t)$ . To proceed we use a parameter estimator together with an adaptive pole placement control law. At this point, we discuss the most critical aspect - the parameter estimator.

II-A Parameter Estimation

We can write the plant as

[TABLE]

Given an estimate $\hat{\theta}(t)$ of $\theta^{*}$ at time $t$ , we define the prediction error by

[TABLE]

this is a measure of the error in $\hat{\theta}(t)$ . The common way to obtain a new estimate is from the solution of the optimization problem

[TABLE]

yielding the ideal (projection) algorithm

[TABLE]

Of course, if $\phi(t)$ is close to zero, numerical problems can occur, so it is the norm in the literature (e.g. [4] and [5]) to replace this by the following classical algorithm: with $0<\alpha<2$ and $\beta>0$ , define

[TABLE]

This latter algorithm is widely used, and plays a role in many discrete-time adaptive control algorithms; however, when this algorithm is used, all of the results are asymptotic, and exponential stability and a bounded gain on the noise are never proven. It is not hard to guess why - a careful look at the estimator shows that the gain on the update law is small if $\phi(t)$ is small. A more mathematically detailed argument is given in the following example.

Remark 2

Consider the simple first order plant

[TABLE]

with $a_{1}\in[-2,-1]$ and $b_{1}\in[1,2]$ . For simplicity, we assume that in the estimator (16) we have $\alpha=\beta=1$ , and, as in [34], [35], [22], [33], [32] and [9], we use projection to keep the parameters estimates inside ${\cal S}$ so as to guarantee a bounded-input bounded-state property. Further suppose $y^{*}=d=0$ , and that a classical pole placement adaptive controller places the closed-loop pole at zero: $u(t)={\frac{\hat{a}_{1}(t)}{\hat{b}_{1}(t)}}y(t)=:\hat{f}(t)y(t)$ . Suppose that

[TABLE]

so that $\hat{f}(0)=-0.5$ and $-a_{1}+b_{1}\hat{f}(0)=1.5$ , i.e. the system is initially unstable. An easy calculation verifies that $\hat{f}(t)\in[-2,-0.5]$ and $-a_{1}+b_{1}\hat{f}(t)\in[0,1.5]$ for $t\geq 0$ , which leads to a crude bound on the closed loop behaviour: $|y(t)|\leq(1.5)^{t}{\varepsilon}$ for $t\geq 0$ . With $N({\varepsilon}):=\mbox{int}[\frac{1}{2\ln(1.5)}\ln(\frac{1}{{\varepsilon}})]$ , it follows that

[TABLE]

A careful examination of the parameter estimator shows that

[TABLE]

From the form of $\hat{f}(t)$ , it follows that for small ${\varepsilon}$ we have $|-a+b_{1}\hat{f}(t)|\geq 1.25$ for $t\in[0,N({\varepsilon})]$ , in which case

[TABLE]

since $N({\varepsilon})\rightarrow\infty$ as ${\varepsilon}\rightarrow 0$ , we see that exponential stability is unachievable. A similar kind of analysis can be used to prove that a bounded gain on the noise is not achievable either.

Now we return to the problem as hand - analysing the ideal algorithm (15). We will be using the ideal algorithm with projection to ensure that the estimate remains in ${\cal S}$ for all time. With an initial condition of $\hat{\theta}(t_{0})=\theta_{0}\in{\cal S}$ , for $t\geq t_{0}$ we set

[TABLE]

which we then project onto ${\cal S}$ :

[TABLE]

Because of the closed and convex property of ${\cal S}$ , the projection function is well-defined; furthermore, it has the nice property that, for every $\theta\in{\bf R}^{2n}$ and every $\theta^{*}\in{\cal S}$ , we have

[TABLE]

i.e. projecting $\theta$ onto ${\cal S}$ never makes it further away from the quantity $\theta^{*}$ .

II-B Revised Parameter Estimation

Some readers may be concerned that the original problem of dividing by a number close to zero, which motivates the use of classical algorithm, remains. Of course, this is balanced against the soon-to-be-proved benefit of using (17)-(18). We propose a middle ground as follows. A straight-forward analysis of $e(t+1)$ reveals that

[TABLE]

which means that

[TABLE]

Therefore, if

[TABLE]

then the update to $\hat{\theta}(t)$ will be greater than $2\|{\cal S}\|$ , which means that there is little information content in $e(t+1)$ - it is dominated by the disturbance. With this as motivation, and with $\delta\in(0,\infty]$ , let us replace (17) with

[TABLE]

in the case of $\delta=\infty$ , we will adopt the understanding that $\infty\times 0=0$ , in which case the above formula collapses into the original one (17). In the case that $\delta<\infty$ , we can be assured that the update term is bounded above by $2\|{\cal S}\|+\delta$ , which should alleviate concern about having infinite gain. We would now like to rewrite the update to make it more concise. To this end, we now define ${\rho_{\delta}}:{\bf R}^{2n}\times{\bf R}\rightarrow\{0,1\}$ by

[TABLE]

yielding a more concise way to write the estimation algorithm update:

[TABLE]

once again, we project this onto ${\cal S}$ :

[TABLE]

II-C Properties of the Estimation Algorithm

Analysing the closed-loop system will require a careful analysis of the estimation algorithm. We define the parameter estimation error by

[TABLE]

and the corresponding Lyapunov function associated with $\tilde{\theta}(t)$ , namely $V(t):=\tilde{\theta}(t)^{T}\tilde{\theta}(t)$ . In the following result we list a property of $V(t)$ ; it is a generalization of what is well-known for the classical algorithm (16).

Proposition 1

For every $t_{0}\in{\bf Z}$ , $\phi_{0}\in{\bf R}^{2n}$ , ${\theta}_{0}\in{\cal S}$ , $\theta^{*}\in{\cal S}$ , $d\in{l_{\infty}}$ , and $\delta\in(0,\infty]$ , when the estimator (20) and (21) is applied to the plant (14), the following holds:

$\|\hat{\theta}(t+1)-\hat{\theta}(t)\|\leq{\rho_{\delta}(\phi(t),e(t+1))}\frac{|e(t+1)|}{\|\phi(t)\|},\;t\geq t_{0},$

(22)

$\displaystyle V(t)$ $\displaystyle\leq$ $\displaystyle V(t_{0})+\sum_{j=t_{0}}^{t-1}{\rho_{\delta}(\phi(j),e(j+1))}\times$

$\;\;\;[-\frac{1}{2}\frac{[e(j+1)]^{2}}{\|\phi(j)\|^{2}}+2\frac{[d(j)]^{2}}{\|\phi(j)\|^{2}}],\;\;t\geq t_{0}+1.$

Proof: See the Appendix. $\Box$

II-D The Control Law

The elements of $\hat{\theta}(t)$ are partitioned in a natural way as

[TABLE]

Associated with $\hat{\theta}(t)$ are the polynomials

[TABLE]

While we can use an $n-1^{th}$ order proper controller to carry out pole placement, it will be convenient to follow the lead of [33] and use an $n^{th}$ order strictly proper controller. In particular, we first choose a $2n^{th}$ order monic polynomial

[TABLE]

so that $z^{2n}A^{*}(z^{-1})$ has all of its zeros in ${{\bf D}}^{o}$ . Next, we choose two polynomial

[TABLE]

and

[TABLE]

which satisfy the equation

[TABLE]

given the assumption that the $\hat{A}(t,z^{-1})$ and $\hat{B}(t,z^{-1})$ are coprime, it is well known that there exist unique $\hat{L}(t,z^{-1})$ and $\hat{P}(t,z^{-1})$ which satisfy this equation. Indeed, it is easy to prove that the coefficients of $\hat{L}(t,z^{-1})$ and $\hat{P}(t,z^{-1})$ are analytic functions of $\hat{\theta}(t)\in{\cal S}$ .

In our setup we have an exogenous signal $y^{*}(t)$ . At time $t$ we choose $u(t)$ so that

[TABLE]

So the overall controller consists of the estimator (20)-(21) together with (24).555We also implicitly use a pole placement procedure to obtain the controller parameters from the plant parameter estimates; this entails solving a linear equation.

It turns out that we can write down a state-space model of our closed-loop system with $\phi(t)\in{\bf R}^{2n}$ as the state. Only two elements of $\phi$ have a complicated description:

[TABLE]

With $e_{i}\in{\bf R}^{2n}$ the $i^{th}$ normal vector, if we now define

[TABLE]

then the following key equation holds:

[TABLE]

notice that the characteristic equation of $\bar{A}(t)$ always equals $z^{2n}A^{*}(z^{-1})$ . Before proceeding, define

[TABLE]

III Preliminary Analysis

The closed-loop system given in (26) arises in classical adaptive control approaches in slightly modified fashion, so we will borrow some tools from there. More specifically, the following result was proven by Kreisselmeir [10], in the context of proving that a slowly time-varying adaptive control system is stable (in a weak sense); we are providing a special case of his technical lemma to minimize complexity.666Furthermore, in [10] it is assumed that $\alpha_{i}$ and $\beta_{i}$ are strictly greater than zero, but it is trivial to extend this to allow for zero as well.

Proposition 2

[10]** Consider the discrete-time system

$x(t+1)=[A_{nom}(t)+\Delta(t)]x(t)$

with $\Phi(t,\tau)$ denoting the corresponding state transition matrix. Suppose that there exist constants $\sigma\in(0,1)$ , $\gamma_{1}>1$ , $\alpha_{i}\geq 0$ , and $\beta_{i}\geq 0$ so that

(i) for all $t\geq t_{0}$ , we have $\|A_{nom}(t)^{i}\|\leq\gamma_{1}\sigma^{i},\;i\geq 0$ ;

(ii) for all $t>\tau$ we have

$\sum_{i=\tau}^{t-1}\|A_{nom}(i+1)-A_{nom}(i)\|\leq$

$\;\;\;\;\;\alpha_{0}+\alpha_{1}(t-\tau)^{1/2}+\alpha_{2}(t-\tau)$

and $\sum_{i=\tau}^{t-1}\|\Delta(i)\|\leq\beta_{0}+\beta_{1}(t-\tau)^{1/2}+\beta_{2}(t-\tau)$ ;

(iii) there exists a $\mu\in(\sigma,1)$ and $N\in{\bf N}$ satisfying $\alpha_{2}+\frac{\beta_{2}}{N}<\frac{1}{N\gamma_{1}}(\frac{\mu}{\gamma_{1}^{1/N}}-\sigma)$ .

Then there exists a constant $\gamma_{2}$ so that the transition matrix satisfies

$\|\Phi(t,\tau)\|\leq\gamma_{2}\mu^{t-\tau},\;t\geq\tau.$

Remark 3

We apply the above proposition in the following way. Suppose that $\sigma\in(0,1)$ , $\gamma_{1}>1$ , $\alpha_{i}\geq 0$ , $\beta_{i}\geq 0$ are such conditions (i) and (ii) hold. If $\mu\in(\sigma,1)$ , then it follows that $\frac{\mu}{\gamma_{1}^{1/N}}-\sigma>0$ for large enough $N\in{\bf N}$ , so condition (iii) will hold as well as long as $\alpha_{2}$ and $\beta_{2}$ are small enough.

In applying Proposition 2, the matrix $\bar{A}(t)$ will play the role of $A_{nom}(t)$ . A key requirement is that Condition (i) holds: the following provides relevant bounds. Before proceeding, let

[TABLE]

Lemma 1

For every $\delta\in(0,\infty]$ and $\sigma\in(\underline{\lambda},1)$ there exists a constant $\gamma\geq 1$ so that for every $t_{0}\in{\bf Z}$ , $\theta_{0}\in{\cal S}$ , $\hat{\theta}^{*}\in{\cal S}$ , and $y^{*},d\in{l_{\infty}}$ , when the controller (20), (21) and (24) is applied to the plant (14), the matrix $\bar{A}(t)$ satisfies, for every $t\geq t_{0}$ :

$\|\bar{A}(t)^{k}\|\leq\gamma\sigma^{k},\;k\geq 0,$

and for every $t>k\geq t_{0}$ :

$\sum_{j=k}^{t-1}\|\bar{A}(j+1)-\bar{A}(j)\|\leq\gamma\times$

$[\sum_{j=k}^{t-1}{\rho_{\delta}(\phi(j),e(j+1))}\frac{e(j+1)^{2}}{\|\phi(j)\|^{2}}]^{1/2}(t-k)^{1/2}.$

Proof: See the Appendix. $\Box$

IV The Main Result

Theorem 1

For every $\delta\in(0,\infty]$ and $\lambda\in(\underline{\lambda},1)$ there exists a $c>0$ so that for every $t_{0}\in{\bf Z}$ , $\theta_{0}\in{\cal S}$ , ${\theta}^{*}\in{\cal S}$ , $\phi_{0}\in{\bf R}^{2n}$ , and $y^{*},d\in\ell_{\infty}$ , when the adaptive controller (20), (21) and (24) is applied to the plant (14), the following bound holds:

$\|\phi(k)\|\leq c\lambda^{k-t_{0}}\|\phi_{0}\|+$

$\sum_{j=t_{0}}^{k-1}c\lambda^{k-1-j}{(|r(j)|+|d(j)|)},\;\;k\geq t_{0}.$

(27)

Remark 4

We see from (25) that $r(t)$ is a weighted sum of $\{y^{*}(t),...,y^{*}(t-n+1)\}$ . Hence, there exists a constant $\bar{c}$ so that the bound (27) can be rewritten as

[TABLE]

Remark 5

Theorem 1 implies that the system has a bounded gain (from $d$ and $r$ to $y$ ) in every $p-$ norm. More specifically, for $p=\infty$ we see immediately from (27) that

[TABLE]

Furthermore, for $1\leq p<\infty$ it follows from Young’s Inequality applied to (27) that

[TABLE]

Remark 6

Most pole placement adaptive controllers are proven to yield a weak form of stability, such as boundedness (in the presence of a non-zero disturbance) or asymptotic stability (in the case of a zero disturbance), which means that details surrounding initial conditions can be ignored. Here the goal is to prove a stronger, linear-like, convolution bound, so it requires more detailed analysis.

Remark 7

With $\hat{G}(t,z^{-1})=\sum_{i=1}^{2n}\hat{g}_{i}(t)z^{-i}:=\hat{B}(t,z^{-1})\hat{P}(t,z^{-1})$ it is possible to use arguments like those in [5] to prove, when the disturbance $d$ is identically zero, a weak tracking result of the form

[TABLE]

Since the main goal of the paper is on stability issues, we omit the proof. However, we do discuss step tracking in a later section.

Proof: Fix $\delta\in(0,\infty]$ and $\lambda\in(\underline{\lambda},1)$ . Let $t_{0}\in{\bf Z}$ , $\theta_{0}\in{\cal S}$ , $\theta^{*}\in{\cal S}$ , $\phi_{0}\in{\bf R}^{2n}$ , and $y^{*},d\in{l_{\infty}}$ be arbitrary. Define $r$ via (25). Now choose $\lambda_{1}\in(\underline{\lambda},\lambda)$ .

We have to be careful in how to apply Proposition 2 to (26) - we need the $\Delta(t)$ term to be something which we can bound using Proposition 1. So define

[TABLE]

it is easy to check that

[TABLE]

and that

[TABLE]

which is a term which plays a key role in Proposition 1. We can now rewrite (26) as

[TABLE]

If ${\rho_{\delta}(\phi(t),e(t+1))}=1$ then $\eta(t)=0$ , but if ${\rho_{\delta}(\phi(t),e(t+1))}=0$ then

[TABLE]

but we also know that

[TABLE]

combining these equations we have

[TABLE]

which implies that $\|\phi(t)\|\leq\frac{1}{\delta}|d(t)|$ ; it is easy to check that this holds even when $\delta=\infty$ . Using (30) we conclude that

[TABLE]

We now analyse (29). We let $\Phi(t,\tau)$ denote the transition matrix associated with $\bar{A}(t)+\Delta(t)$ ; this matrix clearly implicitly depends on $\theta_{0}$ , $\theta^{*}$ , $d$ and $r$ . From Lemma 1 there exists a constant $\gamma_{1}$ so that

[TABLE]

and for every $t>k\geq t_{0}$ , we have

[TABLE]

Using the definition of $\Delta$ given in (28) and the Cauchy-Schwarz inequality we also have

[TABLE]

At this point we consider two cases: the easier case in which there is no noise, and the harder case in which there is noise.

Case 1: $d(t)=0$ , $t\geq t_{0}$ .

Using the bound on $\eta(t)$ given in (31), in this case (29) becomes

[TABLE]

The bound on $V(t)$ given by Proposition 1 simplifies to

[TABLE]

Since $V(\cdot)\geq 0$ and $V(t_{0})=\|\theta_{0}-\theta^{*}\|^{2}\leq 4\|{\cal S}\|^{2}$ , this means that

[TABLE]

Hence, from (33) and (34) we conclude that

[TABLE]

Now we apply Proposition 2: we set

[TABLE]

Following Remark 3 it is now trivial to choose $N\in{\bf N}$ so that $\frac{\lambda}{\gamma_{1}^{1/N}}-\lambda_{1}>0$ , namely

[TABLE]

which means that

[TABLE]

From Proposition 2 we see that there exists a constant $\gamma_{2}$ so that the state transition matrix $\Phi(t,\tau)$ corresponding to $\bar{A}(t)+\Delta(t)$ satisfies

[TABLE]

If we now apply this to (35), we end up with the desired bound:

[TABLE]

Case 2: $d(t)\neq 0$ for some $t\geq t_{0}$ .

This case is much more involved since noise can radically affect parameter estimation. Indeed, even if the parameter estimate is quite accurate at a point in time, the introduction of a large noise signal (large relative to the size of $\phi(t)$ ) can create a highly inaccurate parameter estimate. To proceed we partition the timeline into two parts: one in which the noise is small versus $\phi$ and one where it is not; the actual choice of the line of division will become clear as the proof progresses. To this end, with ${\varepsilon}>0$ to be chosen shortly, partition $\{j\in{\bf Z}:j\geq t_{0}\}$ into two sets:

[TABLE]

clearly $\{j\in{\bf Z}:\;\;j\geq t_{0}\}=S_{good}\cup S_{bad}$ . Observe that this partition clearly depends on $\theta_{0}$ , $\theta^{*}$ , $\phi_{0}$ , $d$ and $r/y^{*}$ . We will apply Proposition 2 to analyse the closed-loop system behaviour on $S_{good}$ ; on the other hand, we will easily obtain bounds on the system behaviour on $S_{bad}$ . Before doing so, we partition the time index $\{j\in{\bf Z}:j\geq t_{0}\}$ into intervals which oscillate between $S_{good}$ and $S_{bad}$ . To this end, it is easy to see that we can define a (possibly infinite) sequence of intervals of the form $[k_{i},k_{i+1})$ satisfing:

(i) $k_{1}=t_{0}$ , and

(ii) $[k_{i},k_{i+1})$ either belongs to $S_{good}$ or $S_{bad}$ , and

(iii) if $k_{i+1}\neq\infty$ and $[k_{i},k_{i+1})$ belongs to $S_{good}$ (respectively, $S_{bad}$ ), then the interval $[k_{i+1},k_{i+2})$ must belong to $S_{bad}$ (respectively, $S_{good}$ ).

Now we turn to analysing the behaviour during each interval.

Sub-Case 2.1: $[k_{i},k_{i+1})$ lies in $S_{bad}$ .

Let $j\in[k_{i},k_{i+1})$ be arbitrary. In this case either $\phi(j)=0$ or $\frac{[d(j)]^{2}}{\|\phi(j)\|^{2}}\geq{\varepsilon}$ holds. In either case we have

[TABLE]

From (26) and (30) we see that

[TABLE]

If we combine this with (37) we conclude that

[TABLE]

Sub-Case 2.2: $[k_{i},k_{i+1})$ lies in $S_{good}$ .

Let $j\in[k_{i},k_{i+1})$ be arbitrary. In this case $\phi(j)\neq 0$ and

[TABLE]

which implies that

[TABLE]

From Proposition 1 we have that

[TABLE]

using (40) and the fact that $0\leq V(\cdot)\leq 4\|{\cal S}\|^{2}$ , we obtain

[TABLE]

Hence, using this in (33) and (34) yields

[TABLE]

as well as

[TABLE]

Now we will apply Proposition 2: we set

[TABLE]

With $N$ chosen as in Case 1 via (36), we have that $\underline{\delta}:=\frac{\lambda}{\gamma_{1}^{1/N}}-\lambda_{1}>0$ ; we need

[TABLE]

which will certainly be the case if we set ${\varepsilon}:=\frac{\underline{\delta}^{2}}{8\gamma_{1}^{2}(\gamma_{1}N+1)^{2}}$ . From Proposition 2 we see that there exists a constant $\gamma_{4}$ so that the state transition matrix $\Phi(t,\tau)$ corresponding to $\bar{A}(t)+\Delta(t)$ satisfies

[TABLE]

If we now apply this to (29) and use (31) to provide a bound on $\eta(t)$ , we end up with

[TABLE]

This completes Sub-Case 2.2.

Now we combine Sub-Case 2.1 and Sub-Case 2.2 into a general bound on $\phi(t)$ . Define

[TABLE]

It remains to prove

Claim: The following bound holds:

[TABLE]

Proof of the Claim:

If $[k_{1},k_{2})=[t_{0},k_{2})\subset S_{good}$ , then (42) holds for $k\in[t_{0},k_{2}]$ by (41). If $[t_{0},k_{2})\subset S_{bad}$ , then from (39) we obtain

[TABLE]

which means that (42) holds for $k\in[t_{0},k_{2}]$ for this case as well.

We now use induction - suppose that (42) holds for $k\in[k_{1},k_{i}]$ ; we need to prove that it holds for $k\in(k_{i},k_{i+1}]$ as well. If $[k_{i},k_{i+1})\subset S_{bad}$ then from (39) we have

[TABLE]

which means that (42) holds for $k\in(k_{i},k_{i+1}]$ . On the other hand, if $[k_{i},k_{i+1})\subset S_{good}$ , then $k_{i}-1\in S_{bad}$ ; from (39) we have that

[TABLE]

Using (41) to analyse the behaviour on $[k_{i},k_{i+1}]$ , we have

[TABLE]

as desired. $\Box$

This completes the proof.

$\Box$

V Tolerance to Time-Variations

The linear-like bound proven in Theorem 1 can be leveraged to prove that the same behaviour will result even in the presence of slow time-variations with occasional jumps. So suppose that the actual plant model is

[TABLE]

with $\theta^{*}(t)\in{\cal S}$ for all $t\in{\bf R}$ . We adopt a common model of acceptable time-variations used in adaptive control: with $c_{0}\geq 0$ and ${\varepsilon}>0$ , we let $s({\cal S},c_{0},{\varepsilon})$ denote the subset of ${l_{\infty}}({\bf R}^{2n})$ whose elements $\theta^{*}$ satisfy $\theta^{*}(t)\in{\cal S}$ for every $t\in{\bf Z}$ as well as

[TABLE]

for every $t_{1}\in{\bf Z}$ . We will now show that, for every $c_{0}\geq 0$ , the approach tolerates time-varying parameters in $s({\cal S},c_{0},{\varepsilon})$ if ${\varepsilon}$ is small enough.

Theorem 2

For every $\delta\in(0,\infty]$ , $\lambda_{1}\in(\underline{\lambda},1)$ and $c_{0}\geq 0$ , there exists a $c_{1}>0$ and ${\varepsilon}>0$ so that for every $t_{0}\in{\bf Z}$ , $\theta_{0}\in{\cal S}$ , $\theta^{*}\in s({\cal S},c_{0},{\varepsilon})$ , $\phi_{0}\in{\bf R}^{2n}$ , and $y^{*},d\in\ell_{\infty}$ , when the adaptive controller (20), (21) and (24) is applied to the time-varying plant (43), the following holds:

$\|\phi(k)\|\leq c_{1}\lambda_{1}^{k-t_{0}}\|\phi_{0}\|+\sum_{j=t_{0}}^{k-1}c_{1}\lambda_{1}^{k-1-j}{(|r(j)|+|d(j)|)},\;\;$

$\;\;\;\;\;\;\;\;k\geq t_{0}.$

Proof:

Fix $\delta\in(0,\infty]$ , $\lambda_{1}\in(\underline{\lambda},1)$ , $\lambda\in(\underline{\lambda},\lambda_{1})$ and $c_{0}>0$ . Let $t_{0}\in{\bf Z}$ , $\theta_{0}\in{\cal S}$ , $\phi_{0}\in{\bf R}^{2n}$ , and $y^{*},d\in\ell_{\infty}$ be arbitrary. With $m\in{\bf N}$ , we will consider $\phi(t)$ on intervals of the form $[t_{0}+im,t_{0}+(i+1)m]$ ; we will be analysing these intervals in groups of $m$ (to be chosen shortly); we set ${\varepsilon}=\frac{c_{0}}{m^{2}}$ , and let $\theta^{*}\in s({\cal S},c_{0},{\varepsilon})$ be arbitrary.

First of all, for $i\in{\bf Z}^{+}$ we can rewrite the plant equation as

[TABLE]

Theorem 1 applied to (45) says that there exists a constant $c>0$ so that

[TABLE]

The above is a difference inequality associated with a first order system; using this observation together with the fact that $c\geq 1$ , we see that if we define

[TABLE]

with $\psi(t_{0}+im)=\|\phi(t_{0}+im)\|$ , then

[TABLE]

Now we analyse this equation for $i=0,1,...,m-1$ .

Case 1: $|\tilde{n}(t)|\leq\frac{1}{2c}(\lambda_{1}-\lambda)\|\phi(t)\|$ for all $t\in[t_{0}+im,t_{0}+(i+1)m]$ .

In this case

[TABLE]

which means that

[TABLE]

This, in turn, implies that

[TABLE]

Case 2: $|\tilde{n}(t)|>\frac{1}{2c}(\lambda_{1}-\lambda)\|\phi(t)\|$ for some $t\in[t_{0}+im,t_{0}+(i+1)m]$ .

Since $\theta^{*}(t)\in{\cal S}$ for $t\geq t_{0}$ , we see

[TABLE]

This means that

[TABLE]

which means that

[TABLE]

This, in turn, implies that

[TABLE]

On the interval $[t_{0},t_{0}+m^{2}]$ there are $m$ sub-intervals of length $m$ ; furthermore, because of the choice of ${\varepsilon}$ we have that

[TABLE]

A simple calculation reveals that there are at most $N_{1}:=\frac{4c_{0}c}{\lambda_{1}-\lambda}$ sub-intervals which fall into the categorory of Case 2, with the remaining number falling into the category of Case 1. Henceforth we assume that $m>N_{1}$ . If we use (46) and (47) to analyse the behaviour of the closed-loop system on the interval $[t_{0},t_{0}+m^{2}]$ , we end up with a crude bound of

[TABLE]

At this point we would like to choose $m$ so that

[TABLE]

notice that $\frac{2\lambda_{1}}{\lambda_{1}+\lambda}>1$ , so if we take the log of both sides, we see that we need

[TABLE]

which will clearly be the case for large enough $m$ , so at this point we choose such an $m$ . It follows from (48) that there exists a constant $\gamma_{2}$ so that

[TABLE]

Indeed, by time-invariance of the closed-loop system we see that

[TABLE]

Solving iteratively yields

[TABLE]

We now combine this bound with the bounds which hold on the good intervals (46) and the bad intervals (47), and conclude that there exists a constant $\gamma_{3}$ so that

[TABLE]

as desired. $\Box$

VI Tolerance to Unmodelled Dynamics

Due to the linear-like bounds proven in Theorems 1 and 2, we can use the Small Gain Theorem to good effect to prove the tolerance of the closed-loop system to unmodelled dynamics. However, since the controller, and therefore the closed-loop system, is nonlinear, handling initial conditions is more subtle: in the linear-time invariant case we can separate out the effect of initial conditions from that of the forcing functions ( $r$ and $d$ ), but in our situation they are inter-twined. We proceed by looking at two cases - with and without initial conditions. In all of the cases we consider the time-varying plant (43) with $d_{\Delta}(t)$ added to represent the effect of unmodelled dynamics:

[TABLE]

To proceed, fix $\delta\in(0,\infty]$ , $\lambda_{1}\in(\underline{\lambda},1)$ and $c_{0}\geq 0$ ; from Theorem 2 there exists a $c_{1}>0$ and ${\varepsilon}>0$ so that for every $t_{0}\in{\bf Z}$ , $\phi_{0}\in{\bf R}^{2n}$ , $\theta_{0}\in{\cal S}$ , $y^{*},d\in\ell_{\infty}$ , and $\theta^{*}\in s({\cal S},c_{0},{\varepsilon})$ , when the adaptive controller (20), (21) and (24) is applied to the time-varying plant (50), the following bound holds:

[TABLE]

VI-A Zero Initial Conditions

In this case we assume that $\phi(t)=0$ for $t\leq t_{0}$ ; we derive a bound on the closed-loop system behavour in the presence of unmodelled dynamics. Suppose that the unmodelled dynamics is of the form $d_{\Delta}(t)=(\Delta\phi)(t)$ with $\Delta:{l_{\infty}}({\bf R}^{2n})\rightarrow{l_{\infty}}({\bf R}^{2n})$ a (possibly nonlinear time-varying) causal map with a finite gain of $\|\Delta\|$ . It is easy to prove that if $\|\Delta\|<\frac{1-\lambda_{1}}{c_{1}}$ , then

[TABLE]

i.e., a form of closed-loop stability is attained. Following the approach of Remark 5, we could also analyse the closed-loop system using $l_{p}$ -norms with $1\leq p<\infty$ .

VI-B Non-Zero Initial Conditions

Now we allow unmodelled LTI dynamics with non-zero initial conditions, and we develop convolution-like bounds on the closed-loop system. To this end suppose that the unmodelled dynamics are of the form

[TABLE]

with $\Delta_{j}\in{\bf R}^{1\times 2n}$ ; the corresponding transfer function is $\Delta(z^{-1}):=\sum_{j=0}^{\infty}\Delta_{j}z^{-j}$ . It is easy to see that this model subsumes the classical additive uncertainty, multiplicative uncertainty, and uncertainty in a coprime factorization, which is common in the robust control literature, e.g. see [37], with the only constraint being that the perturbations correspond to strictly causal terms. In order to obtain linear-like bounds on the closed-loop behaviour, we need to impose more constraints on $\Delta(z)$ than in the previous sub-section: after all, if $\Delta(z^{-1})=\Delta_{p}z^{-p}$ , it is clear that $\|\Delta\|=\|\Delta_{p}\|$ for all $p$ , but the effect on the closed-loop system varies greatly - a large value of $p$ allows the behaviour in the far past to affect the present. To this end, with $\mu>0$ and $\beta\in(0,1)$ , we shall restrict $\Delta(z^{-1})$ to a set of the form

[TABLE]

It is easy to see that every transfer function in ${\cal B}(\mu,\beta)$ is analytic in $\{z\in{\bf C}:|z|>\beta\}$ , so it has no poles in that region.

Now we fix $\mu>0$ and $\beta\in(0,1)$ and let $\Delta(z^{-1})$ belong to ${\cal B}(\mu,\beta)$ ; the goal is to analyse the closed-loop behaviour of (50) for $t\geq t_{0}$ when $d_{\Delta}$ is given by (52). We first partition $d_{\Delta}(t)$ into two parts - that which depends on $\phi(t)$ for $t\geq t_{0}$ and that which depends on $\phi(t)$ for $t<t_{0}$ :

[TABLE]

It is clear that

[TABLE]

If $\phi(t)$ is bounded on $\{t\in{\bf Z}:t<t_{0}\}$ then $\sum_{j=1}^{\infty}\beta^{j}\|\phi(t_{0}-j)\|$ is finite, in which case we see that $d_{\Delta}^{-}(t)$ goes to zero exponentially fast; henceforth, we make the reasonable assumption that this is the case. It turns out that we can easily bound $d_{\Delta}(t)$ with a difference equation. To this end, consider

[TABLE]

with ${m}(t_{0})=m_{0}:=\sum_{j=1}^{\infty}\beta^{j}\|\phi(t_{0}-j)\|$ ; it is straight-forward to prove that

[TABLE]

This model of unmodelled dynamics is similar to that used in the adaptive control literature, e.g. see [11].

Theorem 3

*For every $\beta\in(0,1)$ and $\lambda_{2}\in(\max\{\lambda_{1},\beta\},1)$ , there exist $\bar{\mu}>0$ and $c_{2}>0$ so that for every $t_{0}\in{\bf Z}$ , $\phi_{0}\in{\bf R}^{2n}$ , $m_{0}\in{\bf R}$ , $\theta_{0}\in{\cal S}$ , $y^{*},d\in{l_{\infty}}$ , $\theta^{*}\in s({\cal S},c_{0},{\varepsilon})$ and $\mu\in(0,\bar{\mu})$ , when the adaptive controller (20), (21) and (24) is applied to the time-varying plant (50) with $d_{\Delta}$ satisfying (53) and (54), the following bound holds: *

$\|\phi(k)\|\leq c_{2}\lambda_{2}^{k-t_{0}}(\|\phi_{0}\|+|m_{0}|)+$

$\;\;\;\sum_{j=t_{0}}^{k-1}c_{2}\lambda_{2}^{k-1-j}(|d(j)|+|r(j)|),\;\;k\geq t_{0}.$

Proof:

Fix $\beta\in(0,1)$ and $\lambda_{2}\in(\max\{\lambda_{1},\beta\},1)$ . The first step is to convert difference inequalities to difference equations. To this end, consider the difference equation

[TABLE]

together with the difference equation based on (53):

[TABLE]

It is easy to use induction together with (51), (53), and (54) to prove that

[TABLE]

If we combine the difference equations (55) with (56), we end up with

[TABLE]

Now we see that $A_{cl}(\mu)\rightarrow\left[\begin{array}[]{cc}\lambda_{1}&0\\ \beta&\beta\end{array}\right]$ as $\mu\rightarrow 0$ , and this matrix has eigenvalues of $\{\lambda_{1},\beta\}$ . Now choose $\bar{\mu}>0$ so that all eigenvalues are less than $(\frac{\lambda_{2}}{2}+\frac{1}{2}\max\{\lambda_{1},\beta\})$ in magnitude for $\mu\in(0,\bar{\mu}]$ , and define ${\varepsilon}:=\frac{\lambda_{2}}{2}-\frac{1}{2}\max\{\lambda_{1},\beta\}$ . Using the proof technique of Desoer in [1], we can conclude that for $\mu\in(0,\bar{\mu}]$ , we have

[TABLE]

if we use this in (58) and then apply the bounds in (57), it follows that

[TABLE]

as desired. $\Box$

VII Step Tracking

If the plant is non-minimum phase, it is not possible track an arbitrary bounded reference signal using a bounded control signal. However, as long as the plant does not have a zero at $z=1$ , it is possible to modify the controller design procedure to achieve asymptotic step tracking if there is no noise/disturbance. So at this point assume that the corresponding plant polynomial $B(z^{-1})$ has no zero at $z=1$ for any plant model $\theta^{*}\in{\cal S}$ . To proceed, we use the standard trick from the literature, e.g. see [5]: we still estimate $A(z^{-1})$ and $B(z^{-1})$ as before, but we now design the control law slightly differently. To this end, we first define

[TABLE]

and then let $A^{*}(z^{-1})$ be a $2(n+1)^{th}$ monic polynomial (rather than a $2n^{th}$ one) of the form

[TABLE]

so that $z^{2(n+1)}A^{*}(z^{-1})$ has all of its zeros in ${{\bf D}}^{o}$ . Next, we choose two polynomial

[TABLE]

and

[TABLE]

which satisfy the equation

[TABLE]

since $\tilde{A}(t,z^{-1})$ and $\hat{B}(t,z^{-1})$ are coprime, there exist unique $\tilde{L}(t,z^{-1})$ and $\hat{P}(t,z^{-1})$ which satisfy this equation. We now define

[TABLE]

at time $t$ we choose $u(t)$ so that

[TABLE]

We can use a modified version of the argument used in the proof of Theorem 1 to conclude that a similar type of result holds here; we can also prove that asymptotic step tracking will be attained if the noise is zero and the reference signal $y^{*}$ is constant. The details are omitted due to space considerations.

VIII A Simulation Example

Here we provide an example to illustrate the benefit of the proposed adaptive controller. To this end, consider the second order plant

[TABLE]

with $a_{1}(t)\in[0,2]$ , $a_{2}(t)\in[1,3]$ , $b_{1}(t)\in[0,1]$ , and $b_{2}(t)\in[-5,-2]$ . So every admissible model is unstable and non-minimum phase, which makes this a challenging plant to control. We set $\delta=\infty$ .

VIII-A Stability

In this sub-section we consider the problem of stability only - we set $y^{*}=0$ . First we compare the ideal algorithm (17)-(18) (with projection onto ${\cal S}$ ) with the classical one (16) (suitably modified to have projection onto ${\cal S}$ ); in both cases we couple the estimator with the adaptive pole placement controller (24) where we place all closed-loop poles at zero. In the case of the classical estimator (16) we arbitrarily set $\alpha=\beta=1$ . Suppose that the actual value of $(a_{1},a_{2},b_{1},b_{2})$ is $(2,3,1,-2)$ and the initial estimate is set to the midpoint of the interval. In the first simulation we set $y(0)=y(-1)=0.01$ and $u(-1)=0$ and set the noise $d(t)$ to zero - see the top plot of Figure 1. In the second simulation we set $y(0)=y(-1)=u(-1)=0$ and the noise to $d(t)=0.01*\sin(5t)$ - see the bottom plot of Figure 1. In both cases the controller based on the ideal algorithm (17)-(18) is clearly superior to the one based on the revised classical algorithm (16).

Now we further examine the case of the proposed controller when it is applied to the time-varying plant with unmodelled dynamics, a zero initial condition, and a non-zero noise. More specifically, we set

[TABLE]

For the unmodelled part of the plant we use a term of the form discussed in Section VI.B:

[TABLE]

We plot the result in Figure 2; we see that the parameter estimator approximately follows the system parameters, and the effect of the noise is small on average, even in the presence of unmodelled dynamics.

VIII-B Step Tracking

The plant in the previous sub-section has a large amount of uncertainty, as well as a wide range of unstable poles and non-minimum phase zeros, which means that there are limits on the quality of the transient behaviour even if the parameters were fixed and known. Hence, to illustrate the tracking ability we look at a sub-class of systems: one with $a_{1}$ and $b_{1}$ as before, namely $a_{1}(t)\in[0,2]$ and $b_{1}(t)\in[0,1]$ , but now with $a_{2}=1$ and $b_{2}=-3.5$ . With fixed parameters the corresponding system is still unstable and non-minimum phase.

We simulate the closed-loop pole placement step tracking controller of Section VII with a zero initial condition, initial parameter estimates at the midpoints of the admissible intervals, and with time-varying parameters:

[TABLE]

with a non-zero disturbance:

[TABLE]

and a square wave reference signal of $y^{*}(t)=\mbox{sgn}[sin(0.01t)]$ . We plot the result in Figure 3; we see that the parameter estimates crudely follows the system parameters, with less accuracy than in the previous sub-section, partly due to the fact that the constant setpoint dominates the estimation process and leads to higher inaccuracy. As a result, $y(t)$ does a good job of following $y^{*}$ on average, but with the occasional flurry of activity when the parameter estimates are highly inaccurate. When the noise is increased five-fold at $k=2500$ , the behaviour degrades only slightly.

IX Summary and Conclusions

Here we show that if the original, ideal, projection algorithm is used in the estimation process (subject to the assumption that the plant parameters lie in a convex, compact set), then the corresponding pole placement adaptive controller guarantees linear-like convolution bounds on the closed loop behaviour, which confers exponential stability and a bounded noise gain (in every $p$ -norm with $1\leq p\leq\infty$ ), unlike almost all other parameter adaptive controllers. This can be leveraged to prove tolerance to unmodelled dynamics and plant parameter variation. We emphasize that there is no persistent excitation requirement of any sort; the improved performance arises from the vigilant nature of the the ideal parameter estimation algorithm.

As far as the author is aware, the linear-like convolution bound proven here is a first in parameter adaptive control. It allows a modular approach to be used in analysing time-varying parameters and unmodelled dynamics. This approach avoids all of the fixes invented in the 1980s, such as signal normalization and deadzones, used to deal with the lack of robustness to unmodelled dynamics and time-varying parameters.

We are presently working on extending the approach to the model reference adaptive control setup. It will be interesting to see if the convexity assumption can be removed by using multi-estimators, i.e. cover the the set of admissible parameters by a finite number of convex sets, and then use an estimator for each such set. Extending the approach to the continuous-time setting may prove challenging, since a direct application would yield a non-Lipschitz continuous estimator, which brings with it mathematical solveability issues.

X Appendix

Proof of Proposition 1:

Since projection does not make the parameter estimate worse, it follows from (20) that

[TABLE]

so the first inequality holds.

We now turn to energy analysis. We first define $\tilde{\check{\theta}}(t):=\check{\theta}(t)-\theta^{*}$ and $\check{V}(t):={\tilde{\check{\theta}}}(T)^{T}{\tilde{\check{\theta}}}(t)$ . Next, we subtract $\theta^{*}$ from each side of (20), yielding

[TABLE]

Then

[TABLE]

Now let us analyse the three terms on the RHS: the fact that $W_{1}(t)^{2}=W_{1}(t)$ allows us to simplify the first term; the fact that $W_{1}(t)W_{2}(t)=W_{2}(t)$ means that the second term is zero; $W_{2}(t)^{T}W_{2}(t)={\rho_{\delta}(\phi(t),e(t+1))}\frac{1}{\phi(t)^{T}\phi(t)}$ , which simplifies the third term. We end up with

[TABLE]

Since projection never makes the estimate worse, it follow that

[TABLE]

$\Box$

Proof of Lemma 1: Fix $\delta\in(0,\infty]$ and $\sigma\in(\underline{\lambda},1)$ . First of all, it is well known that the characteristic polynomial of $\bar{A}(t)$ is exactly $z^{2n}A^{*}(z^{-1})$ for every $t\in{\bf Z}$ . Furthermore, it is well known that the coefficients of $\hat{L}(t,z^{-1})$ and $\hat{P}(t,z^{-1})$ are the solution of a linear equation, and are analytic functions of $\hat{\theta}(t)\in{\cal S}$ . Hence, there exists a constant $\gamma_{1}$ so that, for every set of initial conditions, $y^{*}\in{l_{\infty}}$ and $d\in{l_{\infty}}$ , we have $\sup_{t\geq t_{0}}\|\bar{A}(t)\|\leq\gamma_{1}$ .

To prove the first bound we now invoke the argument used in [1], who considered a more general time-varying situation but with more restrictions on $\sigma$ . By making a slight adjustment to the first part of the proof given there, we can prove that with $\gamma_{2}:=\sigma\frac{(\sigma+\gamma_{1})^{2n-1}}{(\sigma-\underline{\lambda})^{2n}}$ , then for every $t\geq t_{0}$ we have $\|\bar{A}(t)^{k}\|\leq\gamma_{2}\sigma^{k},\;\;k\geq 0$ , as desired.

Now we turn to the second bound. From Proposition 1 and the Cauchy-Schwarz inequality we obtain

[TABLE]

Now notice that

[TABLE]

The fact that the coefficients of $\hat{L}(t,z^{-1})$ and $\hat{P}(t,z^{-1})$ are analytic functions of $\hat{\theta}(t)\in{\cal S}$ means that there exists a constant $\gamma_{3}\geq 1$ so that

[TABLE]

so we conclude that the second bound holds as well. $\Box$ .

Bibliography37

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] C.A. Desoer, “Slowly Varying Discrete Time System x t + 1 = A t x t subscript 𝑥 𝑡 1 subscript 𝐴 𝑡 subscript 𝑥 𝑡 x_{t+1}=A_{t}x_{t} ”, Electronic Letters , vol. 6, no. 11, pp. 339 - 340, May 1970.
2[2] A. Feuer and A.S. Morse, “Adaptive Control of Single-input, Single-output Linear Systems”, IEEE Transactions on Automatic Control , vol. 23, No. 4, pp. 557-569, 1978.
3[3] M. Fu and B.R. Barmish, “Adaptive Stabilization of Linear Systems Via Switching Control”, IEEE Transactions on Automatic Control , vol. AC-31, pp. 1097-1103, Dec. 1986.
4[4] G.C. Goodwin, P.J. Ramadge, and P.E. Caines, “Discrete Time Multivariable Control”, IEEE Transactions on Automatic Control , vol. AC–25, pp. 449–456, 1980.
5[5] G.C. Goodwin and K.S. Sin, “Adaptive Filtering Prediction and Control, Prentice Hall, Englewood Cliffs, New Jersey, USA, 1984.
6[6] J. P. Hespanha, D. Liberzon, and A. S. Morse, “Hysteresis-based switching algorithms for supervisory control of uncertain systems”, Automatica , vol. 39, pp. 263-272, Feb 2003.
7[7] J. P. Hespanha, D. Liberzon, and A. S. Morse, “Overcoming the limitations of adaptive control by means of logic-based switching”, Systems and Control Letters , vol. 49, no. 1, pp. 49-65, May 2003.
8[8] P.A. Ioannou and K.S. Tsakalis, “A Robust Direct Adaptive Controller”, IEEE Transactions on Automatic Control , vol. AC-31 , no. 11, pp. 1033 – 1043, 1986.