New Results on Parameter Estimation via Dynamic Regressor Extension and Mixing: Continuous and Discrete-time Cases
Romeo Ortega, Stanislav Aranovskiy, Anton A. Pyrkin, Alessandro, Astolfi, Alexey A. Bobtsov

TL;DR
This paper advances parameter estimation techniques in linear regression models by unifying continuous and discrete-time approaches, introducing new regressor matrices, and ensuring finite-time convergence with improved transient performance.
Contribution
It provides a unified framework for continuous and discrete-time estimators, introduces two novel regressor matrices, and offers an estimator for finite-time parameter convergence.
Findings
Unified treatment of continuous and discrete-time cases
Introduction of two new extended regressor matrices
Finite-time parameter estimation with tracking of time-varying parameters
Abstract
We present some new results on the dynamic regressor extension and mixing parameter estimators for linear regression models recently proposed in the literature. This technique has proven instrumental in the solution of several open problems in system identification and adaptive control. The new results include: (i) a unified treatment of the continuous and the discrete-time cases; (ii) the proposal of two new extended regressor matrices, one which guarantees a quantifiable transient performance improvement, and the other exponential convergence under conditions that are strictly weaker than regressor persistence of excitation; and (iii) an alternative estimator ensuring parameter estimation in finite-time that retains its alertness to track time-varying parameters. Simulations that illustrate our results are also presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
New Results on Parameter Estimation via Dynamic Regressor Extension and Mixing: Continuous and Discrete-time Cases
Romeo Ortega, Fellow, IEEE, Stanislav Aranovskiy, Senior member, IEEE, Anton A. Pyrkin, Member, IEEE, Alessandro Astolfi, Fellow, IEEE, Alexey A. Bobtsov, Senior member, IEEE R. Ortega is with Laboratoire des Signaux et Systèmes, CNRS-SUPELEC, Plateau du Moulon, 91192, Gif-sur-Yvette, France and ITMO University, Kronverkskiy av. 49, St. Petersburg, 197101, Russia.S. Aranovskiy is with CentraleSupélec – IETR, Avenue de la Boulaie, 35576 Cesson-Sévigné, France A. Pyrkin is with Hangzhou Dianzi University, Hangzhou, 310018, China.S. Aranovskiy, A. Pyrkin and A. Bobtsov are with the Faculty of Control Systems and Robotics, ITMO University, Kronverkskiy av. 49, St. Petersburg, 197101, Russia.A. Astolfi is with the Department of Electrical and Electronic Engineering, Imperial College London, London SW7 2AZ, UK and the DICII, Universita di Roma “Tor Vergata”, Via del Politecnico 1, 00133 Roma, ItalyA. Pyrkin is a corresponding author. E-mail: [email protected]
Abstract
We present some new results on the dynamic regressor extension and mixing parameter estimators for linear regression models recently proposed in the literature. This technique has proven instrumental in the solution of several open problems in system identification and adaptive control. The new results include: (i) a unified treatment of the continuous and the discrete-time cases; (ii) the proposal of two new extended regressor matrices, one which guarantees a quantifiable transient performance improvement, and the other exponential convergence under conditions that are strictly weaker than regressor persistence of excitation; and (iii) an alternative estimator ensuring parameter estimation in finite-time that retains its alertness to track time-varying parameters. Simulations that illustrate our results are also presented.
I Introduction
Estimation of the parameters that describe an underlying physical setting is one of the central problems in control and systems theory that has attracted the attention of many researchers for several years. A typical scenario, which appears in system identification and adaptive control [9, 10, 16, 17, 21], is when the unknown parameters and the measured data are linearly related in a so-called linear regression equation (LRE). Classical solutions for this problem are gradient and least-squares (LS) estimators. The main drawback of these schemes is that convergence of the parameter estimates relies on the availability of signal excitation, a feature that is codified in the restrictive assumption of persistency of excitation (PE) of the regressor vector. Moreover, their transient performance is highly unpredictable and only a weak monotonicity property of the estimation errors can be guaranteed.
To overcome these two problems a new parameter estimation procedure, called dynamic regressor extension and mixing (DREM), has recently been proposed in [2] for continuous-time (CT) and in [5] for discrete-time (DT) systems. The construction of DREM estimators proceeds in two steps, first, the inclusion of a free linear operator that creates an extended, matrix LRE. Second, a nonlinear manipulation of the data that allow generating, out of an -dimensional LRE, scalar, and independent, LRE. DREM estimators have been successfully applied in a variety of identification and adaptive control problems. Interestingly, it has been shown in [18] that DREM can be reformulated as a functional Luenberger observer.
DREM estimators outperform classical gradient or LS estimators in the following precise aspects: independently of the excitation conditions, DREM guarantees monotonicity of each element of the parameter error vector that is much stronger than monotonicity of the vector norm, which is ensured with classical estimators. Moreover, parameter convergence in DREM is established without the PE condition. Instead of PE a non-square integrability condition on the determinant of a designer-dependent extended regressor matrix is imposed. A final interesting property of DREM that has been established in [8] is that it can be used to generate estimates with finite-time convergence (FTC), under interval excitation assumption.
The following new results on DREM are presented here:
(i) The unified treatment of the CT and the DT cases.
(ii) The definition of new linear operators that:
ensure parameter error convergence under excitation conditions that are strictly weaker than regressor PE;
guarantee a transient performance improvement;
show that DREM contains, as a particular case, the extended LRE proposed in [11], which is used also in the adaptive controllers recently proposed in [6, 7, 20].
(iii) An alternative estimator, ensuring FTC, that retains its alertness to track time-varying parameters.
The remainder of the paper is organized as follows. To set up the notation a brief description of gradient and DREM estimators is given in Section II. In Section III we present the new version of DREM that ensures convergence under excitation conditions that are strictly weaker than regressor PE. In Section LABEL:sec4 a general form of the free operator used in DREM is proposed to, on one hand, re-derive the extended regressor of [11] and, on the other hand, prove that transient performance is—quantifiably—improved. Section LABEL:sec5 presents a new DREM-based estimator with FTC. Simulation results are presented in Section LABEL:sec6. The paper is wrapped-up with future research in Section VII.
Notation. is the identity matrix. , , and denote the positive and non-negative real and integer numbers, respectively. For , we denote . Continuous-time (CT) signals are denoted , while for discrete-time (DT) sequences we use , with the sampling time. The action of an operator on a CT signal is denoted , while for an operator and a sequence we use . When a formula is applicable to CT signals and DT sequences the time argument is omitted.
II Background Material
We deal with the problem of on-line estimation of the unknown, constant parameters appearing in a LRE of the form
[TABLE]
where and are measurable CT or DT signals and is a (generic) exponentially decaying signal.111This signal may be stemming from the effect of the initial conditions of various filters used to generate the LRE. It is well-known that the availability of a LRE of the form (1) is instrumental for the development of most system identifiers and adaptive controllers [21]. Following standard practice, throughout the paper, the term is omitted.
II-A Gradient estimator and the PE condition
In this subsection we recall the well-known gradient estimator, derive its parameter error equation (PEE) and recall its stability properties. Although this material is very well-known, it is included to make the document self-contained and set up the notation. First, we introduce the following.
Definition 1**.**
A bounded signal is PE (denoted ) if there exist such that
[TABLE]
for some in CT or
[TABLE]
for some , with , in DT.
The following proposition is a milestone for systems theory and may be found in all identification and adaptive control textbooks, e.g., [21].
Proposition 1**.**
Consider the LRE (1).
(CT) The CT gradient-descent estimator
[TABLE]
with ensures the following.
The norm of the parameter error vector is monotonically non-increasing, that is,
[TABLE]
The CT PEE is given by
[TABLE]
and its zero equilibrium is globally exponentially stable (GES) if and only if . Moreover, there exist an optimal value of for which the rate of convergence is maximum.
(DT) The DT gradient-descent estimator
[TABLE]
ensures the following.
The norm of the parameter error vector verifies
[TABLE]
The DT PEE is given by
[TABLE]
and its zero equilibrium is GES if and only if .
In most applications, PE is an extremely restrictive condition, hence the interest of relaxing it. See [19] for a recent review of new estimators relaxing the PE condition, which include the ones reported in [6, 7, 20].
II-B Generation of scalar LRE via DREM
To overcome the limitation imposed by the PE condition and improve the transient performance of the estimator the DREM procedure, introduced in [2, 5], generates new, one–dimensional, LRE to independently estimate each of the parameters. The first step in DREM is to introduce a linear, single-input -output, bounded-input bounded-output (BIBO)–stable operator and define the vector and the matrix as
[TABLE]
Clearly, because of linearity and BIBO stability, these signals satisfy
[TABLE]
At this point the key step of regressor “mixing” of the DREM procedure is used to obtain a set of scalar equations as follows. First, recall that, for any (possibly singular) matrix we have [12] , where is the adjunct (also called “adjugate”) matrix. Now, multiplying from the left the vector equation (6) by the adjunct matrix of , we get
[TABLE]
where we have defined the scalar function
[TABLE]
and the vector
[TABLE]
Remark 1**.**
In [13] an extended regressor like (6) has been constructed in CT using linear time-invariant (LTI) filters in the operator used in (5)—see also [11], where this modification is also discussed. Unfortunately, besides some simulation evidence, no quantitative advantage—with respect to the gradient estimation—has been established for it.
II-C Properties of gradient parameter estimators in DREM
The availability of the scalar LREs (7) is the main feature of DREM that distinguishes it with respect to all other estimators. Indeed, as shown in the propostion below—the proof of which may be found in [2, 5]—it allows obtaining significantly stronger results using simple gradient estimators.
Proposition 2**.**
Consider the scalar LREs (7).
(CT) The CT gradient-descent estimators222In the sequel, the quantifier is omitted for brevity.
[TABLE]
with ensures the following.
The CT PEEs are given by
[TABLE]
The individual parameter errors are monotonically non-increasing, that is,
[TABLE]
The following equivalence holds
[TABLE]
and convergence can be made arbitrarily fast increasing .
If , the convergence is exponential.
(DT) The DT gradient-descent estimator
[TABLE]
ensures the following.
The DT PEEs are given by
[TABLE]
The elements of the parameter error vector verify
[TABLE]
The following equivalence holds
[TABLE]
and convergence can be made arbitrarily fast decreasing .
If , the convergence is exponential.
There are three important advantages of DREM over the standard gradient estimator.
P1 As shown in (14) the individual parameter errors are monotonically non-increasing, a property that is strictly stronger than monotonicity of their norm indicated in (3) and (4).
P2 Parameter convergence is established without the restrictive PE assumption—being replaced, instead, by a non square-integrability/summability assumption.
P3 Convergence rates of DREM can be made arbitrarily fast simply increasing in CT (or decreasing it in DT).
Remark 2**.**
Regarding the property P2, in [2] the relationship in CT between the conditions and is thoroughly discussed. In particular, in [2] it has been shown that, for arbitrary regressor vectors , these conditions are unrelated. On the other hand, for the case of identification of LTI systems, it has been shown in [4] that if and only if for almost all LTI operators .
III A DREM Estimator with Strictly Weaker Convergence Conditions
In this section we present a particular version of DREM for which it is possible to show that its convergence conditions are strictly weaker than . Since the construction, and the results, are very similar for CT and DT estimators, for brevity, we consider the latter case only.
Proposition 3**.**
Consider the DT version of the LRE (1). Fix an integer and define (5) using the LTV operator
[TABLE]
Assume and , with the size of the window given in Definition 1. The scalar, gradient-descent DT estimators (12), with and defined in (8) and (9), ensure the following additional properties.
The condition for parameter convergence of DREM, i.e., , is strictly weaker than . More precisely, the following implications hold:
[TABLE]
The condition for exponential parameter convergence of DREM, i.e., , is also weaker than in the following precise sense
[TABLE]
Proof.
To prove the claims we make the key observation that
[TABLE]
The implications (LABEL:phipeimpdelnotl2) and (LABEL:phipeimpdelpe) follow using the identity (17), Definition 1 and noting the obvious fact that if in a window of size , then it is also PE for any window of size .
The proof of (3) is established with the following scalar counterexample: with . Since tends to zero it is not PE, however,
Finally, the proof of (3) is established with the following chain of implications:
Figure 1: Transients of the parameter estimation errors for different estimators and the control input .
Figure 2: Transients of the parameter estimation errors for different estimators and the control input .
VI-B Alertness preserving DREM with FTC of Proposition LABEL:pro6
In this subsection we compare the two FTC DREMs presented in Section LABEL:sec5. Namely, the FTC DREM of Proposition LABEL:pro60, defined by (LABEL:hatthew), (LABEL:wi), and the new FTC DREM of Proposition LABEL:pro6 given by (LABEL:dotwd) and
[TABLE]
which is computed as soon as . The objective of the simulation is to prove that the new FTC DREM is able to track time-varying parameters when new excitation arrives. This is in contrast with the old FTC DREM estimator that, since , converges to the gradient estimator and loses its FTC alertness property.
We consider the simplest case of a scalar system and simulate the gradient estimator (10), that is,
[TABLE]
together with (LABEL:wi) and (LABEL:dotwd), which are computed for , with defined via the interval excitation criteria (LABEL:inttc) and (LABEL:inttcd), respectively.
We consider two scenarios: with and without excitation in . For the first case we consider the PE signal , and for the second one . Note that in the second case , hence it is not PE. However, , hence it satisfies the conditions for convergence of the DREM estimator.
For simulations we set , , and . These parameters have been chosen such that the transients of both FTC estimators coincide in the ideal case when is constant and the system is excited. To illustrate the FTC tracking capabilities of the estimators the unknown parameter is time-varying and given by
[TABLE]
i.e., it starts at , jumps to at , and then linearly returns to .
The transient of the estimators for and are given in Fig. 3, where we plot the gradient estimate , as well as the old and the new FTC estimates and . We observe that, as expected, both FTC estimators are overlapped and converge in finite time, while the gradient converges only asymptotically.
The behavior of the estimators for is shown in Figure 4, where we also plot the time-varying parameter . As predicted by the theory, the old FTC behaves as the gradient estimator and their trajectories coincide. On the other hand, the new estimator preserves FTC alertness after the first parameter jump and achieves fast tracking of the linearly time-varying .
For the non-PE case of , the transients of the estimators are given in Fig. 5. We observe that both FTC estimators, again, essentially coincide in the first few seconds and converge in finite time, while the gradient does it only asymptotically. After the first parameter change at the old FTC and the gradient coincide, while the new FTC manages to track in finite time the parameter jump. However, during the ramp parameter change—because of the lack of excitation—neither one of the estimators can track the parameter variation but the new FTC estimator performs much better.
VII Future Work
Current research is under way to derive some of the new results presented only for the CT time case, to the practically important, DT case. Moreover, in the spirit of [4], we are further exploring the role of the operator on the determinant of the extended regressor matrix and we plan to study the effect of an additive signal in the LRE (1), to study its input-to-state stability properties.
A widely open, long-term research topic is how to deal with nonlinear parameterizations, that is, the case in which (1) is replaced by , where is a nonlinear function. Some preliminary results exploiting convexity, concavity or monotonicity may be found in [1, 14, 15]. As pointed out in [2], DREM is directly applicable—without overparameterization—in the simplest case of separable nonlinearities, that is, when the regression is of the form . The more general case is a challenging open problem.
Acknowledgment
The authors would like to thank Vladimir Nikiforov and Dmitry Gerasimov for many useful discussions that helped us to improve the quality of our contribution.
This paper is partly supported by by Government of Russian Federation (GOSZADANIE 2.8878.2017/8.9, grant 08-08), the European Union’s Horizon 2020 Research and Innovation Programme under Grant 739551 (KIOS CoE).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A.Annaswamy, F. Skantze and A. Loh, Adaptive control of continuous time systems with convex/concave parametrization, Automatica , vol. 34, no. 1, pp. 33-49, 1998.
- 2[2] S. Aranovskiy, A. Bobtsov, R. Ortega and A. Pyrkin, Performance enhancement of parameter estimators via dynamic regressor extension and mixing, IEEE Trans. Automatic Control , vol. 62, no 7, pp.3546-3550, 2017.
- 3[3] A. Astolfi, D. Karagiannis and R. Ortega, Nonlinear and Adaptive Control with Applications , Springer-Verlag, Berlin, Communications and Control Engineering, 2008.
- 4[4] A. Belov, S. Aranovskiy, R. Ortega, N. Barabanov and A. Bobtsov, Enhanced parameter convergence for linear systems identification: The DREM approach, 2018 European Control Conference , Limassol, Cyprus, 12-15/06, 2018. (To appear in Int. J. on Adaptive Control and Signal Processing ).
- 5[5] A. Belov, R. Ortega and A. Bobtsov, Guaranteed performance adaptive identification scheme of discrete-dime systems using dynamic regressor extension and mixing, 18th IFAC Symposium on System Identification, (SYSID 2018) , Stockholm, Sweden, July 9-11, 2018.
- 6[6] N. Cho, H. Shin, Y. Kim and A. Tsourdos, Composite MRAC with parameter convergence under finite excitation, IEEE Trans. Automatic Control , vol. 63, no. 3, pp. 811-818, 2018.
- 7[7] G. Chowdhary, T. Yucelen, M. Mhlegg and E. Johnson, Concurrent learning adaptive control of linear systems with exponentially convergent bounds, Int. J. on Adaptive Control and Signal Processing , vol. 27, no. 4, pp. 280-301, 2013.
- 8[8] D. Gerasimov, R. Ortega and V. Nikiforov, Adaptive control of multivariable systems with reduced knowledge of high frequency gain: Application of dynamic regressor extension and mixing estimators, 18th IFAC Symposium on System Identification, (SYSID 2018) , Stockholm, Sweden, July 9-11, 2018.
