Partial Realization Theory and System Identification Redux
Anders Lindquist

TL;DR
This paper revisits a nonstandard matrix Riccati equation from 20 years ago, deriving it anew within system identification, and demonstrating its universality for solving various analytic interpolation problems in control systems.
Contribution
It provides a new derivation of a nonstandard Riccati equation and shows its broad applicability to general analytic interpolation problems in control theory.
Findings
The Riccati equation is universal for interpolation problems.
The derivation connects realization theory with system identification.
It addresses a question posed by R.E. Kalman in the 1970s.
Abstract
Some twenty years ago we introduced a nonstandard matrix Riccati equation to solve the partial stochastic realization problem. In this paper we provide a new derivation of this equation in the context of system identification. This allows us to show that the nonstandard matrix Riccati equation is universal in the sense that it can be used to solve more general analytic interpolation problems by only changing certain parameters. Such interpolation problems are ubiquitous in systems and control. In this context we also discuss a question posed by R.E. Kalman in beginning of the 1970s.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl Systems and Identification
Partial Realization Theory and System Identification Redux
Anders Lindquist1 1Anders Lindquist is with Department of Automation, Shanghai Jiao Tong University, China, and the Royal Institute of Technology, Stockholm, Sweden [email protected]
Abstract
Some twenty years ago we introduced a nonstandard matrix Riccati equation to solve the partial stochastic realization problem. In this paper we provide a new derivation of this equation in the context of system identification. This allows us to show that the nonstandard matrix Riccati equation is universal in the sense that it can be used to solve more general analytic interpolation problems by only changing certain parameters. Such interpolation problems are ubiquitous in systems and control. In this context we also discuss a question posed by R.E. Kalman in beginning of the 1970s.
I INTRODUCTION
A classical basic problem in system identification is to estimate the (unknown) constant coefficient matrices in a stable linear (SISO) stochastic system
[TABLE]
driven by white noise , from a record of observations
[TABLE]
of the output process , which is stationary in steady state. Modulo possible unobservable and/or unreachable modes, choice of coordinates and placement of zeros, this is equivalent to finding a shaping filter
[TABLE]
with a minimum-phase transfer function
[TABLE]
where
[TABLE]
are Schur polynomials, i.e., polynomials with all its roots in the open unit disc, and is a positive number. Then the rational function
[TABLE]
satisfying
[TABLE]
is positive real (see, e.g., [1]), and a simple calculation shows that
[TABLE]
is a Schur polynomial whose coefficients can be determined from the linear system of equations corresponding to the relation
[TABLE]
In fact, (7) is equivalent to
[TABLE]
where
[TABLE]
is the power spectral density of the stationary output process and
[TABLE]
are the covariance lags [1]. Then,
[TABLE]
is analytic in the complement of the unit disc in the complex plane and maps to the right half plane, estabishing the positive-real property.
Now, if we had an infinite observation record (2), i.e, , then we would have an infinite sequence of covariance lags that could be determined from the ergodic limits
[TABLE]
see, e.g., [1]. Then, identifying coefficients of powers of in as done in [2], we obtain
[TABLE]
for nonnegative powers and
[TABLE]
for negative powers. The coefficient matrix in (14) is a Hankel matrix that we denote . By Kronecker’s theorem [3],
[TABLE]
Hence, , and hence also , can be determined from finite sequence of covariance lags, where .
Therefore it may seem that only a finite covariance record is needed, and this also a basic assumption in the early work on so-called subspace identification [4, 5, 6], where in general the biased ergodic estimates
[TABLE]
were used to insure that the corresponding Toeplitz matrix is positive definite, as required. However, as we pointed out in [7], this is incorrect and may lead to an that is not positive real; also see [1, Chapter 13]. This is due to the possible difference between algebraic degree and positive degree (Section II-B). The classical subspace identification procedures were based on solving the deterministic partial realization problem [3, 9] rather than the stochastic one, namely the rational covariance extension problem, which we shall return to in Section II.
The focus of this paper will be on a certain nonstandard Riccati equation, called the Covariance Extension Equation (CEE), which solves the rational covariance extension problem. It will be introduced in Section II-C. Section II as a whole is devoted to background material for the paper. In addition, in Section II-D we consider a question posed by Kalman and give a partial answer. In Section III we provide a new derivation of CEE that will allow us to establish that versions of CEE can be used to solve more general analytic interpolation problems encountered in systems and control. Such an example will be given in Section IV. We conclude with Section V, where we some future directions of research are discussed.
II BACKGROUND
II-A Rational covariance Extension problem
The rational covariance extension problem, first formulated by Kalman in [8], can be stated in the following way. Given a positive covariance sequence , i.e., a sequence with the property that the Toeplitz matrix
[TABLE]
is positive definite. find an infinite extension such that the function (12) is a rational positive-real function of degree at most . This is the proper partial realization problem connected to the system identification problem posed in the introduction. If we ignore the condition that be positive real, we have a deterministic partial realization problem equivalent to Padé approximation [9]. This is the problem solved in the original papers on subspace identification.
The following theorem provides a smooth complete parameterization of the set set of solutions to rational covariance extension problem.
Theorem 1
Let be a positive covariance sequence. Then, given any Schur polynomial (4), there is one and only one Schur polynomial (5) and such that
[TABLE]
is a shaping filter for . The mapping from to is diffeomorphism.
The existence part of Theorem 1 was proved in [10] (also see [11]) and the rest of the theorem in [12]. Note that and may have common roots, so the degree of might be less than .
For each parameter there is a convex optimization problem solving for , which first appeared in [13] (also see [14, 15]), but here we shall consider a different method of solution described in Section II-C.
II-B Algebraic and positive degree
The algebraic degree of is the minimal degree of a deterministic partial realization of . It is related to the rank of a Hankel matrix [9] and has the generic value , i.e., sequences that fail to have this algebraic degree live on a thin (lower-dimensional) subset, and there is no nonempty open set of vectors having an algebraic degree different from .
The positive degree of is the minimum degree of any solution to the corresponding rational covariance extension problem. By contrast, it does not have a generic value, as seen from the following theorem proved in [2].
Theorem 2
For each such that , there is a nonempty open set in of sequences for which is the positive degree. The maximal positive degree is .
Since the original subspace identification algoritms are based on Hankel factorization, they produce solutions with the algebraic degree, which may not coincide with the required positive degree. In [16] we presented a system that produces data for which there is massive failure of the basic subspace identification algorithms. This led to quite a lot of work devising ad hoc fixes that are now included in the codes.
II-C Covariance extension equation
Given the Schur polynomial (4), we define
[TABLE]
Moreover, we represent the covariance data in terms of the first coefficients in the expansion
[TABLE]
about infinity and define
[TABLE]
Finally define the the function given by
[TABLE]
Then the following nonstandard Riccati equation
[TABLE]
where ′ denotes transposition, was called the Covariance Extension Equation (CEE) in [2]. It parameterizes the solutions to the rational covariance extension problem in terms of the covariance data and the numerator polynomial corresponding to desired spectral zeros. In fact, the following theorem was proved in [2].
Theorem 3
Let be a positive covariance sequence. Then, for each Schur polynomial (4), there is a unique symmetric solution of CEE satisfying . Moreover, for each there is a unique shaping filter (3) for , where and are given in terms of the corresponding by
[TABLE]
Here . The degree of equals the rank of .
In [17, 18] we presented a homotopy continuation algorithm for solving CEE.
II-D Kalman’s question
In view of the fact that the algebraic degree can be determined from the rank of a Hankel matrix of the covariance data [3, 9], in 1972 Kalman [19] posed the question whether there is a similar matrix-rank criterion for determining the positive degree. In view of Theorem 2, this would seem impossible. The closest we have found in this direction is the following result, which follows from Theorem 3 and is reported in [2].
Proposition 4
Let be the unique solution of (20). Then the positive degree of the covariance sequence equals the minimum of over all Schur polynomials .
III A NEW DERIVATION OF THE COVARIANCE EXTENSION EQUATION
We present a new derivation of CEE which is divided into two separate steps, namely one that just imposes the condition that be positive real and rational of degree at most , and a second one imposing the interpolation condition that should match the first covariances . In this way, we see that this nonstandard matrix Riccati equation is universal in the sense that it can be used to solve more general analytic interpolation problems only changing certain parameters. In fact, the first step remains the same in this more general context.
III-A Stochastic realization
The rational function defined by (6) has a realization
[TABLE]
where
[TABLE]
is the upward shift matrix, and is an -vector to be determined. Note that this need not be a minimal realization, as there could be cancellations of common zeros of and .
Lemma 5
The vector in (22) is given by
[TABLE]
where and are the -vectors of coefficients in the polynomials and , respectively, in (6).
Proof:
[TABLE]
to which we apply the matrix inversion lemma (Appendix) to obtain
[TABLE]
Hence, since is the denominator polynomial, we must have , from which (24) follows. ∎
In the same way, , given by (3), has a realization
[TABLE]
for some -vector . Since is a minimum-phase spectral factor of , from stochastic realization theory [1, Chapter 6] we have that
[TABLE]
where is symmetric minimum solution of the algebraic Riccati equation
[TABLE]
Lemma 6
The vectors and in (22) and (25) are given by
[TABLE]
where is the minimal solution of (27).
Proof:
Applying the matrix inversion lemma (Appendix) to
[TABLE]
yields
[TABLE]
However, since the denominator is , we must have
[TABLE]
and hence (28b) follows. Moreover, . This together with (28b) yields
[TABLE]
from which (28a) follows. ∎
Lemma 7
The minimal solution of (27) is also the minimal solution of
[TABLE]
Proof:
Note that (27) can be written
[TABLE]
from which we have
[TABLE]
which, in view of (28), in turn yields (29). ∎
III-B Interpolation condition
Next we introduce the interpolation condition that matches the first covariance lags . To this end, we identify coefficients of nonnegative powers in with given by (12). This yields
[TABLE]
where
[TABLE]
However, by Lemma 5, , which inserted into (30) yields . Combining this with (28a) we have , and hence (28a) can be written
[TABLE]
Note that here is an matrix and not a matrix with being the algebraic degree of as in (13). By Theorem 2, is an upper bound of the positive degree.
Finally we show that (32) is equivalent to (19). To this end, we first identify negative powers of in
[TABLE]
obtained from (17), to obtain
[TABLE]
from which we have
[TABLE]
Consequently, and , so (19) follows from (19).
IV GENERALIZATION TO ANALYTIC INTERPOLATION
Next we show that the nonstandard Riccati equation (20) is universal in the sense that it holds for more general analytic interpolation problems by merely redefining the parameters and . We shall demonstrate this for a Nevanlinna-Pick interpolation problem with rationality constraints.
Given distinct point in the complement of the unit disc of the complex plane and points in the open right half-plane, find a rational positive real function of degree at most satisfying the interpolation condition
[TABLE]
It is convenient to chose the points in conjugate pairs to ensure that is a real function.
Clearly all calculations in Section III-A remain intact, and it on remains to enforce the interpolation condition, which we may write as
[TABLE]
or, equivalently, as
[TABLE]
Since the points are distinct, the Vandermonde matrix (35b) is nonsingular, and hence we have
[TABLE]
Therefore, by Lemma 5,
[TABLE]
Consequently, in view of (28a),
[TABLE]
Assuming that is nonsingular, for the moment as a technical condition to be more carefully investigated in the future, we define
[TABLE]
where for clarity we have added an index to each identity matrix to indicate dimension. Then
[TABLE]
which has the same form as (19). In combination with (28a) and (26), this also yields
[TABLE]
which is of the same form as (21).
We have thus demonstrated that the Covariance extension equation can be used also in this case after changing the definition of the interpolation parameters . Hence the algorithms using homotopy continuation presented in [17, 18] could also be used here.
V CONCLUSIONS
In this paper we provide a new derivation of a nonstandard Riccati equation (CEE) for rational covariance extension that separates the part that only depends on the positivity and rationality constraints and the part that depends on the interpolation condition. In this way we see that the structure of CEE remains intact, and only certain interpolation parameters need to be modified when treating more general analytic interpolation problems with rationality constraints.
It should be possible to generalize this framework to the MIMO case. This, together with some remaining numerical and technical issues, will the topic of a future paper.
APPENDIX
For ease of reference, we here reproduce the well-known matrix inversion lemma. Provided all inverses exist, the formula
[TABLE]
holds for otherwise arbitrary matrices of compatible dimensions. This is seen by direct computation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Lindquist and G. Picci, Linear Stochastic Systems: A Geometric Approach to Modeling, Estimation and Identification. Berlin, Heidelberg: Springer-Verlag, 2015.
- 2[2] C. I. Byrnes and A. Lindquist, On the partial stochastic realization problem, IEEE Trans. Automatic Control, vol. 42, Aug. 1997, pp. 1049-1070.
- 3[3] R. E. Kalman, P. L. Falb and M. A. Arbib, Topics in Mathematical Systems Theory, Mc Graw-Hill, 1969.
- 4[4] M. Aoki, State Space Modeling of Time Series, Springer-Verlag, 1987.
- 5[5] P. Van Overschee and B. De Moor, Subspace algorithms for stochastic identification problem, Automatica, vol. 3, 1993, pp. 649-660.
- 6[6] P. Van Overschee and B. De Moor, Subspace Identification for Linear Systems: Theory Implementation Applications, Kluwer Academic Publishers, 1996.
- 7[7] A. Lindquist and G. Picci, Canonical correlation analysis, approximate covariance extension, and identification of stationary time series, Automatica, vol. 32, no. 5, 1996, pp. 709-733.
- 8[8] R. E. Kalman, Realization of covariance sequences, Proc. Toeplitz Memorial Conference, Tel Aviv, Israel, 1981.
