Local Linearizations of Rational Matrices with Application to Rational Approximations of Nonlinear Eigenvalue Problems
Froil\'an M. Dopico, Silvia Marcaida, Mar\'ia C. Quintana, Paul Van, Dooren

TL;DR
This paper introduces a comprehensive framework for local linearizations of rational matrices, enabling structure-preserving approximations of zeros, poles, and eigenvalues, with applications to nonlinear eigenvalue problems.
Contribution
It provides a unified definition of local linearizations that encompasses previous approaches and rigorously explains their properties, especially in the context of nonlinear eigenvalue problems.
Findings
New definition of local linearizations for rational matrices.
Unified framework explaining properties of existing pencils.
Application to rational approximation of nonlinear eigenvalue problems.
Abstract
This paper presents a definition for local linearizations of rational matrices and studies their properties. This definition allows us to introduce matrix pencils associated to a rational matrix that preserve its structure of zeros and poles in subsets of any algebraically closed field and also at infinity. Moreover, such definition includes, as particular cases, other definitions that have been used previously in the literature. In this way, this new theory of local linearizations captures and explains rigorously the properties of all the different pencils that have been used from the 1970's until 2019 for computing zeros, poles and eigenvalues of rational matrices. Particular attention is paid to those pencils that have appeared recently in the numerical solution of nonlinear eigenvalue problems through rational approximation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Local Linearizations of Rational Matrices with Application to Rational Approximations of Nonlinear Eigenvalue Problems
Froilán M. Dopico111Supported by “Ministerio de Economía, Industria y Competitividad (MINECO)” of Spain and “Fondo Europeo de Desarrollo Regional (FEDER)” of EU through grants MTM2015-65798-P and MTM2017-90682-REDT. The research of M. C. Quintana is funded by the “contrato predoctoral” BES-2016-076744 of MINECO.
Silvia Marcaida222Supported by “Ministerio de Economía, Industria y Competitividad (MINECO)” of Spain and “Fondo Europeo de Desarrollo Regional (FEDER)” of EU through grants MTM2017-83624-P and MTM2017-90682-REDT, and by UPV/EHU through grant GIU16/42.
María C. Quintana33footnotemark: 3
Paul Van Dooren444This work was partially developed while Paul Van Dooren held a “Chair of Excellence UC3M - Banco de Santander” at Universidad Carlos III de Madrid in the academic year 2017-2018.
Departamento de Matemáticas, Universidad Carlos III de Madrid, Avda. Universidad 30, 28911 Leganés, Spain.
Departamento de Matemática Aplicada y Estadística e Investigación Operativa, Universidad del País Vasco UPV/EHU, Apdo. Correos 644, Bilbao 48080, Spain.
Department of Mathematical Engineering, Université catholique de Louvain, Avenue Georges Lemaître 4, B-1348 Louvain-la-Neuve, Belgium.
Abstract
This paper presents a definition for local linearizations of rational matrices and studies their properties. This definition allows us to introduce matrix pencils associated to a rational matrix that preserve its structure of zeros and poles in subsets of any algebraically closed field and also at infinity. Moreover, such definition includes, as particular cases, other definitions that have been used previously in the literature. In this way, this new theory of local linearizations captures and explains rigorously the properties of all the different pencils that have been used from the 1970’s until 2019 for computing zeros, poles and eigenvalues of rational matrices. Particular attention is paid to those pencils that have appeared recently in the numerical solution of nonlinear eigenvalue problems through rational approximation.
keywords:
rational matrix , rational eigenvalue problem , nonlinear eigenvalue problem , linearization , polynomial system matrix , rational approximation , block full rank pencils AMS subject classifications: 65F15, 15A18, 15A22, 15A54, 93B18, 93B20, 93B60
1 Introduction
Rational matrices, i.e., matrices whose entries are rational functions of a scalar variable, are a classical topic inside matrix theory that has received a lot of attention since the 1950s, as a consequence of their fundamental role in linear systems and control theory [23, 24]. Classical references on rational matrices and their applications to these areas are, for instance, the pioneering monographs [19, 28]. The most relevant structural data of a rational matrix are its zeros and poles, together with their partial multiplicities or structural indices, and its minimal indices, which exist only when the matrix is singular, i.e., rectangular or square with identically zero determinant. These structural data are very important in the applications mentioned above, which motivated in the 1970s a considerable research activity on the development of numerical algorithms for computing them, see [33] and the references therein. Among the different algorithms developed for this purpose in the 1970-80s, the most reliable ones were based on constructing a matrix pencil, i.e., a matrix polynomial of degree , containing exactly all the information about the structural data of the considered rational matrix [33, 37], and then applying to this matrix pencil backward stable algorithms, developed also in the 1970s, for computing the eigenvalues and/or other structural data of general pencils [26, 32].
The pencils mentioned in the previous paragraph are among the first examples of linearizations of rational matrices. Such pencils are, in fact, particular instances of minimal polynomial system matrices of the considered rational matrix, a key concept introduced by Rosenbrock [28] that allows us, among other things, to include simultaneously all the information about the zeros and the poles of a rational matrix into a polynomial matrix.
Recently, rational matrices have received considerable attention from the different perspective of what are called rational eigenvalue problems (REPs). Such REPs may arise directly from applications [25], as approximations of other nonlinear eigenvalue problems (NLEPs) (see, for instance, [18, 21, 29, 31]), and, even more, REPs have also been used to approximate polynomial eigenvalue problems (PEPs) in order to take advantage of certain low rank structures [22]. Since NLEPs are nowadays a very active area of research (see the recent survey [17] and the references therein), REPs and rational matrices are currently a hot topic inside applied and numerical linear algebra. In this scenario, it is of interest to establish in the next paragraphs connections and differences between how rational matrices are viewed in the classic areas of linear systems and control theory and in the modern one of NLEPs, since, unfortunately, some modern and pioneering references on NLEPs seem to ignore classic results on rational matrices.
First, let us review the definition of REPs. Given a regular rational matrix , the corresponding REP is defined as computing numbers and nonzero vectors such that . These and are called eigenvalues and (right) eigenvectors of , respectively, a terminology inherited from other matrix eigenvalue problems but that has never been used in standard references on rational matrices [19, 28]. Observe that the definition of REP assumes implicitly that is defined at , i.e., none of their entries become infinite. Thus, using the classic definitions for the structural data of rational matrices, we can say that is a zero of but not a pole, and we can see REPs as particular cases of the computational problems on rational matrices investigated in the 1970-80s.
Second, we emphasize that rational approximations of NLEPs are only reliable in a certain target set. Moreover, in many works [18, 21, 29, 31], the matrix defining the NLEP is assumed to be analytic in the target region, and such region does not contain the poles of the rational matrix defining the approximating REP. In particular, the poles are already known from the approximation process. This means that for those rational matrices coming from approximating these NLEPs, the poles are of no interest (since they are known), and only those zeros (eigenvalues) in the target set have to be computed. In addition, the structure at infinity (see [19] for a definition) is also of no interest. This is in stark contrast with the situation for rational matrices arising in linear systems and control theory, which, usually, are transfer functions of time invariant linear systems and, therefore, all the finite and infinite structure of zeros and poles related to the transfer function is of interest and has to be computed [33].
As said before, some influential modern references on solving numerically NLEPs via rational approximations ignore classic results on rational matrices. Probably, this is a consequence of the differences mentioned in the previous paragraph and, also, of the fact that rational matrices coming from approximating NLEPs may appear represented in forms different from the most standard ones in linear system and control theory. This lack of connections with classical results is unfortunate, but has had also the positive effect of producing new results on and approaches to rational matrices. For instance, on the unfortunate side, it is surprising that the idea of solving REPs via linearizations was not used in modern references until the key paper [30] was published, despite the fact it had been intensively used much earlier (see [33] and the references therein), and it is one of the most reliable methods for solving REPs. On the positive side, [30] introduced a new companion-like linearization of any rational matrix that is very useful in computations. For this purpose, [30] expressed the rational matrix as the sum of a polynomial matrix and a state-space realization and approached the problem with the spirit of linearizations of polynomial matrices [16], instead of using the classical point of view of polynomial system matrices. (However, it is worth highlighting that, in Example 4.11, we will see that the linearization in [30] is nothing else than a polynomial system matrix of the considered rational matrix. We will see in Section 6 that the same happens for the linearizations in [18].)
Another point to be remarked is that reference [30] started a confusing practice, common to several references dealing with linearizations of rational matrices that approximate NLEPs. Namely, to term as “linearizations” pencils which are proved to contain only partial information about the corresponding rational matrix. For example, the papers [18, 21, 29, 30], which are excellent from the numerical point of view, only prove (at most) that the algebraic and geometric multiplicities of the eigenvalues are preserved in the “linearization”, but nothing is proved about the partial multiplicities. This is in contrast with the standard definition of (strong) linearization of polynomial matrices [16, 9], which guarantees that linearizations contain all the information about the eigenvalues of polynomial matrices (including at infinity in the strong case), as well as with the linear minimal polynomial system matrices used as linearizations of rational matrices in [33, 37], which contain all the information about poles and zeros of the rational matrices.
The partial results proved in [30] were among the motivations of the development of a rigorous definition and theory of strong linearizations of arbitrary (regular or singular, square or rectangular) rational matrices in [5]. Moreover, infinitely many examples of such strong linearizations have been constructed in [5, Section 5.2] through the family of so-called strong block minimal bases linearizations of rational matrices. In simple words, the main idea of the theory in [5] is to combine minimal polynomial system matrices of rational matrices with the theory of linearizations of polynomial matrices [9, 10, 16] in the following sense: strong linearizations of a rational matrix are linear minimal polynomial system matrices of rational matrices that may be different from , but that are related to it via unimodular polynomial matrices, biproper rational matrices, and direct sums with identities. In this way such strong linearizations contain all the information about poles and zeros of the considered rational matrices and extend the “linearizations” used in [33, 37], which correspond to the particular case when . Related works about linearizations containing all the pole-zero information of a rational matrix (in some cases not at infinity) are [1, 7, 8, 11].
However, the definitions of linearization and strong linearization in [5] do not capture always the pencils defined in [18, 21, 29, 30] for two reasons. First, the pencils in [18, 21, 29, 30] do not always satisfy the minimality requirements of the definitions in [5]. Second, and related to the first fact, some of these pencils may not content all the information about the poles of the rational matrix (neither the information of those zeros that are also poles), and a zero of the linearization could be a pole of the rational matrix but not a zero. But, we stress that this is not a drawback in the setting of [18, 21, 29, 30] because, as explained before, in these cases the poles are of no interest, and only the eigenvalues in a certain target set have to be computed. This motivates us to develop in this paper a theory of what we call local linearizations of rational matrices, where the word local means that the linearization is only guaranteed to contain all the information about those zeros and poles of the rational matrix which are located in a certain set.
The theory of local linearizations of rational matrices captures all the pencils that have been used (as far as we know) in the literature for solving REPs arising from approximating NLEPs. As illustration, we will apply in this paper this theory to the pencils in [18, 29, 30] in several different ways. The application to the pencils in [21] is postponed to [12] with the goal of limiting the length of this paper. In addition, we will see that the definition of local linearizations include the definitions of linearizations and strong linearizations of arbitrary rational matrices presented in [5], just by considering as set the whole underlying field and including infinity in the strong case. As a consequence, local linearizations also include the pencils originally used in [33, 37]. Thus, this new local theory is a flexible tool that generalizes and includes most of the previous results available in the literature in this area. This is in part possible due to a new and more flexible treatment of polynomial system matrices at infinity.
The theory of local linearizations of rational matrices is based on the extension of Rosenbrock’s fundamental concept of minimal polynomial system matrix to a local perspective. Such extension is performed in a very simple and applicable manner that avoids as much as possible the use of abstract algebraic concepts. This is in contrast with related local approaches as the one in [6] and the references therein, which, in addition, are focused on the underlying local equivalence relationships rather than on the properties of polynomial system matrices. The local linearization approach connects the concept of linearization with classical results as the local Smith form of polynomial matrices (see, for instance, [16, Section S1.5]) and the local Smith–McMillan form of rational matrices (see [27, Theorem II.9] and [34]).
The paper is organized as follows. Section 2 summarizes some basic results that will be used in the rest of the paper. Locally minimal polynomial system matrices are defined and studied in Section 3. Section 4 presents the main definitions and properties of local linearizations of rational matrices. Section 5 introduces the so-called block full rank pencils, which are linearizations of rational matrices that do not contain any information about the poles, and are closely related to the block minimal bases linearizations of polynomial matrices recently presented in [10]. The application of the local theory to the pencils in [18] is analyzed in depth and from two perspectives in Section 6. Finally, Section 7 discusses the conclusions and some lines of future research. Several examples that illustrate the theoretical results are scattered throughout the paper. They are often based on the pencils introduced in [29, 30].
2 Preliminaries
We assume throughout this paper that is an algebraically closed field that does not include infinity. As usual, denotes the ring of polynomials with coefficients in and the field of rational functions or, equivalently, the field of fractions of . A rational function is said to be proper if strictly proper if and biproper if , where stands for “degree of”.
, and denote the sets of matrices with elements in and respectively. The elements of are called polynomial matrices or matrix polynomials. In the sequel we will use both terms. A unimodular matrix is a square polynomial matrix with polynomial inverse or, equivalently, a square polynomial matrix with nonzero constant determinant. Moreover, the elements of are called rational matrices. A (strictly) proper rational matrix is a rational matrix whose entries are (strictly) proper rational functions. A biproper matrix is a square proper matrix with proper inverse or, equivalently, a square proper matrix whose determinant is a biproper rational function. The normal rank of a polynomial or rational matrix is the size of its largest nonidentically zero minor and is denoted by . See [19] and [35] for more information on these and other concepts related to polynomial and rational matrices.
As a first step to define local linearizations of rational matrices, we present local notions and results about rational matrices. We denote the point at infinity as
Definition 2.1**.**
Let . Let , and be nonempty.
- (i)
* is defined or bounded at if *
- (ii)
* is defined or bounded at if is defined at *
- (iii)
* is defined or bounded in if for all .*
Notice that a rational matrix being defined at is equivalent to having a Taylor expansion around Moreover, a rational matrix is defined at infinity if and only if is proper.
Definition 2.2**.**
Let . Let , and be nonempty.
- (i)
* is regular or invertible at if it is defined at and *
- (ii)
* is regular or invertible at if is regular at *
- (iii)
* is regular or invertible in if it is regular at each *
A rational matrix is said to be regular if it is regular for some That is, if is square and Note that is regular at if and only if both and have a Taylor expansion around Moreover, biproper matrices are those rational matrices that are regular at infinity, while unimodular matrices are those rational matrices that are regular in .
In regard to the previous definitions, we introduce some equivalence relations defined in the set of rational matrices [3, 4].
Definition 2.3**.**
Let . Let , and be nonempty.
- (i)
* and are equivalent at if there exist rational matrices and both regular at such that *
- (ii)
* and are equivalent at if there exist rational matrices and both regular at such that *
- (iii)
* and are equivalent in if there exist rational matrices and both regular in such that *
Note that if is considered in Definition 2.3, then and are both unimodular, and the standard definition of unimodular equivalence is recovered.
We now introduce the definition of the local Smith–McMillan form of a rational matrix at a point (finite and infinite). The notion of the Smith–McMillan form of a rational matrix was first studied by McMillan in [23, 24] and, then, in other works as [19, 27, 28, 35, 38]. The local Smith–McMillan form is a particular case of the very general (and abstract) result [27, Theorem II.9]. A description valid for rational matrices over the complex field can be found in [34], and a complete and rigorous modern treatment in [4]. Let be any rational matrix of normal rank . Let Then is equivalent at to a matrix of the form
[TABLE]
where are integers. The integers are uniquely determined by and , and are called the invariant orders at of . The matrix in (1) is called the local Smith–McMillan form of at Moreover, is equivalent at to a matrix of the form
[TABLE]
where are integers. These integers are uniquely determined by , and are called the invariant orders at infinity of . The matrix in (2) is called the Smith–McMillan form of at
In order to define zeros and poles we need to distinguish between positive and negative invariant orders [19, 35]. When we say that a rational matrix has as invariant orders at (infinity) we mean that may take values from 0 to and from to . For instance, if all the invariant orders are nonnegative; if, in addition, then they are all positive, but if and they are all 0.
Definition 2.4**.**
Let and . Let be the invariant orders at of . Then is said to be a pole of with partial multiplicities and a zero of with partial multiplicities In particular, the positive integers and are called the pole and zero partial multiplicities of at respectively. Moreover, for are called the pole elementary divisors of at , while for are called the zero elementary divisors of at Finally, the pole (zero) algebraic multiplicity of is the sum of its pole (zero) partial multiplicities, and the pole (zero) geometric multiplicity of is the number of its pole (zero) partial multiplicities.
If is a polynomial matrix then the polynomials with are simply called elementary divisors of at and the nonzero integers are all positive and are called partial multiplicities of at
Definition 2.5**.**
Let . Let be the invariant orders at of . Then is said to be a pole of with partial multiplicities and a zero of with partial multiplicities In particular, the integers and are called the pole and zero partial multiplicities of at respectively.
Some modern references, see for instance [1, 18, 30], also consider (finite) eigenvalues of rational matrices, a concept that is not mentioned at all in classical references of rational matrices. According to these modern references, we introduce the following definition.
Definition 2.6**.**
Let be a rational matrix. A finite eigenvalue of is any such that 555Note that here denotes the normal rank of while is the rank of the constant matrix with That is, is a finite zero of but not a pole.
Observe that if is regular, an eigenvalue of is any such that there exists a nonzero vector satisfying with which is the standard definition of REP (Rational Eigenvalue Problem).
As a consequence of [4, Theorem 2.3] (see [3, Section 2] for more details) we can also present the Smith–McMillan form of a rational matrix in a nonempty subset of , say . Let with normal rank . Then is equivalent in to a matrix of the form
[TABLE]
where, for , are nonzero irreducible rational functions, and are monic (leading coefficient equal to 1) polynomials which are either constants or whose roots are in and while , where stands for divisibility. We refer to (3) as the Smith–McMillan form in of . When we take , we obtain the (finite) Smith–McMillan form of , i.e., the classical Smith–McMillan form of . In this case, if is polynomial then are the invariant polynomials of , and (3) is called the Smith normal form of .
Notice that the Smith–McMillan form of a rational matrix in a nonempty set is invariant under multiplication by regular rational matrices in i.e., under equivalence in Analogously, the Smith–McMillan form at is invariant under multiplication by biproper matrices, i.e., under equivalence at
The next result shows that the equivalence of rational matrices in nonempty sets is a local property.
Proposition 2.7**.**
Let be nonempty. Two rational matrices of the same size are equivalent in if and only if they are equivalent at each
Proof.
If two rational matrices are equivalent in then, by Definitions 2.3 and 2.2, it is straightforward that they are equivalent at each . For the converse, suppose that and are equivalent at each Then, and have the same local Smith–McMillan forms at each In particular, and have the same pole and zero elementary divisors at each Let us consider and as the global Smith–McMillan forms of and respectively. Thus, there exist unimodular matrices for such that , , and we can write
[TABLE]
where are rational functions which are either equal to one or have poles and zeros in while and are rational functions that do not have neither poles nor zeros in Let us define Hence, Therefore, we deduce that and and are equivalent in since the matrices and are regular in ∎
3 Polynomial system matrices minimal in subsets of and at infinity
Polynomial system matrices are a classical tool for studying rational matrices. They were introduced by Rosenbrock and are analyzed in detail in [28]. Among them, minimal polynomial system matrices have been used in many problems dealing with rational matrices because they allow to extract all the information about finite poles and zeros. Recently, they have played a fundamental role in developing a rigorous theory of linearizations and strong linearizations of rational matrices [5]. In this section, we extend the concept of minimal polynomial system matrices from the classical global scenario to a local one. Some of the definitions in this section can also be found in [6] expressed in an abstract algebraic language.
3.1 Polynomial system matrices minimal in subsets of
In this section we introduce polynomial system matrices of rational matrices that are locally minimal, and study their properties. Consider the fact that any rational matrix can be written as
[TABLE]
for some polynomial matrices , and with nonsingular if (see [28]). Then the matrix polynomial
[TABLE]
is called a polynomial system matrix of [28]. That is, is the Schur complement of in . In that case, is called the state matrix of and is the transfer function matrix of If we assume that the matrices and are empty, and is a polynomial matrix. We emphasize that the definition of polynomial system matrix of a rational matrix includes a specific partition. Sometimes in this paper a certain polynomial matrix is partitioned in different ways giving rise to different polynomial system matrices of (possibly) different rational matrices. In such cases, we often use expressions as “ is a polynomial system matrix of with state matrix ” in order to avoid ambiguities, where the words “of ” may be omitted because and determine . In the case mentioned above, we will use “ is a polynomial system matrix with empty state matrix”. We stress that although in (4) the state matrix is in the -block, it might be a different submatrix of . In general, the fundamental property defining a polynomial system matrix is that the rational matrix is the Schur complement of the state matrix.
We remark that the relation between the normal ranks of and its transfer function matrix is
[TABLE]
since we can write as
[TABLE]
Next, we introduce two of the main definitions of this work.
Definition 3.1** (Polynomial system matrix minimal at a point in ).**
Let The polynomial system matrix in (4), with is said to be minimal at if
[TABLE]
Remark 3.2**.**
If is a polynomial system matrix as in (4), with then
[TABLE]
since is nonsingular. Thus is minimal at if and only if is neither an eigenvalue of nor of
Definition 3.3** (Polynomial system matrix minimal in a subset of ).**
Let be nonempty. The polynomial system matrix in (4), with is minimal in if is minimal at each point
Observe that Definitions 3.1 and 3.3 extend to points and subsets of the classical definition of minimal, or with least order, polynomial system matrices introduced in [28]. Rosenbrock’s definition coincides with Definition 3.3 when
Remark 3.4**.**
For convenience, if in (4), we adopt the agreement that is minimal at every point
In the next example, we illustrate Definition 3.3 with a rational matrix and a polynomial system matrix taken from the recent reference [29] dealing with numerical algorithms for solving NLEPs via rational approximation. We advance that we will use the matrices in Example 3.5 several times for illustrating different concepts introduced in this paper as well as for establishing a first connection between the theory developed in this paper and NLEPs. In this respect, we emphasize that [29] does not mention at all polynomial system matrices, and that the same happens with references [18, 30].
Example 3.5**.**
Let be a rational matrix of the form
[TABLE]
with and if . Let us consider the linear polynomial matrix
[TABLE]
These matrices are introduced in [29] to tackle a NLEP in a certain region where the matrix is of the form with and being scalar functions nonlinear in the variable and holomorphic in For solving a NLEP of this form, the nonlinear matrix is approximated in by a rational matrix as in (6), and is considered to linearize It is easy to see that is, in fact, a linear polynomial system matrix of by setting the matrix as state matrix in (4). Moreover, without any assumption, is minimal in In particular, and according to [29], is a subset of Therefore, is minimal in the target set For completeness, notice that a polynomial system matrix as is minimal in if and only if all the matrices are nonsingular. We also emphasize that the form of the rational matrix in (6) is very particular because it is the sum of a linear polynomial matrix and strictly proper rational matrices with linear denominators, which simplifies considerably working with it from different perspectives. We will consider later more complicated examples.
The next result provides the pole and zero elementary divisors of a rational matrix at any finite point from any polynomial system matrix of minimal at This result is the counterpart of [28, Chapter 3, Theorem 4.1] for polynomial system matrices minimal at a finite point instead of polynomial system matrices of least order.
Theorem 3.6**.**
Let Let and let
[TABLE]
be a polynomial system matrix minimal at whose transfer function matrix is Then the elementary divisors of at are the pole elementary divisors of at and the elementary divisors of at are the zero elementary divisors of at
Proof.
Let us consider the Smith normal form of Namely,
[TABLE]
with and unimodular matrices. Observe that is invertible as a rational matrix since We set Since is minimal at has no zeros at Therefore, is regular at Moreover, is a polynomial matrix, as it is equal to has full row rank, and has no zeros in Now, let us consider the Smith normal form of the polynomial matrix Namely,
[TABLE]
with and unimodular matrices. Observe that is invertible as a rational matrix since is invertible and We set Moreover, the matrix is also polynomial, as it is equal to , has full column rank, and has no zeros in Since is minimal at and is regular at has not zeros at Therefore, is regular at Let us define now the polynomial system matrix
[TABLE]
We claim that is a minimal polynomial system matrix in or in the classical sense of Rosenbrock [28]. For that, it remains to prove that the matrix
[TABLE]
has full row rank for all Let us suppose that there exists such that On the one hand, we know that
[TABLE]
since the Smith normal form of is equal to and is unimodular. On the other hand, we have that
[TABLE]
which is a contradiction. Therefore, is a minimal polynomial system matrix. Its transfer function matrix is Then, by [28, Chapter 3, Theorem 4.1], we know that the zero elementary divisors of are the elementary divisors of and that the pole elementary divisors of are the elementary divisors of Finally, the result follows by taking into account that the matrices and are equivalent at and that the matrices and are also equivalent at since and are both regular at that point. ∎
Theorem 3.6 can be extended to any subset of in a natural way, by applying this theorem to every point of that subset.
Theorem 3.7**.**
Let be nonempty. Let and let
[TABLE]
be a polynomial system matrix minimal in whose transfer function matrix is Then the elementary divisors of in are the pole elementary divisors of in and the elementary divisors of in are the zero elementary divisors of in
Example 3.8**.**
If Theorem 3.7 is applied to the matrices and and the set in Example 3.5, we obtain immediately that (without any hypothesis) the eigenvalues of in coincide exactly with the zeros of in , with exactly the same multiplicities (geometric, algebraic and partial). Observe also that all the zeros of in are, in fact, eigenvalues of because the only potential poles of are . This result is stronger than Lemma 3.1 and Corollary 3.2 in [29] from two perspectives: [29] deals with determinants and, so, only gives information on algebraic multiplicities, and the requests in [29] impose the additional hypothesis that is nonsingular. Note that, under the assumption that all the matrices are nonsingular, we obtain that (and ) allows us to obtain the complete information on finite poles and zeros (including all the multiplicities) of in
3.2 Polynomial system matrices minimal at infinity
Theorems 3.6 and 3.7 characterize polynomial system matrices that contain the information of the invariant orders at finite points of their transfer functions. The extension of these results for including the information at infinity is an old problem that has been considered in classical papers as, for instance, in [36, 37]. However, a satisfactory solution has been found, so far, only for polynomial system matrices with state matrix being a linear polynomial matrix and the other blocks being constant matrices. In other cases, recovering the information at infinity requires to embed the polynomial system matrix into a larger matrix. In this section, we propose a new approach for obtaining a counterpart of Theorem 3.6 at infinity. This approach is motivated by the recent work [5], but presents relevant differences with respect to [5], and is based on the use of “reversals” and local equivalences of rational matrices.
In order to develop our counterpart of Theorem 3.6 at infinity, first, we introduce the notion of -reversal of a rational matrix in Definition 3.9, where is any integer. In this definition we will use, for a particular value of the well-known fact that any rational matrix can be uniquely written as
[TABLE]
where is a polynomial matrix and is a strictly proper rational matrix. The equation (8) follows from the Euclidean division for polynomials applied to each entry of The matrices and are called the polynomial part and the strictly proper part of , respectively. A polynomial matrix is said to have degree if is the largest exponent of the variable of its entries with nonzero coefficient. In such a case, is denoted by
Definition 3.9** (-reversal of a rational matrix).**
Let be a rational matrix, and let be an integer. We define the -reversal of as the rational matrix
[TABLE]
Let be expressed as in (8). If whenever is not strictly proper, or if is strictly proper, then the -reversal is called the reversal of and it is often denoted by just
Note that if in (8) is a constant matrix, including the zero matrix, then . Definition 3.9 extends the definition of -reversal for polynomial matrices (see, for instance, [9, Definition 2.12]). However, we emphasize that in the definition of -reversal of a polynomial matrix considered previously in the literature, is always taken larger than or equal to the degree of the polynomial matrix, while in Definition 3.9 we only ask for to be an integer.
Given a polynomial system matrix as in (4), we have that
[TABLE]
where is the degree of is also a polynomial matrix. Moreover, is nonsingular since is nonsingular. Therefore, is also a polynomial system matrix. We now introduce Definition 3.10 about minimality at infinity of a polynomial system matrix.
Definition 3.10** (Polynomial system matrix minimal at infinity).**
The polynomial system matrix in (4) is minimal at if is minimal at
Example 3.11**.**
The polynomial system matrix with transfer function matrix in Example 3.5 is minimal at since
[TABLE]
is, obviously, minimal at
Remark 3.12**.**
A polynomial system matrix as in (4), with and is minimal at if and only if
[TABLE]
More precisely, let and be the matrix coefficients of in and respectively. Then the fact of being minimal at is equivalent to
[TABLE]
Notice that if then is a constant polynomial system matrix, and must be invertible. Therefore, in this case, the rank condition above is automatically satisfied, and is minimal at
Theorem 3.13 is essentially the counterpart of Theorem 3.6 at infinity. We state it in terms of reversals and their elementary divisors at [math] as we only have defined elementary divisors for finite points. The implications of Theorem 3.13 on the structure at infinity are made explicit in Theorem 3.15.
Theorem 3.13**.**
Let and let
[TABLE]
be a polynomial system matrix of degree minimal at whose transfer function matrix is Then the elementary divisors of at [math] are the pole elementary divisors of at and the elementary divisors of at [math] are the zero elementary divisors of at
Proof.
It can be easily proved that the transfer function matrix of is The theorem then follows by applying Theorem 3.6, since is minimal at ∎
Once we have obtained the elementary divisors of the -reversal of a rational matrix at [math], from one of its polynomial system matrices of degree minimal at we can then obtain its invariant orders at infinity as we state in Theorem 3.15. For proving that, we use Lemma 3.14.
Lemma 3.14**.**
Let with and let be an integer. Let be the invariant orders of at and let be the invariant orders at infinity of Then
[TABLE]
Proof.
From the local Smith–McMillan form at infinity of there exist biproper rational matrices and such that
[TABLE]
Let us perform the transformation on the variable of the equation above. Thus,
[TABLE]
By [4, Lemma 6.9], and are regular at We now multiply the previous equation by and we get that for are the invariant orders of at ∎
Theorem 3.15**.**
Let with and let
[TABLE]
be a polynomial system matrix of degree minimal at whose transfer function matrix is Let be the partial multiplicities of at [math] and let be the partial multiplicities of at Then the invariant orders at infinity of are
[TABLE]
Proof.
By Theorem 3.13, we know that and with and are the pole and zero partial multiplicities of at respectively. Thus, the invariant orders of at [math] are Then the use of Lemma 3.14 completes the proof. ∎
Example 3.16**.**
By combining Theorem 3.15 and Example 3.11, we see that in Example 3.5, contains the complete information about the invariant orders at of (without imposing any hypothesis). Note that, in this case, and that the -reversal of the state matrix, i.e., , has no partial multiplicities at [math]. This result on the relationship between the infinite structure of and the reversal of is not mentioned in [29]. In this context, it is worth emphasizing that modern references on NLEPs and their rational approximations do not pay attention to the structure at , while such structure plays an important role in many classic references of linear system theory and control [19, 20, 36, 37].
For polynomial system matrices that are minimal at infinity and, also, at every finite point, we state Definition 3.17 about strong minimality. This definition has already been introduced in [13, Definition 3.3]. However, in [13] the definition is given in terms of eigenvalues instead of minimality at every point, but both definitions are equivalent.
Definition 3.17** (Strongly minimal polynomial system matrix).**
The polynomial system matrix in (4) is strongly minimal if it is minimal at each point of
We emphasize that, as a consequence of Theorems 3.6 and 3.15, strongly minimal polynomial system matrices contain all the information about the invariant orders of their transfer function matrices, both at finite points and at infinity.
4 Local linearizations of rational matrices
In practice, one is often interested in studying the pole and zero structure of rational matrices not in the whole space but in a particular region (see [17, 18, 21, 29]). For instance, this happens when a rational eigenvalue problem arises from approximating a nonlinear eigenvalue problem, since the approximation is usually reliable only in a target region not containing poles. As a consequence, the eigenvalues (those zeros that are not poles) of the approximating rational eigenvalue problem need to be computed only in that region. In this scenario, one can use local linearizations of the corresponding rational matrix which contain the information about the poles and zeros in the target region, but might not in the whole space In addition, they do not satisfy, in general, the conditions of the strong linearizations of rational matrices introduced in [5]. Thereby local linearizations provide extra flexibility in solving nonlinear eigenvalue problems.
In this section, we give separately the definitions of linearizations of rational matrices in subsets of and at infinity, study their properties and establish connections with the linearizations introduced in [5]. These linearizations will be useful in order to study the pole and zero structure of rational matrices in different sets containing or not infinity. In particular, and as an application of these definitions, we will study in Section 6 the structure of the linearizations that appear in [18].
4.1 Linearizations in subsets of
In this subsection we introduce the definition of linearization of a rational matrix in a set not containing infinity and study some of its properties. We start by giving the definition of linearization at a finite point.
Definition 4.1** (Linearization at a point in ).**
Let and let Let
[TABLE]
be a linear polynomial system matrix and let
[TABLE]
be its transfer function matrix. is a linearization of at if the following conditions hold:
- (a)
* is minimal at , and*
- (b)
there exist nonnegative integers satisfying and rational matrices and regular at such that
[TABLE]
Remark 4.2**.**
Notice that, in Definition 4.1, the following two cases are allowed:
. Then we just have to check condition , since condition is satisfied by setting , and . 2. 2.
Then it is not necessary to take into account condition (it is automatically satisfied by the agreement in Remark 3.4) and, therefore, we just have to check condition with .
We remark these extreme cases since they are important in applications, and make Definition 4.1 very general.
We now extend, in a natural way, the notion of linearization at a finite point to linearization in subsets of
Definition 4.3** (Linearization in a subset of ).**
Let and let be nonempty. A linear polynomial system matrix is a linearization of in if is a linearization of at each point
Since linearizations of rational matrices are, in particular, polynomial system matrices, their definition includes a specific partition. Thus, a fixed linear polynomial matrix (also called a matrix pencil) may be partitioned in different ways giving rise to different linearizations of the same or of different rational matrices, or in different subsets. To deal with different partitions, we will use expressions as “ is a linearization of in with state matrix ” when it is necessary for avoiding any ambiguity. The expression “ is a linearization of in with empty state matrix” will cover the case in (10).
In condition (11), one can always take or according to and or and , respectively. This is a consequence of the local Smith–McMillan forms of and being equivalent to each other at . In the rest of the results of this subsection, we will consider and since it corresponds to the most interesting situation in applications.
Remark 4.4**.**
If we have a linearization of in a set then, for each point , there exist rational matrices and regular at such that In principle, for different values of the rational matrices (respectively, ) may be different from each other, that is, (resp., ) depends on However, Proposition 2.7 implies that the existence of and for each is equivalent to the existence of two rational matrices and both regular in (and independent of ) such that .
Remark 4.5**.**
When in Definition 4.3, condition (11) is satisfied with and unimodular matrices. Therefore, a linearization in or at every point of is a linearization in the sense of [5, Definition 3.2] and vice versa.
The next result gives the relation between the invariant orders at a finite point of a rational matrix and those of a rational matrix of the form with . It is motivated by equation (11).
Lemma 4.6**.**
Let , and let be the invariant orders of at Consider with Then the invariant orders of at are where for and for
Proof.
Let be the local Smith–McMillan form of at Then, for some rational matrices and regular at Moreover, Therefore, since the matrices and are regular at the local Smith–McMillan form of at is up to a permutation. ∎
Corollary 4.7 and Theorem 4.8 follow from Lemma 4.6. These results state the spectral information that one can obtain from local linearizations. More precisely, Theorem 4.8 is a spectral characterization of local linearizations in the spirit of [5, Theorem 3.10].
Corollary 4.7**.**
Let , and let
[TABLE]
be a linear polynomial system matrix minimal at Let be the transfer function matrix of Then is a linearization of at if and only if the following two conditions hold:
- (a)
, and 2. (b)
* and have exactly the same pole and zero elementary divisors at *
Proof.
If is a linearization of at then (a) and (b) are satisfied by Lemma 4.6, since and are equivalent at For the converse, suppose that are the invariant orders of at From (a) and (b), the Smith–McMillan form at of must be . Observe that this is also the Smith–McMillan form at of , as proved in the previous lemma. Thus, and are equivalent at . ∎
Theorem 4.8** (Spectral characterization of linearizations at a point in ).**
Let , and let
[TABLE]
be a linear polynomial system matrix minimal at Then is a linearization of at if and only if the following three conditions hold:
- (a)
, 2. (b)
the pole elementary divisors of at are the elementary divisors of at and 3. (c)
the zero elementary divisors of at are the elementary divisors of at
Proof.
Let be the transfer function matrix of . By (5), . Moreover, by Theorem 3.6, the pole elementary divisors of at are the elementary divisors of at , and the zero elementary divisors of at are the elementary divisors of at . The result follows from Corollary 4.7. ∎
It is immediate to obtain counterparts of Corollary 4.7 and Theorem 4.8 for linear polynomial system matrices minimal in sets and for linearizations in We omit to state such results for brevity.
The following proposition is a straightforward consequence of the definition of linearization in a subset of by taking and i.e., it corresponds to case 1 in Remark 4.2. However, we emphasize this result since it gives a sufficient condition that is easy to verify in order to ensure that a linear polynomial system matrix is a linearization of a rational matrix.
Proposition 4.9**.**
Let be nonempty. Let
[TABLE]
be a linear polynomial system matrix and let be its transfer function matrix. If is minimal in then is a linearization of in
In plain words, any linear polynomial system matrix is a linearization of its transfer function matrix in the sets where is minimal.
Example 4.10**.**
Consider the matrices and and the set in Example 3.5, that were originally introduced in [29]. By combining the discussion in Example 3.5 with Proposition 4.9, we immediately obtain that is a linearization of in With a bit more effort, it is also easy to obtain the following stronger result: is a linearization of in where
As mentioned in Example 3.5, the form of the rational matrix in (6) is very particular since its polynomial part and the denominators in the strictly proper part are linear. Thus, we finish this section by discussing in Example 4.11 a rational matrix with non linear polynomial part and with a general state space realization of the strictly proper part. For such general representation of rational matrices, an influential companion-like pencil associated to it was introduced in [30]. We will analyze this pencil from three different perspectives.
Example 4.11**.**
It is well-known that any rational matrix can be written in the form:
[TABLE]
By assuming with from the expression above we define the pencil
[TABLE]
This pencil was introduced in [30] for regular rational matrices and is a particular case of the pencils considered in [5, Theorem 5.11] (modulo some permutations). In fact, [5, Theorem 5.11] proves that if is considered as a polynomial system matrix with state matrix and is minimal in then is a strong linearization of in the sense of [5, Definition 3.4] (we will revise this in subsection 4.23). Thus, under these conditions, contains all the information about the poles and zeros of
We now consider from other two points of view different from the one in [5]. They will correspond to the two extreme cases described in Remark 4.2. First, we consider the following regular submatrix of obtained by removing the first block row and the penultimate block column:
[TABLE]
and we see as a polynomial system matrix with state matrix That is, once the state matrix is chosen, the other matrices in (4) are and With such partition of it is easy to see that the transfer function matrix of is precisely i.e., For that, just take into account that the two last block columns of are and . Then, Proposition 4.9 guarantees that, without any extra hypothesis, is a linearization of in where With a bit more effort, it is also easy to see that if for all then is minimal in and, thus, is a linearization of in with state matrix in (15). Observe that, if we do not impose any hypothesis of minimality in and is just a linearization in then we can not guarantee that has any information about the poles of since they are necessarily contained in Moreover, the set might contain eigenvalues of This is not a problem in REPs coming from approximating NLEPs [18, 21, 29] because, in such cases, the target set is outside However, it is in classical applications of rational matrices [19].
The second point of view is to consider as a linearization of in with empty state matrix. To this purpose, we define the following rational matrices regular at :
[TABLE]
Then, we check that which means that and are equivalent in and, so, that is a linearization of in with empty state matrix (recall Remark 4.2(2)).
The two approaches described in Example 4.11 for viewing in (14) as a linearization of in can be extended with more effort to many other of the pencils described in [5, Theorem 5.11]. We postpone these developments to future research to keep this paper concise.
4.2 Linearizations at infinity and in sets containing infinity
Our definition of linearization of a rational matrix at infinity is based on the notion of -reversal of a rational matrix introduced in Definition 3.9.
Definition 4.12** (Linearization at infinity of grade ).**
Let Let
[TABLE]
be a linear polynomial system matrix and let
[TABLE]
be its transfer function matrix. Let be an integer. is a linearization of at of grade if the following conditions hold:
- (a)
* is minimal at and*
- (b)
there exist nonnegative integers with and rational matrices and regular at [math] such that
[TABLE]
where
Observe that Definition 4.12 allows, for completeness, the possibility of being equal to We admit that this case has a very limited interest in applications, since it corresponds to and being constant matrices. However, it includes linearizations at of rational matrices such that, for some integer has all its invariant orders at zero equal to zero. Moreover, notice that, in any case, is also a linear polynomial system matrix since is nonsingular. We then have the following characterization of linearizations at infinity.
Proposition 4.13**.**
A linear polynomial system matrix as in (30) is a linearization of a rational matrix at of grade if and only if is a linearization of at
Proof.
The proposition follows from the fact that with is the transfer function matrix of Then we make use of Definition 4.1. ∎
Conditions and in Definition 4.12 can be stated in a different way as we show in Remarks 4.14 and 4.15, respectively.
Remark 4.14**.**
As a particular case of what is discussed in Remark 3.12, condition in Definition 4.12 is equivalent to
[TABLE]
if is nonconstant, i.e., if If is constant, i.e., condition is automatically satisfied since is a polynomial system matrix and, therefore, is invertible.
Remark 4.15**.**
By [4, Lemma 6.9], a rational matrix is regular at [math] if and only if is biproper. Therefore, condition in Definition 4.12 is equivalent to the matrices and being equivalent at infinity according to Definition 2.3. More precisely, a linear polynomial system matrix as in (30) is a linearization of a rational matrix at of grade if and only if
- (a)
is minimal at and
- (b)
there exist nonnegative integers with and biproper matrices and such that
[TABLE]
We state in Theorem 4.16 a characterization of linearizations at infinity analogous to the one in Theorem 4.8 for linearizations at finite points. In this characterization, we consider the most usual situation and assuming and The proof of Theorem 4.16 is omitted since it follows immediately from Theorem 4.8 and Proposition 4.13.
Theorem 4.16** (Spectral characterization of linearizations at infinity).**
Let and let
[TABLE]
be a linear polynomial system matrix such that is minimal at [math] and let Then is a linearization of at of grade if and only if the following three conditions hold:
- (a)
, 2. (b)
the pole elementary divisors of at [math] are the elementary divisors of at and 3. (c)
the zero elementary divisors of at [math] are the elementary divisors of at
Next, we study in Proposition 4.17 how to recover the invariant orders at infinity of rational matrices from linearizations at infinity of grade
Proposition 4.17**.**
Let with and let
[TABLE]
be a linearization at infinity of grade of with Let be the partial multiplicities of at [math], and let be the partial multiplicities of at [math]. Then the invariant orders at infinity of are
[TABLE]
Proof.
This proof is analogous to the one for Theorem 3.15. It follows just from combining Theorem 4.16 and Lemma 3.14. ∎
The following result is the counterpart of Proposition 4.9 but for linearizations at infinity. It shows when a linear polynomial system matrix is a linearization at infinity of its transfer function matrix. The proof is immediate and, therefore, omitted.
Proposition 4.18**.**
Let
[TABLE]
be a linear polynomial system matrix and let be its transfer function matrix. Then the following statements hold:
- (a)
If then is a linearization of at of grade
- (b)
If is constant then is a linearization of at of grade
Example 4.19**.**
Consider the matrices in Example 3.5. By Proposition 4.18, the linear polynomial system matrix is a linearization of at of grade
Example 4.20**.**
Consider the matrices in Example 4.11. Let us view as a polynomial system matrix with state matrix in (15). With such partition, is the transfer function matrix of Then, by Proposition 4.18, is a linearization of at of grade if has full column rank. However, the condition is very restrictive, since it implies also Moreover, the structure of at is, in such a case, trivial because it is very easy to see that the invariant orders at of are all equal to This is consistent with Proposition 4.17, because if then has full column rank and, thus, does not have partial multiplicities at zero. Moreover, as is the pencil in (15), then it is easy to see that has partial multiplicities at zero all equal to
Observe that, if we consider in (15) as state matrix of is minimal at [math] if and only if Thus, this hypothesis can not be avoided under such choice of state matrix. However, it is important to emphasize that if is viewed as a polynomial system matrix with empty state matrix then is a linearization of at of grade without imposing any hypothesis. We postpone the proof of this result to Example 5.12.
A linear polynomial system matrix that satisfies Definition 4.3 in and Definition 4.12, for a certain grade allows us to recover the complete information about the poles and zeros of the corresponding rational matrix, finite and at infinity. This is due to Theorem 4.8 and Proposition 4.17. This important case leads us to introduce the following definition.
Definition 4.21** (-strong linearization).**
Let and let be an integer. A linear polynomial system matrix is said to be a strong linearization of grade , or a -strong linearization, of if is a linearization of in and also at of grade
Example 4.22**.**
Consider again the matrices in Example 3.5. Then the linear polynomial system matrix is a -strong linearization of if and only if all the matrices are nonsingular.
4.3 Comparison with another definition of strong linearization
Recently, another definition of “strong linearization” of a rational matrix has been presented in [5, Definition 3.4]. In contrast to Definition 4.21, that definition does not make any reference to a “grade ”, but the linear polynomial system matrices satisfying [5, Definition 3.4] also allow us to recover the information about poles and zeros of including those at infinity. Therefore, it is convenient to establish a relation between [5, Definition 3.4] and Definition 4.21. This is the purpose of Proposition 4.23. Before stating and proving that proposition, we introduce some comments. Let us consider a linear polynomial system matrix
[TABLE]
with transfer function matrix and let be a rational matrix written as in (8). We recall that, according to [5, Remark 3.5], is a strong linearization of in the sense of [5, Definition 3.4] if the following statements hold:
- (a)
is a linearization of in and,
- (b)
is invertible if and there exist integers and rational matrices and regular at [math] such that
[TABLE]
As we stated in Remark 4.5, condition is equivalent to being a linearization of in the sense of [5, Definition 3.2]. For condition notice that, if and , in Definition 4.21 we do not require to be invertible but according to Remark 4.14. Observe, in addition, that these rank conditions are satisfied if is invertible. Moreover, in contrast to (34), in (31) we consider instead of where , and instead of for an integer In this way, Definition 4.12 looks for to be a linearization at [math] of the -reversal of , because the transfer function matrix of is Note that, is the transfer function matrix of if and only if is not strictly proper and the degree of the polynomial part of is equal to the degree of . Thus, condition (34) is different from (31) in some cases. Nevertheless, as we will see in Proposition 4.23, in most cases strong linearizations of in the sense of [5, Definition 3.4] are -strong linearizations of of a certain grade
Proposition 4.23**.**
Let and let
[TABLE]
be a strong linearization of according to [5, Definition 3.4]. Let be expressed uniquely as in (8), and let if is not strictly proper and otherwise. Then the following statements hold:
- (a)
If or then is a -strong linearization of .
- (b)
If and then is a -strong linearization of .
- (c)
If and or then is not a -strong linearization of for any integer
Proof.
We remark that this proof is somewhat technical and that can be skipped without affecting the understanding of the rest of the paper. We will use throughout the proof that without mentioning it explicitly. Let be a strong linearization of in the sense of [5, Definition 3.4] and let be the transfer function matrix of . Then is a linearization of in Moreover, if is invertible, which implies Then, it only remains to study the different cases that may occur in condition (34) in order to satisfy (31), that is, in order to be a -strong linearization of for some integer
We consider first the trivial case In this case, is a polynomial matrix and Therefore, where Thus, satisfies (31) with , and it is a -strong linearization of
In the rest of the proof, we assume , which implies . In this case, can be written as where is a proper rational matrix. Therefore, where if and otherwise. Then, we have two different cases. If then and, therefore, is a -strong linearization of If then and there are two different sub-cases:
and that is, and have the same size. So, in (34), we have Then the invariant orders at [math] of are equal to those of which implies that the invariant orders at [math] of are also equal to those of Multiplication by of and yields that and have the same invariant orders at The same happens with and Thus, there exist and rational matrices regular at [math] such that which proves according to (31) that is a -strong linearization of
- 2.
or that is, and have different sizes and In this case, there does not exist any integer such that the invariant orders at [math] of are equal to the invariant orders at [math] of As a consequence, is not a -strong linearization of for any grade since (31) is never satisfied. In order to prove this, note that by (34), and the invariant orders at zero of are equal to those of Moreover, is proper since Therefore, all the invariant orders at [math] of are nonnegative. So, the same happens to Then, has invariant orders at [math] equal to zero, and its remaining invariant orders at [math] are positive numbers. In contrast, if then has invariant orders at [math] equal to zero, and its remaining invariant orders at [math] are positive numbers. If notice that the largest invariant order at [math] of is less than the largest of
∎
We emphasize that, as far as we know, no explicit examples of strong linearizations in the sense of [5] with or and have been constructed so far in the literature. Thus, in plain words, Proposition 4.23 states that strong linearizations according to [5] are particular cases of -strong linearizations according to Definition 4.21, except for a very particular instance.
In the rest of this section, we first revise important examples of strong linearizations in [5] from the perspective of Definition 4.21. Then, in Example 4.26, we provide a -strong linearization that is not a strong linearization in the sense of [5]. This example illustrates that the local approach followed in this paper yields, apart from the flexibility of constructing local linearizations, a concept of “global” strong linearization, more general than that of [5].
Example 4.24**.**
Let be a rational matrix written as in (8), i.e., with Then, the strong block minimal bases linearizations constructed in [5, Theorem 5.11] are -strong linearizations of according to Definition 4.21. Note that, with the notation in Proposition 4.23, these linearizations satisfy since and
Example 4.25**.**
Let us now consider a rational matrix written as in (8) with or and let be a minimal state-space realization of Then, the following strong linearization
[TABLE]
is considered in [5] (paragraph below equation ). In this case, with the notation in Proposition 4.23, we have and Then if or if In any case, is a -strong linearization by Proposition 4.23. Observe that in this example is the transfer function of Thus, the fact that is a -strong linearization can also be obtained directly from Propositions 4.9 and 4.18.
Finally, we discuss the announced example of a linear polynomial system matrix that is a strong linearization in the sense of Definition 4.21 but not in the sense of [5, Definition 3.4].
Example 4.26**.**
Let us consider the rational matrix
[TABLE]
It can be easily proved that
[TABLE]
is a linear polynomial system matrix of Moreover, note that is minimal for all Therefore, by Proposition 4.9, is a linearization of in By Proposition 4.18, is also a linearization of at of grade since Thus, is a -strong linearization of according to Definition 4.21. However, is not a strong linearization according to [5, Definition 3.4] since is singular. Nevertheless, we can recover easily the invariant orders at from by applying Proposition 4.17 with For this purpose, note that does not have elementary divisors at since is regular at Moreover, the only elementary divisor at [math] of is Therefore, the invariant orders at infinity of are and by Proposition 4.17. The invariant orders of at any finite point can be recovered from by using Theorem 4.8. It is worthwhile to emphasize that the grade of as a strong linearization of is different from the degree of the polynomial part of Observe that this also happens in Example 4.25 when is a constant matrix.
5 Block full rank pencils
In this section, we introduce a wide family of pencils that give us the information about the zeros of rational matrices locally. More precisely, they are linearizations with empty state matrix of rational matrices in some subsets of , as well as at under some conditions. These pencils will be called block full rank pencils, since they generalize the block minimal bases pencils introduced in [10, Definition 3.1]. The definition of block full rank pencils is motivated by the fact that most of the linearizations for rational approximations of NLEPs that have been constructed so far are pencils of this type. The key results in this section are Theorems 5.4 and 5.8, which will be applied in the following section to establish rigorously and very easily the properties of the linearizations used in [18]. Note that, according to Theorem 4.8, the results in this section are not useful for studying, or computing, the finite poles of rational matrices because the considered linearizations have empty state matrix. This may be a drawback in certain situations, but we emphasize again that it is not in the development of algorithms for solving large-scale NLEPs via rational approximations [17, 18, 21, 29]. This is due to the fact that, in those cases, the poles of the rational matrix are known, since they are chosen for constructing the approximation, and/or are located outside the target set.
The theory we develop for block full rank pencils is based on the results for block minimal bases pencils from [10]. It is also possible to develop directly such theory at the cost of proving some preliminary lemmas. However, we think that our approach has the advantages of establishing a connection between both families of pencils and of emphasizing the relevance of these families in the study of rational and polynomial matrices.
Next, we present a few definitions and results from [10] for making easier the reading of this section. The notion of (strong) block minimal bases pencil is recalled in Definition 5.1. It relies on the concept of minimal bases of rational subspaces, which are certain polynomial bases of such subspaces defined in [14, 19]. As in [10], we will say for brevity that a polynomial matrix (with ) is a minimal basis if its rows form a minimal basis of the rational subspace they span. One of the most useful characterizations of minimal bases (see [14, Main Theorem] or [10, Theorem 2.2]) is that is a minimal basis if and only if has full row rank for all and is row reduced, i.e., its highest row degree coefficient matrix has full row rank (see [10, Definition 2.1]). Moreover, a minimal basis is said to be dual to if and [10, Definition 2.5].
Definition 5.1**.**
[10, Definition 3.1]* ((Strong) block minimal bases pencil). A block minimal bases pencil is a linear polynomial matrix over with the following structure*
[TABLE]
where and are both minimal bases. In addition, if (respectively ) is a minimal basis with all its row degrees equal to and with the row degrees of a minimal basis (respectively ) dual to (respectively ) all equal, then is called a strong block minimal bases pencil. Moreover, given a polynomial matrix it is said that is associated with if
Theorem 3.3 in [10] uses the standard definitions of linearizations and strong linearizations of polynomial matrices (see, for instance, [9]) to prove the most important property of a (strong) block minimal bases pencil as in (36), namely, is a (strong) linearization of the polynomial matrix for any and minimal bases dual to and , respectively. In the strong case, this result considers as a polynomial matrix with grade . We can state [10, Theorem 3.3] in the language of this paper through Definitions 4.3 and 4.21 as “a block minimal bases pencil is a linearization of in with empty state matrix and a strong block minimal bases pencil is a -strong linearization of with empty state matrix”. In order to see this, recall that the “empty state matrix” condition implies that the minimality condition is automatically satisfied (see Remarks 3.4 and 4.2) and that in the definitions cited above.
Next, we relax to a minimum the conditions on and in (36) for defining a wider family of pencils that includes block minimal bases pencils as a particular case.
Definition 5.2**.**
(Block full rank pencil)* A block full rank pencil is a linear polynomial matrix over with the following structure*
[TABLE]
where and are pencils with full row normal rank.
Note that Definition 5.2 includes the cases when or are empty matrices, that is, when has only one block row or only one block column, respectively.
We introduce some auxiliary concepts and results before establishing the most important properties of block full rank pencils in Theorems 5.4 and 5.8. We will say that a rational matrix has full row rank in if, for all , , i.e., is defined or bounded at , and . Observe that this implies that has no poles in . The following lemma connects rational matrices with full row rank in with minimal bases, and establishes other properties that will be used later.
Lemma 5.3**.**
Let be a rational matrix with full row normal rank and let be a minimal basis of the row space of . Then the following statements hold:
- (a)
There exists a unique regular rational matrix such that . 2. (b)
* has full row rank in if and only if in (a) is regular in .* 3. (c)
* is a polynomial matrix if and only if in (a) is a polynomial matrix.* 4. (d)
If is a matrix pencil, then in (a) and are both matrix pencils.
Proof.
Part (a). Each row of is uniquely defined because its entries are the unique rational coefficients that allow us to express the corresponding row of as a unique linear combination of the rows of . Moreover, must be regular since, otherwise, there would exist a nonzero vector such that . So, , which contradicts that .
Part (b). It is obvious that if is regular in , then has full row rank in , because is defined in , as is a polynomial matrix, and has full row rank in , since is a minimal basis. The proof of the converse implication starts by proving that if has full row rank in , then is defined in . To see this, note that the Smith form of is , because is a minimal basis and, therefore, does not have finite zeros. Thus, there exist unimodular matrices and such that , and . This shows that is defined in , because and are both defined in ( by hypothesis and because is unimodular and so a polynomial matrix). Therefore, is defined in . This implies that we can write for each , which in turns implies that is invertible because has full row rank.
Part (c). It follows directly from [14, Main Theorem, part 4].
Part (d). From [14, Main Theorem, part 4], we have that
[TABLE]
where denotes the th row of and the maximum is taken over the nonzero entries of . Since all the rows of are different from zero, (38) implies that for each nonzero entry of . Moreover, each column of has at least one nonzero entry, because is regular, which, combined with (38), implies that , for each . ∎
The last concepts we need before stating and proving the main Theorem 5.4 are those of rational basis and dual rational bases. A rational matrix (with ) is said to be a rational basis if it is a basis of the rational subspace spanned by its rows, i.e., if it has full row normal rank. Two rational bases and are said to be dual if , and . We are finally ready for presenting the main result of this section.
Theorem 5.4**.**
Let be a block full rank pencil as in (37) and let and be any rational bases dual to and , respectively. Let be nonempty. If and have full row rank in , for then is a linearization with empty state matrix of the rational matrix in .
Proof.
In order to simplify the notation, throughout this proof we do not specify the sizes of different identity matrices and all of them are denoted by . Let and be minimal bases of the row spaces of , , and , respectively. Then, Lemma 5.3 implies that there exist regular rational matrices , , and such that
[TABLE]
Moreover, and are all matrix pencils. Then, can be factorized as follows,
[TABLE]
where the first and third factors are regular in . Note that the factor in the middle is a block minimal bases pencil associated with the polynomial matrix , since the regularity of and implies that and are dual minimal bases for . Then, there exist unimodular matrices and such that
[TABLE]
where and are regular in . From combining (39) and (51), we obtain that and are equivalent in . This proves that is a linearization with empty state matrix of in according to Definitions 4.1 and 4.3, since the minimality condition is automatically satisfied if the state matrix is empty. ∎
Remark 5.5**.**
In the scenario of Theorem 5.4, Theorem 4.8 guarantees that the elementary divisors of in coincide exactly with the zero elementary divisors of in . Moreover, it is clear from the expression that does not have poles in since the matrices must be defined in but they are not defined at the poles of . Thus, has only eigenvalues in , and all the information about them, i.e., geometric, algebraic and partial multiplicities, is contained in .**
Remark 5.6**.**
If in Theorem 5.4, (resp. ) is an empty matrix, we can take any rational matrix (resp. ) regular in , where (resp. ) is the number of colums (resp. rows) of . The standard choices are and .
Remark 5.7**.**
Under the conditions of Theorem 5.4, we will say for brevity that “ is a block full rank pencil associated with in ”. We emphasize that this “association” is not one-to-one because there are infinitely many rational bases and dual to and .
Next, we present sufficient conditions for a block full rank pencil to be a linearization of at of a certain grade . In order to avoid cases with limited interest in applications that complicate the statement, in Theorem 5.8 we assume .
Theorem 5.8**.**
Let be a block full rank pencil as in (37) with and let and be rational bases dual to and , respectively. If, for , has full row rank at zero, and there exists an integer number such that has full row rank at zero, then is a linearization with empty state matrix of the rational matrix at of grade .
Proof.
Note that
[TABLE]
is a block full rank pencil. Moreover, for , has full row normal rank, and implies . Therefore, is a rational basis dual to . Then, Theorem 5.4 applied to proves that is a linearization with empty state matrix at zero of
[TABLE]
which combined with Proposition 4.13 proves the result. ∎
As a consequence of Theorems 5.4 and 5.8, we obtain Corollary 5.9. Although it follows immediately from them, we state it since it generalizes the structure of most of the linearizations of rational approximations of NLEPs that appear in the literature. Moreover, it is very useful in order to characterize easily some pencils as linearizations of rational matrices locally and to obtain the information about the zeros of such rational matrices in subsets not containing poles.
Corollary 5.9**.**
Let
[TABLE]
be a rational matrix written in terms of some matrix pencils and rational matrices . Define
[TABLE]
and assume that has full row normal rank. Let L(\lambda)=\left[\begin{array}[]{c}M(\lambda)\\ \hdashline[2pt/2pt]K_{1}(\lambda)\end{array}\right] be a block full rank pencil of degree with only one block column and such that and are dual rational bases. Let be nonempty. Then the following statements hold:
- (a)
If and have full row rank in then is a linearization with empty state matrix of in
- (b)
If has full row rank at and there exists an integer such that has full row rank at [math], then is a linearization with empty state matrix of at of grade
Remark 5.10**.**
We emphasize that in some relevant applications the rational matrices of Corollary 5.9 are just of the form where are scalar rational functions, and/or most of the pencils are constant matrices or a linear scalar function times a constant matrix. Moreover, in some other applications a low rank structure is present in , that is, some of the terms in have a rank much smaller than , and the corresponding rational matrices are written in the form , where is a constant matrix with
In the next two examples, we revisit the pencils introduced in Examples 3.5 and 4.11 from the perspective of the block full rank pencils. These examples illustrate how the theory of block full rank pencils may simplify the analysis of the properties of important linearizations of rational matrices when one is not interested on the information about the poles.
Example 5.11**.**
Let us consider the rational matrix and the pencil in Example 3.5. We partition as follows:
[TABLE]
Observe that, in the above partition, we are considering a permuted version of the structure of the pencil in Corollary 5.9. Note now that has full row rank in , and
[TABLE]
is a rational basis dual to with full row rank in . Then, by Corollary 5.9, is a linearization with empty state matrix of in Moreover, note that and \operatorname{rev}_{0}N_{1}(\lambda)=\left[\begin{array}[]{ccccc}\frac{\lambda}{\lambda\sigma_{1}-1}I&\frac{\lambda}{\lambda\sigma_{2}-1}I&\ldots&\frac{\lambda}{\lambda\sigma_{s}-1}I&I\end{array}\right] both have full row rank at [math]. Thus, by Corollary 5.9, is a linearization with empty state matrix of at of grade
Example 5.12**.**
Let us consider the rational matrix and the pencil in Example 4.11. We now consider the following partition of :
[TABLE]
Since has full row normal rank, has the structure of the block full rank pencil in Corollary 5.9. Observe that
[TABLE]
is a rational basis dual to and that and have both full row rank in Thus, Corollary 5.9 implies that is a linearization with empty state matrix of in This example, together with Example 4.11, illustrates a very important fact that we have already mentioned: the same pencil can be viewed as a linearization with different state matrices. Moreover, different views may require different conditions, may lead to different sets where the pencil is a linearization, and may differ in the difficulty to get the results. For instant, when the developments in this example are compared with the direct application of the definition of linearization presented in the second approach in Example 4.11 through the matrices and in (22) and (29), respectively, we can conclude that the “block full rank pencil” view leads to the same results in a much simpler way. We have experimented the simplicity of the “block full rank pencil” approach in many other examples.
Finally, note that the pencil in (52) satisfies that has full row rank at [math] and that in (53) satisfies that has also full row rank at [math]. Thus, Corollary 5.9 implies that is a linearization with empty state matrix of at of grade . By comparing this result with the result in Example 4.20, we see that considering as a block full rank pencil leads to much stronger results on the structure at infinity than considering as a polynomial system matrix with state matrix in (15). In the former case, we do not need any extra hypothesis in order to be a linearization at infinity, while in the latter the condition is needed.
As previously announced, the results in this section will be used in Section 6. In addition, in a future work [12], we will extend them. More precisely, we will define block full rank linearizations of rational matrices with non empty state matrix that, therefore, will contain information about the poles. Moreover, we will apply these results to establish rigorously and very easily the properties of the linearizations introduced in [21].
6 Application of the local linearization theory to NLEIGS pencils
In this section we study in depth the pencils introduced in the influential reference [18]. This reference presents one of the first systematic approaches for solving large scale NLEPs. The approach in [18] consists essentially of three steps: (1) the matrix defining the NLEP is approximated by a rational matrix via Hermite’s interpolation in a certain compact target set where the eigenvalues of interest are located; (2) the obtained rational matrix is linearized by using a certain pencil ; and (3) a highly structured rational Krylov method is applied to the pencil to compute the eigenvalues of in . For brevity of exposition, and also for recognizing the key contribution of [18], we will call NLEIGS pencils to the pencils introduced in this reference. The main goal of this section is to replace the vague usage of the word “linearization” in [18] by a number of rigorous results on NLEIGS pencils which, combined with the results in Sections 4 and 5, establish the precise properties enjoyed with respect to eigenvalues (and poles) of the NLEIGS pencils. We remark that NLEIGS pencils were the initial motivation for developing the results of this paper, since is not a linearization of the rational matrix , according to the definitions of linearization and strong linearization presented in [5] or [1].
Since we are interested in rational matrices and their linearizations, all the delicate details about how the rational matrices are constructed as approximations of the original NLEPs are omitted. Such details can be found in [18]. As in the rest of the paper, the results in this section are valid and are stated in any algebraically closed field that does not include infinity. Note, however, that reference [18] considers only the complex field and that this restriction is important in the approximation phase of the NLEP. Moreover, although [18] deals with regular rational matrices , we will not impose such condition initially in our developments.
Reference [18] uses two families of rational matrices, and corresponding pencils, depending on whether or not a certain low rank structure is present in the original NLEP. We will refer to them as the NLEIGS basic problem and the NLEIGS low rank structured problem, respectively. The NLEIGS pencils corresponding to each of these two cases will be studied from two perspectives giving rise to the four subsections included in this section. These two perspectives are considering NLEIGS pencils as block full rank pencils and, thus, as linearizations with empty state matrices, and considering them as polynomial system matrices with transfer function matrices equivalent to everywhere except at a point . Both perspectives allow us to state in a rigorous sense that NLEIGS pencils are linearizations of , but the one based on block full rank pencils is much simpler, does not require any hypothesis and covers fully the applications of interest in [18]. In contrast, the polynomial system matrix perspective provides more information on but at the cost of extra hypotheses which are not imposed in [18] and that require considerable effort to check.
6.1 The NLEIGS basic problem from the point of view of block full rank pencils
The families of rational matrices considered in [18] are defined in terms of the following parameters: a list of nodes in , a list of nonzero poles in , and a list of nonzero scaling parameters in . It is important to bear in mind that [18] assumes that the poles are all distinct from the nodes. However, we do not assume such property, except in a few results where it will be explicitly stated. With these parameters, the following sequence of rational scalar functions is defined:
[TABLE]
Let us now define the linear scalar functions
[TABLE]
for , and Then, the rational functions satisfy the simple recursion
[TABLE]
which will be useful in the sequel. Note that the rational functions could not be proper, since for any infinite pole the corresponding factor is just equal to , and, therefore, has a nonconstant polynomial part.
With all this information, we are in the position of introducing the first family of rational matrices considered in [18], whose elements are defined as
[TABLE]
where are constant matrices.
In this section, the nodes , the poles , the scaling parameters and the matrices are arbitrary parameters that allow us to define the considered family of rational matrices. However, in [18] these parameters are carefully chosen in such a way that approximates satisfactorily the matrix defining the NLEP to be solved in the target set containing the desired eigenvalues of the NLEP. In this scenario, it is important to stress that the poles are always chosen outside the region of interest [18, p. A2852], which implies that all the zeros of located in are eigenvalues of . Thus, the REP associated with is an explicit example of a problem with a property that has been mentioned before in this paper, i.e., the poles are known and located outside the region of interest and, then, it is not needed to compute them. Note, however, the following subtlety: though it is clear that the finite poles of are included in the list , it is easy to construct examples of matrices as in (56) for which some of the finite numbers in are not poles due to some cancellations. Thus, all the finite numbers in are not necessarily finite poles of and, even more, the partial multiplicities of such poles are not immediately visible from (56). Despite these comments, we will call the numbers poles, following the usage in [18].
In order to solve the REP , the authors of [18] solve the generalized eigenvalue problem corresponding to the pencil
[TABLE]
where
[TABLE]
and
[TABLE]
In [18] the use of for solving the REP associated to is supported by [18, Theorem 3.2], which states that is a strong linearization of the rational matrix without specifying the exact meaning of “strong linearization” in this rational context. Moreover, the proof of [18, Theorem 3.2] consists of a reference to [2, Theorem 3.1], which is a paper dealing with strong linearizations of polynomial matrices in the classical sense of [16]. However, as a consequence of the results in Section 5, it is very easy to prove that is always a linearization of in a set including the region of interest in [18], as well as at infinity. This is proved in Theorem 6.1, where the nomenclature introduced in Remark 5.7 is used.
Theorem 6.1**.**
Let be the rational matrix in (56) and be the pencil in (57). Let and be, respectively, the set of finite poles and the number of infinite poles in the list . Then, the following statements hold:
- (a)
* partitioned as in (57) is a block full rank pencil with only one block column associated with in .* 2. (b)
* is a linearization with empty state matrix of in .* 3. (c)
* is a linearization with empty state matrix of at of grade .*
Proof.
It is immediate to check that
[TABLE]
is a rational basis dual to . Note also that and have both full row rank in . In addition, an easy direct computation proves . Thus, parts (a) and (b) follow from Theorem 5.4. Observe that parts (a) and (b) can also be proved from Corollary 5.9, since the structures of , and are particular cases of those described in that corollary.
In order to prove part (c), note first that has full row rank at zero. We now consider the rational matrix , which is of the form
[TABLE]
where the entries are defined at Denote by the number of infinite poles in the list . Then, for a certain rational function with Thus, we obtain that has full row rank at taking into account that if and if Then, part (c) follows from Theorem 5.8. ∎
Combining Theorems 6.1 and 4.8, we get that contains all the information about the finite eigenvalues of in , including all type of multiplicities (algebraic, geometric and partial). Moreover, Proposition 4.17 allows us to recover the complete pole-zero structure of at from the eigenvalue structure at [math] of , just by noting that, in this case, in Proposition 4.17 since we are taking an empty state matrix. We stress that all these results hold for any rational matrix either regular or singular. However, no information is provided on the finite poles of and some of them could also be zeros. As explained above, this is not an issue in [18], since is outside the target set . Nevertheless, at the cost of imposing extra hypotheses, we will solve this problem in Section 6.3 for completeness and also because it is of interest for the theory of REPs.
Remark 6.2**.**
Let and be the denominators of and in (54), respectively. Then, under the hypothesis , , , is a strong block minimal bases pencil (recall Definition 5.1) associated with the polynomial matrix . This follows easily from the facts that in (57) is a minimal basis with all its row degrees equal to one, that
[TABLE]
is a minimal basis dual to with all its row degrees equal to , and that . Thus, using the results stated in the paragraph after Definition 5.1, we get that is a -strong linearization of the polynomial matrix with empty state matrix. Since and are equivalent in , we obtain again the result in Theorem 6.1(b) through a different path which requires to use extra hypotheses. **
6.2 The NLEIGS low rank problem from the point of view of block full rank pencils
The second family of rational matrices considered in [18] comes from approximating NLEPs, , such that the associated matrix is the sum of a polynomial matrix plus a matrix of the form , where the constant matrices have much smaller rank than the size of and are scalar nonlinear functions of . This type of NLEPs arise in several applications [17] and are approximated in [18, eq. (6.2)] by a family of rational matrices of the form
[TABLE]
where are the scalar rational functions in (54), , and are constant matrices, and . For the functions in (55), let us consider the simpler notation and . Then, in order to solve the REP efficiently by taking advantage of the low rank structure of the following pencil is introduced in [18, Sec. 6.4]:
[TABLE]
where
[TABLE]
and
[TABLE]
A result analogous to Theorem 6.1 can be proved for the pencil and the matrix . This is accomplished in Theorem 6.3. We remark, nevertheless, that the result concerning the linearizations at is weaker in Theorem 6.3 than in Theorem 6.1. This is an unavoidable consequence of the used approach and the low rank structure of .
Theorem 6.3**.**
Let be the rational matrix in (59) and be the pencil in (60). Let and be, respectively, the set of finite poles and the number of infinite poles in the list . Then, the following statements hold:
- (a)
* partitioned as in (60) is a block full rank pencil with only one block column associated with in .* 2. (b)
* is a linearization with empty state matrix of in .* 3. (c)
If, in addition, the poles are all finite, then is a linearization with empty state matrix of at of grade .
Proof.
The proof is similar to that of Theorem 6.1 with some differences coming from the presence of the low rank term in . It is immediate to check that
[TABLE]
is a rational basis dual to , that and have both full row rank in and that . Thus, parts (a) and (b) follow from Theorem 5.4.
In order to prove part (c), note first that has full row rank at zero as a consequence of the fact that the poles are all finite. We now consider the rational matrix , which is of the form
[TABLE]
where the entries are defined at Denote by the number of infinite poles in the list . Then, for a certain rational function with Taking into account that the poles are all finite, we have that if and if Therefore, has full row rank at [math] because . Thus, part (c) follows from Theorem 5.8. ∎
A discussion similar to the one in the last paragraph of Section 6.1 can be developed on the basis of Theorem 6.3. The details are omitted for brevity. The open problem corresponding to the information of the finite poles will be solved in Section 6.4.
6.3 The NLEIGS basic problem from the point of view of polynomial system matrices
As discussed previously, the approach presented in Section 6.1 to the NLEIGS pencil in (57) considers as a linearization with empty state matrix and, thus, it does not provide any information on the finite poles of . In order to get this information, we need to identify a convenient square regular submatrix of that may be used as a state matrix. The block structure of makes it not possible to find such a matrix in a way that it includes the information of all the potential poles . This is related with the comment included in [18, p. A2849] on the fact that plays a special role and that it is convenient to choose . In what follows we will not assume that , though the obtained results are simpler and stronger under such assumption, but we will focus on getting information on the finite poles in . With this spirit, we consider the following partition of in (57), where will play the role of the state matrix,
[TABLE]
and the rest of the blocks are easily described from the blocks in (57) as follows: is the first block column of , is obtained by removing the first block of and is obtained by removing the first block column of .
The next technical lemma reveals which is the transfer function matrix of , with the partition above, and establishes necessary and sufficient conditions for to be minimal in the whole field . Of course, the conditions in Lemma 6.4(b) come from imposing that \left[\begin{array}[]{cc}B_{N}(\lambda_{0})&A_{N}(\lambda_{0})\end{array}\right]\in\mathbb{F}^{m(N-1)\times mN} and \left[\begin{array}[]{cc}-C_{N}(\lambda_{0})^{T}&A_{N}(\lambda_{0})^{T}\end{array}\right]^{T}\in\mathbb{F}^{mN\times m(N-1)} have, respectively, full row and column rank for any , but have an important advantage with respect to these direct conditions for minimality. More precisely, the conditions in Lemma 6.4(b) require to evaluate the rational matrix of size , which for practical problems is much smaller than .
Lemma 6.4**.**
Let us consider the pencil in (57) as a polynomial system matrix with state matrix , where is defined through the partition (62), and let be the rational matrix in (56). Then the following statements hold:
- (a)
The transfer function matrix of is 2. (b)
Let us define the rational matrix , whose explicit expression is
[TABLE]
let be the set of finite poles in the list , and assume , , . Then, is minimal in if and only if the matrix is nonsingular for all .
Proof.
Part (a). According to (62), the transfer function matrix of is . The computation of this transfer function is very easy because B_{N}(\lambda)=\left[\begin{array}[]{cccc}-h_{0}(\lambda)I_{m}&0&\cdots&0\end{array}\right]^{T}, which implies that only the first block column of is needed. It is immediate to check that this first block column is
[TABLE]
The rest of the proof of part (a) is just an elementary and short algebraic manipulation.
Part (b). The proof is elementary but long. Thus, it is postponed to A. ∎
We emphasize that Lemma 6.4(a) holds for any rational matrix expressed as in (56) without imposing any extra condition. Moreover, the constant matrix is invertible for any and, so, is minimal in . Combining these results with the fact that and are equivalent in if or in if is finite, we immediately obtain from Definitions 4.1 and 4.3 that is a linearization of with state matrix in , which is a result analogous to Theorem 6.1(b). This approach, of course, does not give any information on the finite poles of , because the finite eigenvalues of coincide with . Such information is obtained from the next result, which is the main result of this section and is a corollary of Lemma 6.4.
Theorem 6.5**.**
Let be the rational matrix in (56), be the pencil in (57), be the submatrix of in (62), and be the rational matrix in (63). Consider the set of finite poles in the list , and assume , , . If is nonsingular for every , then is a linearization of with state matrix in , if , or in , if is finite.
Proof.
Under the hypotheses of Theorem 6.5, is minimal in . Moreover, its transfer function matrix, i.e., is equivalent to in , if , or in , if is finite. The result follows immediately from Definitions 4.1 and 4.3 with . ∎
We emphasize that the hypotheses that the constant matrices in Theorem 6.5 are nonsingular are not mentioned at all in [18], but, fortunately, are generic, in the sense that they are satisfied by almost all regular rational matrices expressed as in (56).
Remark 6.6**.**
Under the conditions of Theorem 6.5, the pole elementary divisors of in , if , or in , if is finite, are the elementary divisors of , as a consequence of Theorem 4.8. These elementary divisors can be very easily determined as follows: first express ; second note that if is the Smith form of , then is the Smith form of ; third, use the fact that , , , to prove that the greatest common divisor of all minors of is equal to , which implies, according to [15, Ch. VI], that there is only one invariant polynomial of different from and that is equal to
[TABLE]
where is a constant that makes monic. Finally, we get that has invariant polynomials different from all equal to . This allows us to obtain easily the finite elementary divisors of and, thus, the finite pole elementary divisors of (in if , or in if is finite). In particular, they are of the form and, in order to obtain the partial multiplicities we have to take into account possible repetitions in . Observe that the infinite for do not contribute at all to the finite pole elementary divisors of Moreover, if , then we can state the compact and simple result that the denominators of the global Smith–McMillan form of are all equal to . However, with this choice of state matrix, there is no way of obtaining information on the pole structure of when it is finite. This is the reason why, even if is minimal in , is not a linearization of in
6.4 The NLEIGS low rank problem from the point of view of polynomial system matrices
The results in this section are the counterpart for in (59) and in (60) of those presented in Section 6.3 for and . For brevity, we avoid in this section to introduce auxiliary comments similar to the corresponding ones in Section 6.3 and just some relevant differences are remarked. The motivation of this section is to obtain from information about the finite poles of . For this purpose, we consider the following partition of in (60), where will play the role of the state matrix,
[TABLE]
and the rest of the blocks are easily described from the blocks in (60) as follows: is the first block column of , is obtained by removing the first block of , and is obtained by removing the first block column of .
The next lemma is the counterpart of Lemma 6.4. Note that the low rank structure in complicates the minimality conditions in part (b) of Lemma 6.7, which are expressed in terms of matrices of size .
Lemma 6.7**.**
Let us consider the pencil in (60) as a polynomial system matrix with state matrix , where is defined through the partition (64), and let be the rational matrix in (59). Then the following statements hold:
- (a)
The transfer function matrix of is 2. (b)
Let us define the rational matrices
[TABLE]
and
[TABLE]
Let be the set of finite poles in the list , and assume that and that , , . Then, is minimal in if and only if the matrix has full column rank for all .
Proof.
Part (a). The proof is similar to that of Lemma 6.4(a) with some differences coming from the presence of the low rank term in . According to (64), the transfer function matrix of is . The computation of this matrix is very easy because, again, \widetilde{B}_{N}(\lambda)=\left[\begin{array}[]{cccc}-h_{0}I_{m}&0&\cdots&0\end{array}\right]^{T}, and only the first block column of is needed, which, in this case, is equal to
[TABLE]
Part (b). The proof is elementary but long. Thus, it is postponed to B. ∎
Remark 6.8**.**
If, in addition to and , , , we assume that , then the necessary and sufficient conditions for minimality in Lemma 6.7(b) can be considerably simplified, since we get as an immediate corollary of Lemma 6.7(b) that “ is minimal in if and only if the matrix has full column rank for every ”. Note that the hypothesis implies that the “no-low rank” term of is a polynomial matrix, as often happens in NLEPs [18].
Observe also that if is the matrix obtained from in (65) by removing the second block row, then under the assumptions and , , , we get, as another immediate corollary of Lemma 6.7(b), the following sufficient condition for minimality: “if is invertible for every , then is minimal in ”. **
Theorem 6.9 is the main result in this section and is an easy corollary of Lemma 6.7. Its proof is omitted because is very similar to that of Theorem 6.5.
Theorem 6.9**.**
Let be the rational matrix in (59), be the pencil in (60), be the submatrix of in (64), and be the rational matrix in (65). Consider the set of finite poles in the list . If , , , , and has full column rank for every , then is a linearization of with state matrix in , if , or in , if is finite.
Finally, note that the conditions in Theorem 6.9 on the full column rank of the matrices can be simplified as in Remark 6.8 under extra hypotheses.
7 Conclusions and future work
A theory of local linearizations of rational matrices has been carefully presented in this paper, by developing as starting point the extension of Rosenbrock’s minimal polynomial system matrices to a local scenario. Moreover, this theory has been applied to a number of pencils that have appeared recently in some influential papers on solving numerically NLEPs by combining rational approximations, linearizations of the resulting rational matrices, and efficient numerical algorithms for generalized eigenvalue problems adapted to the structure of such linearizations. It has been emphasized throughout the paper that the theory of local linearizations allows us to view these pencils, and to explain their properties, from rather different perspectives, which depend on the particular choice of the submatrix of the pencil to be considered as state matrix. In particular, we have seen that the choice of an empty state matrix is simple and adequate for those rational matrices and pencils arising in NLEPs, when the poles are already known from the approximation process. This has led us to define and analyze the very general family of block full rank pencils, as a template that covers many of the pencils, available in the literature, that linearize the rational approximations in the corresponding target set. We plan to extend these ideas in [12], where other ways to choose the state matrices will be explored. In addition, the results in this paper and also the new ones in [12] will be applied to the pencils defined in [21], as well as to other pencils. Finally, we also plan to study numerical properties of some of the linearizations analyzed in this work. In particular, given a linearization of the REP in a set, it is important to study the backward stability in terms of the structure of the rational matrix defining the REP when applying a numerical method to compute the eigenvalues of the linearization. In addition, we plan to investigate the conditioning of eigenvalues, that is, the sensitivity to perturbations, both in the original REP and its linearization, of a zero that is not a pole of the rational matrix.
Appendix A Proof of Lemma 6.4(b)
Let us consider partitioned as in (62) and as a polynomial system matrix with state matrix . Recall throughout the proof that the parameters are all different from zero. Observe first that , and , implies that \left[\begin{array}[]{cc}B_{N}(\lambda_{0})&A_{N}(\lambda_{0})\end{array}\right] has full row rank for any . On the other hand, if we define
[TABLE]
then has full column rank for every , because is invertible in . Therefore, combining the discussion above with Definition 3.3, we obtain that is minimal in if and only if has full column rank for every . The rest of the proof proceeds as follows: we will find a rational matrix such that is equivalent to in and has a simple structure that allows us to see that (and, so, ) has full column rank for every if and only if is invertible for every , where is the rational matrix in (63).
For brevity, we use the notation and for the scalar functions in (55). In addition, in (66) is partitioned as
[TABLE]
where
[TABLE]
Note that the matrix is invertible in and that the last block column of is
[TABLE]
Next, a sequence of equivalence transformations in are applied to . Such transformations are described by using the notation in (67) and (68), and the first one is
[TABLE]
The second transformation is designed to turn zero the second block row of as follows
[TABLE]
The third transformation turns zero the block of and performs a convenient scalar multiplication in its first block row. Such transformation is
[TABLE]
where is the rational matrix in (63). The last transformation makes zero the first blocks of size in the first block row of and yields the announced matrix equivalent to in . More precisely,
[TABLE]
The block of satisfies for all . Therefore, (and, so, ) has full column rank for every if and only if is invertible for all , and the result is proved.
Appendix B Proof of Lemma 6.7(b)
The first part of the proof is completely analogous to the first part of the proof of Lemma 6.4(b). So, some details are ommited. Let us consider partitioned as in (64) and as a polynomial system matrix with state matrix . Then the hypotheses and , and , imply that \left[\begin{array}[]{cc}\widetilde{B}_{N}(\lambda_{0})&\widetilde{A}_{N}(\lambda_{0})\end{array}\right] has full row rank for any . Also, if we define
[TABLE]
then has full column rank for every , because is invertible in . Therefore, is minimal in if and only if has full column rank for every . In the rest of the proof we will find a rational matrix such that is equivalent to in and that allows us to see that (and, so, ) has full column rank for every if and only if in (65) has full column rank for every . We advance that this second part of the proof is considerably more involved than the corresponding part of the proof of Lemma 6.4(b), as a consequence of the presence in of two kinds of blocks, one kind corresponding to the “full rank” part of , i.e., the first summation in (59), and another kind corresponding to the “low rank” part of . Nevertheless, the equivalence transformations in used in the sequel are similar to those in the proof of Lemma 6.4(b), and many details will be omitted for brevity. Recall that we use the notation in (55) omitting the dependence on for simplicity, i.e., we write simply and .
The first two equivalence transformations in that we perform affect only to the last block rows of , i.e., those containing matrices. Thus, in this part of the proof, it is convenient to partition as
[TABLE]
with comprising the first block rows of . In order to construct the first equivalence transformation, we pay attention to the following submatrix of ,
[TABLE]
which is invertible in and has the same structure as in (67). The last block column of has a structure similar to (68) and is denoted by . Then, the first two equivalence transformations are
[TABLE]
where
[TABLE]
with . In order to describe the outcome of the next two transformations, we consider the following submatrix of :
[TABLE]
The next equivalence transformations in are
[TABLE]
where is the rational matrix appearing in (65). Observe that the structure of the last block row of allows us to perform an equivalence transformation in that turns the block \left[\begin{array}[]{ccc}\frac{g_{N}}{h_{N-1}}\widetilde{L}_{p+1}&\cdots&\frac{g_{N}}{h_{N-1}}\widetilde{L}_{N-2}\end{array}\right] into [math] without changing the remaining blocks. The resulting matrix is called . Now, denote by the matrix obtained from by removing its first block row and its last block column, and observe that is invertible in and has the same structure as in (67) with replaced by . The last block column of is denoted by . With this information, the following equivalence transformations are
[TABLE]
where , and
[TABLE]
where is the rational matrix appearing in (65). Finally, the announced matrix is obtained from by using its third block row to transform the block \left[\begin{array}[]{ccc}\!\!\frac{g_{N}}{h_{N-1}}\widetilde{D}_{1}&\cdots&\!\!\!\!\frac{g_{N}}{h_{N-1}}\widetilde{D}_{p-1}\end{array}\right] into [math] without changing the remaining blocks. The structure of implies immediately that has full column rank for every if and only if in (65) has full column rank for every .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Alam, N. Behera, Linearizations for rational matrix functions and Rosenbrock system polynomials , SIAM J. Matrix Anal. Appl. 37(1) (2016) 354–380.
- 2[2] A. Amiraslani, R. M. Corless, P. Lancaster, Linearization of matrix polynomials expressed in polynomial bases , IMA J. Numer. Anal. 29 (2009) 141–157.
- 3[3] A. Amparan, S. Marcaida, I. Zaballa, On the structure invariants of proper rational matrices with prescribed finite poles , Linear and Multilinear Algebra 61(11) (2013) 1464–1486.
- 4[4] A. Amparan, S. Marcaida, I. Zaballa, Finite and infinite structures of rational matrices: a local approach , Electron. J. Linear Algebra 30 (2015) 196–226.
- 5[5] A. Amparan, F. M. Dopico, S. Marcaida, I. Zaballa, Strong linearizations of rational matrices , SIAM J. Matrix Anal. Appl. 39(4) (2018) 1670–1700.
- 6[6] D. J. Cullen, Local system equivalence , Math. Systems Theory 19 (1986) 67-78.
- 7[7] R. Das, R. Alam, Recovery of minimal bases and minimal indices of rational matrices from Fiedler-like pencils , Linear Algebra Appl. 566 (2019) 34–60.
- 8[8] R. Das, R. Alam, Affine spaces of strong linearizations for rational matrices and the recovery of eigenvectors and minimal bases , Linear Algebra Appl. 569 (2019) 335–368.
