Local Linearizations of Rational Matrices with Application to Rational   Approximations of Nonlinear Eigenvalue Problems

Froil\'an M. Dopico; Silvia Marcaida; Mar\'ia C. Quintana; Paul Van; Dooren

arXiv:1907.10972·math.NA·July 26, 2019

Local Linearizations of Rational Matrices with Application to Rational Approximations of Nonlinear Eigenvalue Problems

Froil\'an M. Dopico, Silvia Marcaida, Mar\'ia C. Quintana, Paul Van, Dooren

PDF

TL;DR

This paper introduces a comprehensive framework for local linearizations of rational matrices, enabling structure-preserving approximations of zeros, poles, and eigenvalues, with applications to nonlinear eigenvalue problems.

Contribution

It provides a unified definition of local linearizations that encompasses previous approaches and rigorously explains their properties, especially in the context of nonlinear eigenvalue problems.

Findings

01

New definition of local linearizations for rational matrices.

02

Unified framework explaining properties of existing pencils.

03

Application to rational approximation of nonlinear eigenvalue problems.

Abstract

This paper presents a definition for local linearizations of rational matrices and studies their properties. This definition allows us to introduce matrix pencils associated to a rational matrix that preserve its structure of zeros and poles in subsets of any algebraically closed field and also at infinity. Moreover, such definition includes, as particular cases, other definitions that have been used previously in the literature. In this way, this new theory of local linearizations captures and explains rigorously the properties of all the different pencils that have been used from the 1970's until 2019 for computing zeros, poles and eigenvalues of rational matrices. Particular attention is paid to those pencils that have appeared recently in the numerical solution of nonlinear eigenvalue problems through rational approximation.

Equations235

\left[\begin{array}[]{cc}\operatorname{diag}\left((\lambda-\lambda_{0})^{\nu_{1}},\ldots,(\lambda-\lambda_{0})^{\nu_{r}}\right)&0\\ 0&0_{(p-r)\times(m-r)}\end{array}\right],

\left[\begin{array}[]{cc}\operatorname{diag}\left((\lambda-\lambda_{0})^{\nu_{1}},\ldots,(\lambda-\lambda_{0})^{\nu_{r}}\right)&0\\ 0&0_{(p-r)\times(m-r)}\end{array}\right],

\left[\begin{array}[]{cc}\operatorname{diag}\left(\frac{1}{{\lambda}^{\mu_{1}}},\ldots,\frac{1}{{\lambda}^{\mu_{r}}}\right)&0\\ 0&0_{(p-r)\times(m-r)}\end{array}\right]

\left[\begin{array}[]{cc}\operatorname{diag}\left(\frac{1}{{\lambda}^{\mu_{1}}},\ldots,\frac{1}{{\lambda}^{\mu_{r}}}\right)&0\\ 0&0_{(p-r)\times(m-r)}\end{array}\right]

\left[\begin{array}[]{cc}\operatorname{diag}\left(\frac{\epsilon_{1}(\lambda)}{\psi_{1}(\lambda)},\ldots,\frac{\epsilon_{r}(\lambda)}{\psi_{r}(\lambda)}\right)&0\\ 0&0_{(p-r)\times(m-r)}\end{array}\right]

\left[\begin{array}[]{cc}\operatorname{diag}\left(\frac{\epsilon_{1}(\lambda)}{\psi_{1}(\lambda)},\ldots,\frac{\epsilon_{r}(\lambda)}{\psi_{r}(\lambda)}\right)&0\\ 0&0_{(p-r)\times(m-r)}\end{array}\right]

M_{G} (λ) M_{H} (λ) = diag (f_{1} (λ) g_{1} (λ), \dots, f_{r} (λ) g_{r} (λ), 0_{(p - r) \times (m - r)}), and = diag (f_{1} (λ) h_{1} (λ), \dots, f_{r} (λ) h_{r} (λ), 0_{(p - r) \times (m - r)}),

M_{G} (λ) M_{H} (λ) = diag (f_{1} (λ) g_{1} (λ), \dots, f_{r} (λ) g_{r} (λ), 0_{(p - r) \times (m - r)}), and = diag (f_{1} (λ) h_{1} (λ), \dots, f_{r} (λ) h_{r} (λ), 0_{(p - r) \times (m - r)}),

G (λ) = D (λ) + C (λ) A (λ)^{- 1} B (λ)

G (λ) = D (λ) + C (λ) A (λ)^{- 1} B (λ)

P (λ) = [A (λ) - C (λ) B (λ) D (λ)]

P (λ) = [A (λ) - C (λ) B (λ) D (λ)]

rank P (λ) = n + rank G (λ),

rank P (λ) = n + rank G (λ),

P (λ) = [I_{n} - C (λ) A (λ)^{- 1} 0 I_{p}] [A (λ) 0 0 G (λ)] [I_{n} 0 A (λ)^{- 1} B (λ) I_{m}] .

P (λ) = [I_{n} - C (λ) A (λ)^{- 1} 0 I_{p}] [A (λ) 0 0 G (λ)] [I_{n} 0 A (λ)^{- 1} B (λ) I_{m}] .

rank [A (λ_{0}) C (λ_{0})] = rank [A (λ_{0}) B (λ_{0})] = n .

rank [A (λ_{0}) C (λ_{0})] = rank [A (λ_{0}) B (λ_{0})] = n .

rank [A (λ) C (λ)] = rank [A (λ) B (λ)] = n

rank [A (λ) C (λ)] = rank [A (λ) B (λ)] = n

G (λ) = - B_{0} + λ A_{0} + \frac{B _{1}}{λ - σ _{1}} + \dots + \frac{B _{s}}{λ - σ _{s}} \in C (λ)^{p \times p},

G (λ) = - B_{0} + λ A_{0} + \frac{B _{1}}{λ - σ _{1}} + \dots + \frac{B _{s}}{λ - σ _{s}} \in C (λ)^{p \times p},

P(\lambda)=\left[\begin{array}[]{cccc|c}(\lambda-\sigma_{1})I&&&&I\\ &(\lambda-\sigma_{2})I&&&I\\ &&\ddots&&\vdots\\ &&&(\lambda-\sigma_{s})I&I\\ \hline\cr-B_{1}&-B_{2}&\cdots&-B_{s}&\lambda A_{0}-B_{0}\\ \end{array}\right].

P(\lambda)=\left[\begin{array}[]{cccc|c}(\lambda-\sigma_{1})I&&&&I\\ &(\lambda-\sigma_{2})I&&&I\\ &&\ddots&&\vdots\\ &&&(\lambda-\sigma_{s})I&I\\ \hline\cr-B_{1}&-B_{2}&\cdots&-B_{s}&\lambda A_{0}-B_{0}\\ \end{array}\right].

P (λ) = [A (λ) - C (λ) B (λ) D (λ)] \in F [λ]^{(n + p) \times (n + m)}

P (λ) = [A (λ) - C (λ) B (λ) D (λ)] \in F [λ]^{(n + p) \times (n + m)}

U (λ) [A (λ) B (λ)] V (λ) = [S (λ) 0],

U (λ) [A (λ) B (λ)] V (λ) = [S (λ) 0],

U (λ) [H_{1} (λ) A (λ) - C (λ)] V (λ) = [S (λ) 0],

U (λ) [H_{1} (λ) A (λ) - C (λ)] V (λ) = [S (λ) 0],

P (λ) := [H_{1} (λ) 0 0 I_{p}] [A (λ) - C (λ) B (λ) D (λ)] [H_{2} (λ) 0 0 I_{m}] = [H_{1} (λ) A (λ) H_{2} (λ) - C (λ) H_{2} (λ) H_{1} (λ) B (λ) D (λ)] .

P (λ) := [H_{1} (λ) 0 0 I_{p}] [A (λ) - C (λ) B (λ) D (λ)] [H_{2} (λ) 0 0 I_{m}] = [H_{1} (λ) A (λ) H_{2} (λ) - C (λ) H_{2} (λ) H_{1} (λ) B (λ) D (λ)] .

Z (λ) := [H_{1} (λ) A (λ) H_{2} (λ) H_{1} (λ) B (λ)]

Z (λ) := [H_{1} (λ) A (λ) H_{2} (λ) H_{1} (λ) B (λ)]

rank [H_{1} (λ_{1}) A (λ_{1}) V (λ_{1}) H_{1} (λ_{1}) B (λ_{1})] = n,

rank [H_{1} (λ_{1}) A (λ_{1}) V (λ_{1}) H_{1} (λ_{1}) B (λ_{1})] = n,

rank [H_{1} (λ_{1}) A (λ_{1}) V (λ_{1}) H_{1} (λ_{1}) B (λ_{1})] = rank (Z (λ_{1}) [S (λ_{1}) 0 0 I_{m}]) \leq rank Z (λ_{1}) < n,

rank [H_{1} (λ_{1}) A (λ_{1}) V (λ_{1}) H_{1} (λ_{1}) B (λ_{1})] = rank (Z (λ_{1}) [S (λ_{1}) 0 0 I_{m}]) \leq rank Z (λ_{1}) < n,

P (λ) = [A (λ) - C (λ) B (λ) D (λ)] \in F [λ]^{(n + p) \times (n + m)}

P (λ) = [A (λ) - C (λ) B (λ) D (λ)] \in F [λ]^{(n + p) \times (n + m)}

G (λ) = Q (λ) + G_{s p} (λ)

G (λ) = Q (λ) + G_{s p} (λ)

rev_{g} G (λ) := λ^{g} G (\frac{1}{λ}) .

rev_{g} G (λ) := λ^{g} G (\frac{1}{λ}) .

rev P (λ) = [rev_{d} A (λ) - rev_{d} C (λ) rev_{d} B (λ) rev_{d} D (λ)],

rev P (λ) = [rev_{d} A (λ) - rev_{d} C (λ) rev_{d} B (λ) rev_{d} D (λ)],

\mbox{rev}P(\lambda)=\left[\begin{array}[]{cccc|c}(1-\lambda\sigma_{1})I&&&&\lambda I\\ &(1-\lambda\sigma_{2})I&&&\lambda I\\ &&\ddots&&\vdots\\ &&&(1-\lambda\sigma_{s})I&\lambda I\\ \hline\cr-\lambda B_{1}&-\lambda B_{2}&\cdots&-\lambda B_{s}&A_{0}-\lambda B_{0}\\ \end{array}\right]

\mbox{rev}P(\lambda)=\left[\begin{array}[]{cccc|c}(1-\lambda\sigma_{1})I&&&&\lambda I\\ &(1-\lambda\sigma_{2})I&&&\lambda I\\ &&\ddots&&\vdots\\ &&&(1-\lambda\sigma_{s})I&\lambda I\\ \hline\cr-\lambda B_{1}&-\lambda B_{2}&\cdots&-\lambda B_{s}&A_{0}-\lambda B_{0}\\ \end{array}\right]

rank [rev_{d} A (0) rev_{d} C (0)] = rank [rev_{d} A (0) rev_{d} B (0)] = n .

rank [rev_{d} A (0) rev_{d} C (0)] = rank [rev_{d} A (0) rev_{d} B (0)] = n .

rank [A_{d} C_{d}] = rank [A_{d} B_{d}] = n .

rank [A_{d} C_{d}] = rank [A_{d} B_{d}] = n .

P (λ) = [A (λ) - C (λ) B (λ) D (λ)] \in F [λ]^{(n + p) \times (n + m)}

P (λ) = [A (λ) - C (λ) B (λ) D (λ)] \in F [λ]^{(n + p) \times (n + m)}

e_{i} = q_{i} + g i = 1, \dots, r .

e_{i} = q_{i} + g i = 1, \dots, r .

G (λ) = B_{1} (λ) diag ((1/ λ)^{q_{1}}, \dots, (1/ λ)^{q_{r}}, 0_{(p - r) \times (m - r)}) B_{2} (λ) .

G (λ) = B_{1} (λ) diag ((1/ λ)^{q_{1}}, \dots, (1/ λ)^{q_{r}}, 0_{(p - r) \times (m - r)}) B_{2} (λ) .

G (1/ λ) = B_{1} (1/ λ) diag (λ^{q_{1}}, \dots, λ^{q_{r}}, 0_{(p - r) \times (m - r)}) B_{2} (1/ λ) .

G (1/ λ) = B_{1} (1/ λ) diag (λ^{q_{1}}, \dots, λ^{q_{r}}, 0_{(p - r) \times (m - r)}) B_{2} (1/ λ) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Local Linearizations of Rational Matrices with Application to Rational Approximations of Nonlinear Eigenvalue Problems

Froilán M. Dopico111Supported by “Ministerio de Economía, Industria y Competitividad (MINECO)” of Spain and “Fondo Europeo de Desarrollo Regional (FEDER)” of EU through grants MTM2015-65798-P and MTM2017-90682-REDT. The research of M. C. Quintana is funded by the “contrato predoctoral” BES-2016-076744 of MINECO.

[email protected]

Silvia Marcaida222Supported by “Ministerio de Economía, Industria y Competitividad (MINECO)” of Spain and “Fondo Europeo de Desarrollo Regional (FEDER)” of EU through grants MTM2017-83624-P and MTM2017-90682-REDT, and by UPV/EHU through grant GIU16/42.

[email protected]

María C. Quintana33footnotemark: 3

[email protected]

Paul Van Dooren444This work was partially developed while Paul Van Dooren held a “Chair of Excellence UC3M - Banco de Santander” at Universidad Carlos III de Madrid in the academic year 2017-2018.

[email protected]

Departamento de Matemáticas, Universidad Carlos III de Madrid, Avda. Universidad 30, 28911 Leganés, Spain.

Departamento de Matemática Aplicada y Estadística e Investigación Operativa, Universidad del País Vasco UPV/EHU, Apdo. Correos 644, Bilbao 48080, Spain.

Department of Mathematical Engineering, Université catholique de Louvain, Avenue Georges Lemaître 4, B-1348 Louvain-la-Neuve, Belgium.

Abstract

This paper presents a definition for local linearizations of rational matrices and studies their properties. This definition allows us to introduce matrix pencils associated to a rational matrix that preserve its structure of zeros and poles in subsets of any algebraically closed field and also at infinity. Moreover, such definition includes, as particular cases, other definitions that have been used previously in the literature. In this way, this new theory of local linearizations captures and explains rigorously the properties of all the different pencils that have been used from the 1970’s until 2019 for computing zeros, poles and eigenvalues of rational matrices. Particular attention is paid to those pencils that have appeared recently in the numerical solution of nonlinear eigenvalue problems through rational approximation.

keywords:

rational matrix , rational eigenvalue problem , nonlinear eigenvalue problem , linearization , polynomial system matrix , rational approximation , block full rank pencils AMS subject classifications: 65F15, 15A18, 15A22, 15A54, 93B18, 93B20, 93B60

1 Introduction

Rational matrices, i.e., matrices whose entries are rational functions of a scalar variable, are a classical topic inside matrix theory that has received a lot of attention since the 1950s, as a consequence of their fundamental role in linear systems and control theory [23, 24]. Classical references on rational matrices and their applications to these areas are, for instance, the pioneering monographs [19, 28]. The most relevant structural data of a rational matrix are its zeros and poles, together with their partial multiplicities or structural indices, and its minimal indices, which exist only when the matrix is singular, i.e., rectangular or square with identically zero determinant. These structural data are very important in the applications mentioned above, which motivated in the 1970s a considerable research activity on the development of numerical algorithms for computing them, see [33] and the references therein. Among the different algorithms developed for this purpose in the 1970-80s, the most reliable ones were based on constructing a matrix pencil, i.e., a matrix polynomial of degree $1$ , containing exactly all the information about the structural data of the considered rational matrix [33, 37], and then applying to this matrix pencil backward stable algorithms, developed also in the 1970s, for computing the eigenvalues and/or other structural data of general pencils [26, 32].

The pencils mentioned in the previous paragraph are among the first examples of linearizations of rational matrices. Such pencils are, in fact, particular instances of minimal polynomial system matrices of the considered rational matrix, a key concept introduced by Rosenbrock [28] that allows us, among other things, to include simultaneously all the information about the zeros and the poles of a rational matrix into a polynomial matrix.

Recently, rational matrices have received considerable attention from the different perspective of what are called rational eigenvalue problems (REPs). Such REPs may arise directly from applications [25], as approximations of other nonlinear eigenvalue problems (NLEPs) (see, for instance, [18, 21, 29, 31]), and, even more, REPs have also been used to approximate polynomial eigenvalue problems (PEPs) in order to take advantage of certain low rank structures [22]. Since NLEPs are nowadays a very active area of research (see the recent survey [17] and the references therein), REPs and rational matrices are currently a hot topic inside applied and numerical linear algebra. In this scenario, it is of interest to establish in the next paragraphs connections and differences between how rational matrices are viewed in the classic areas of linear systems and control theory and in the modern one of NLEPs, since, unfortunately, some modern and pioneering references on NLEPs seem to ignore classic results on rational matrices.

First, let us review the definition of REPs. Given a regular rational matrix $G(\lambda)$ , the corresponding REP is defined as computing numbers $\lambda_{0}$ and nonzero vectors $x$ such that $G(\lambda_{0})x=0$ . These $\lambda_{0}$ and $x$ are called eigenvalues and (right) eigenvectors of $G(\lambda)$ , respectively, a terminology inherited from other matrix eigenvalue problems but that has never been used in standard references on rational matrices [19, 28]. Observe that the definition of REP assumes implicitly that $G(\lambda_{0})$ is defined at $\lambda_{0}$ , i.e., none of their entries become infinite. Thus, using the classic definitions for the structural data of rational matrices, we can say that $\lambda_{0}$ is a zero of $G(\lambda)$ but not a pole, and we can see REPs as particular cases of the computational problems on rational matrices investigated in the 1970-80s.

Second, we emphasize that rational approximations of NLEPs are only reliable in a certain target set. Moreover, in many works [18, 21, 29, 31], the matrix defining the NLEP is assumed to be analytic in the target region, and such region does not contain the poles of the rational matrix defining the approximating REP. In particular, the poles are already known from the approximation process. This means that for those rational matrices coming from approximating these NLEPs, the poles are of no interest (since they are known), and only those zeros (eigenvalues) in the target set have to be computed. In addition, the structure at infinity (see [19] for a definition) is also of no interest. This is in stark contrast with the situation for rational matrices arising in linear systems and control theory, which, usually, are transfer functions of time invariant linear systems and, therefore, all the finite and infinite structure of zeros and poles related to the transfer function is of interest and has to be computed [33].

As said before, some influential modern references on solving numerically NLEPs via rational approximations ignore classic results on rational matrices. Probably, this is a consequence of the differences mentioned in the previous paragraph and, also, of the fact that rational matrices coming from approximating NLEPs may appear represented in forms different from the most standard ones in linear system and control theory. This lack of connections with classical results is unfortunate, but has had also the positive effect of producing new results on and approaches to rational matrices. For instance, on the unfortunate side, it is surprising that the idea of solving REPs via linearizations was not used in modern references until the key paper [30] was published, despite the fact it had been intensively used much earlier (see [33] and the references therein), and it is one of the most reliable methods for solving REPs. On the positive side, [30] introduced a new companion-like linearization of any rational matrix that is very useful in computations. For this purpose, [30] expressed the rational matrix as the sum of a polynomial matrix and a state-space realization and approached the problem with the spirit of linearizations of polynomial matrices [16], instead of using the classical point of view of polynomial system matrices. (However, it is worth highlighting that, in Example 4.11, we will see that the linearization in [30] is nothing else than a polynomial system matrix of the considered rational matrix. We will see in Section 6 that the same happens for the linearizations in [18].)

Another point to be remarked is that reference [30] started a confusing practice, common to several references dealing with linearizations of rational matrices that approximate NLEPs. Namely, to term as “linearizations” pencils which are proved to contain only partial information about the corresponding rational matrix. For example, the papers [18, 21, 29, 30], which are excellent from the numerical point of view, only prove (at most) that the algebraic and geometric multiplicities of the eigenvalues are preserved in the “linearization”, but nothing is proved about the partial multiplicities. This is in contrast with the standard definition of (strong) linearization of polynomial matrices [16, 9], which guarantees that linearizations contain all the information about the eigenvalues of polynomial matrices (including at infinity in the strong case), as well as with the linear minimal polynomial system matrices used as linearizations of rational matrices in [33, 37], which contain all the information about poles and zeros of the rational matrices.

The partial results proved in [30] were among the motivations of the development of a rigorous definition and theory of strong linearizations of arbitrary (regular or singular, square or rectangular) rational matrices in [5]. Moreover, infinitely many examples of such strong linearizations have been constructed in [5, Section 5.2] through the family of so-called strong block minimal bases linearizations of rational matrices. In simple words, the main idea of the theory in [5] is to combine minimal polynomial system matrices of rational matrices with the theory of linearizations of polynomial matrices [9, 10, 16] in the following sense: strong linearizations of a rational matrix $G(\lambda)$ are linear minimal polynomial system matrices of rational matrices $\widehat{G}(\lambda)$ that may be different from $G(\lambda)$ , but that are related to it via unimodular polynomial matrices, biproper rational matrices, and direct sums with identities. In this way such strong linearizations contain all the information about poles and zeros of the considered rational matrices and extend the “linearizations” used in [33, 37], which correspond to the particular case when $\widehat{G}(\lambda)=G(\lambda)$ . Related works about linearizations containing all the pole-zero information of a rational matrix (in some cases not at infinity) are [1, 7, 8, 11].

However, the definitions of linearization and strong linearization in [5] do not capture always the pencils defined in [18, 21, 29, 30] for two reasons. First, the pencils in [18, 21, 29, 30] do not always satisfy the minimality requirements of the definitions in [5]. Second, and related to the first fact, some of these pencils may not content all the information about the poles of the rational matrix (neither the information of those zeros that are also poles), and a zero of the linearization could be a pole of the rational matrix but not a zero. But, we stress that this is not a drawback in the setting of [18, 21, 29, 30] because, as explained before, in these cases the poles are of no interest, and only the eigenvalues in a certain target set have to be computed. This motivates us to develop in this paper a theory of what we call local linearizations of rational matrices, where the word local means that the linearization is only guaranteed to contain all the information about those zeros and poles of the rational matrix which are located in a certain set.

The theory of local linearizations of rational matrices captures all the pencils that have been used (as far as we know) in the literature for solving REPs arising from approximating NLEPs. As illustration, we will apply in this paper this theory to the pencils in [18, 29, 30] in several different ways. The application to the pencils in [21] is postponed to [12] with the goal of limiting the length of this paper. In addition, we will see that the definition of local linearizations include the definitions of linearizations and strong linearizations of arbitrary rational matrices presented in [5], just by considering as set the whole underlying field and including infinity in the strong case. As a consequence, local linearizations also include the pencils originally used in [33, 37]. Thus, this new local theory is a flexible tool that generalizes and includes most of the previous results available in the literature in this area. This is in part possible due to a new and more flexible treatment of polynomial system matrices at infinity.

The theory of local linearizations of rational matrices is based on the extension of Rosenbrock’s fundamental concept of minimal polynomial system matrix to a local perspective. Such extension is performed in a very simple and applicable manner that avoids as much as possible the use of abstract algebraic concepts. This is in contrast with related local approaches as the one in [6] and the references therein, which, in addition, are focused on the underlying local equivalence relationships rather than on the properties of polynomial system matrices. The local linearization approach connects the concept of linearization with classical results as the local Smith form of polynomial matrices (see, for instance, [16, Section S1.5]) and the local Smith–McMillan form of rational matrices (see [27, Theorem II.9] and [34]).

The paper is organized as follows. Section 2 summarizes some basic results that will be used in the rest of the paper. Locally minimal polynomial system matrices are defined and studied in Section 3. Section 4 presents the main definitions and properties of local linearizations of rational matrices. Section 5 introduces the so-called block full rank pencils, which are linearizations of rational matrices that do not contain any information about the poles, and are closely related to the block minimal bases linearizations of polynomial matrices recently presented in [10]. The application of the local theory to the pencils in [18] is analyzed in depth and from two perspectives in Section 6. Finally, Section 7 discusses the conclusions and some lines of future research. Several examples that illustrate the theoretical results are scattered throughout the paper. They are often based on the pencils introduced in [29, 30].

2 Preliminaries

We assume throughout this paper that $\mathbb{F}$ is an algebraically closed field that does not include infinity. As usual, $\mathbb{F}[\lambda]$ denotes the ring of polynomials with coefficients in $\mathbb{F}$ and $\mathbb{F}(\lambda)$ the field of rational functions or, equivalently, the field of fractions of $\mathbb{F}[\lambda]$ . A rational function $r(\lambda)=\frac{n(\lambda)}{d(\lambda)}$ is said to be proper if $\deg(n(\lambda))\leq\deg(d(\lambda)),$ strictly proper if $\deg(n(\lambda))<\deg(d(\lambda)),$ and biproper if $\deg(n(\lambda))=\deg(d(\lambda))$ , where $\deg(\cdot)$ stands for “degree of”.

$\mathbb{F}^{p\times m}$ , $\mathbb{F}[\lambda]^{p\times m}$ and $\mathbb{F}(\lambda)^{p\times m}$ denote the sets of $p\times m$ matrices with elements in $\mathbb{F},$ $\mathbb{F}[\lambda]$ and $\mathbb{F}(\lambda),$ respectively. The elements of $\mathbb{F}[\lambda]^{p\times m}$ are called polynomial matrices or matrix polynomials. In the sequel we will use both terms. A unimodular matrix is a square polynomial matrix with polynomial inverse or, equivalently, a square polynomial matrix with nonzero constant determinant. Moreover, the elements of $\mathbb{F}(\lambda)^{p\times m}$ are called rational matrices. A (strictly) proper rational matrix is a rational matrix whose entries are (strictly) proper rational functions. A biproper matrix is a square proper matrix with proper inverse or, equivalently, a square proper matrix whose determinant is a biproper rational function. The normal rank of a polynomial or rational matrix $G(\lambda)$ is the size of its largest nonidentically zero minor and is denoted by $\mathop{\rm rank}\nolimits G(\lambda)$ . See [19] and [35] for more information on these and other concepts related to polynomial and rational matrices.

As a first step to define local linearizations of rational matrices, we present local notions and results about rational matrices. We denote the point at infinity as $\infty.$

Definition 2.1.

Let $R(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ . Let $\lambda_{0}\in\mathbb{F}$ , and $\Sigma\subseteq\mathbb{F}$ be nonempty.

(i)

$R(\lambda)$ * is defined or bounded at $\lambda_{0}$ if $R(\lambda_{0})\in\mathbb{F}^{p\times m}.$ *

(ii)

$R(\lambda)$ * is defined or bounded at $\infty$ if $R(1/\lambda)$ is defined at $0.$ *

(iii)

$R(\lambda)$ * is defined or bounded in $\Sigma$ if $R(\lambda_{0})\in\mathbb{F}^{p\times m}$ for all $\lambda_{0}\in\Sigma$ .*

Notice that a rational matrix being defined at $\lambda_{0}\in\mathbb{F}$ is equivalent to having a Taylor expansion around $\lambda_{0}.$ Moreover, a rational matrix is defined at infinity if and only if is proper.

Definition 2.2.

Let $R(\lambda)\in\mathbb{F}(\lambda)^{m\times m}$ . Let $\lambda_{0}\in\mathbb{F}$ , and $\Sigma\subseteq\mathbb{F}$ be nonempty.

(i)

$R(\lambda)$ * is regular or invertible at $\lambda_{0}$ if it is defined at $\lambda_{0}$ and $\det R(\lambda_{0})\neq 0.$ *

(ii)

$R(\lambda)$ * is regular or invertible at $\infty$ if $R(1/\lambda)$ is regular at $0.$ *

(iii)

$R(\lambda)$ * is regular or invertible in $\Sigma$ if it is regular at each $\lambda_{0}\in\Sigma.$ *

A rational matrix $R(\lambda)$ is said to be regular if it is regular for some $\lambda_{0}\in\mathbb{F}.$ That is, if $R(\lambda)$ is square and $\det R(\lambda)\not\equiv 0.$ Note that $R(\lambda)$ is regular at $\lambda_{0}\in\mathbb{F}$ if and only if both $R(\lambda)$ and $R(\lambda)^{-1}$ have a Taylor expansion around $\lambda_{0}.$ Moreover, biproper matrices are those rational matrices that are regular at infinity, while unimodular matrices are those rational matrices that are regular in $\mathbb{F}$ .

In regard to the previous definitions, we introduce some equivalence relations defined in the set of rational matrices [3, 4].

Definition 2.3.

Let $G(\lambda),H(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ . Let $\lambda_{0}\in\mathbb{F}$ , and $\Sigma\subseteq\mathbb{F}$ be nonempty.

(i)

$G(\lambda)$ * and $H(\lambda)$ are equivalent at $\lambda_{0}$ if there exist rational matrices $R_{1}(\lambda)\in\mathbb{F}(\lambda)^{p\times p}$ and $R_{2}(\lambda)\in\mathbb{F}(\lambda)^{m\times m}$ both regular at $\lambda_{0}$ such that $R_{1}(\lambda)G(\lambda)R_{2}(\lambda)=H(\lambda).$ *

(ii)

$G(\lambda)$ * and $H(\lambda)$ are equivalent at $\infty$ if there exist rational matrices $R_{1}(\lambda)\in\mathbb{F}(\lambda)^{p\times p}$ and $R_{2}(\lambda)\in\mathbb{F}(\lambda)^{m\times m}$ both regular at $\infty$ such that $R_{1}(\lambda)G(\lambda)R_{2}(\lambda)=H(\lambda).$ *

(iii)

$G(\lambda)$ * and $H(\lambda)$ are equivalent in $\Sigma$ if there exist rational matrices $R_{1}(\lambda)\in\mathbb{F}(\lambda)^{p\times p}$ and $R_{2}(\lambda)\in\mathbb{F}(\lambda)^{m\times m}$ both regular in $\Sigma$ such that $R_{1}(\lambda)G(\lambda)R_{2}(\lambda)=H(\lambda).$ *

Note that if $\Sigma=\mathbb{F}$ is considered in Definition 2.3 $\rm(iii)$ , then $R_{1}(\lambda)$ and $R_{2}(\lambda)$ are both unimodular, and the standard definition of unimodular equivalence is recovered.

We now introduce the definition of the local Smith–McMillan form of a rational matrix at a point (finite and infinite). The notion of the Smith–McMillan form of a rational matrix was first studied by McMillan in [23, 24] and, then, in other works as [19, 27, 28, 35, 38]. The local Smith–McMillan form is a particular case of the very general (and abstract) result [27, Theorem II.9]. A description valid for rational matrices over the complex field can be found in [34], and a complete and rigorous modern treatment in [4]. Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ be any rational matrix of normal rank $r$ . Let $\lambda_{0}\in\mathbb{F}.$ Then $G(\lambda)$ is equivalent at $\lambda_{0}$ to a matrix of the form

[TABLE]

where $\nu_{1}\leq\cdots\leq\nu_{r}$ are integers. The integers $\nu_{1},\ldots,\nu_{r}$ are uniquely determined by $G(\lambda)$ and $\lambda_{0}$ , and are called the invariant orders at $\lambda_{0}$ of $G(\lambda)$ . The matrix in (1) is called the local Smith–McMillan form of $G(\lambda)$ at $\lambda_{0}.$ Moreover, $G(\lambda)$ is equivalent at $\infty$ to a matrix of the form

[TABLE]

where $\mu_{1}\leq\cdots\leq\mu_{r}$ are integers. These integers $\mu_{1},\ldots,\mu_{r}$ are uniquely determined by $G(\lambda)$ , and are called the invariant orders at infinity of $G(\lambda)$ . The matrix in (2) is called the Smith–McMillan form of $G(\lambda)$ at $\infty.$

In order to define zeros and poles we need to distinguish between positive and negative invariant orders [19, 35]. When we say that a rational matrix has $\nu_{1}\leq\cdots\leq\nu_{k}<0=\nu_{k+1}=\cdots=\nu_{u-1}<\nu_{u}\leq\cdots\leq\nu_{r}$ as invariant orders at $\lambda_{0}$ (infinity) we mean that $k$ may take values from 0 to $r$ and $u$ from $1$ to $r+1$ . For instance, if $k=0$ all the invariant orders are nonnegative; if, in addition, $u=1$ then they are all positive, but if $k=0$ and $u=r+1$ they are all 0.

Definition 2.4.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and $\lambda_{0}\in\mathbb{F}$ . Let $\nu_{1}\leq\cdots\leq\nu_{k}<0=\nu_{k+1}=\cdots=\nu_{u-1}<\nu_{u}\leq\cdots\leq\nu_{r}$ be the invariant orders at $\lambda_{0}$ of $G(\lambda)$ . Then $\lambda_{0}$ is said to be a pole of $G(\lambda)$ with partial multiplicities $-\nu_{k},\ldots,-\nu_{1},$ and a zero of $G(\lambda)$ with partial multiplicities $\nu_{u},\ldots,\nu_{r}.$ In particular, the positive integers $-\nu_{k},\ldots,-\nu_{1}$ and $\nu_{u},\ldots,\nu_{r}$ are called the pole and zero partial multiplicities of $G(\lambda)$ at $\lambda_{0},$ respectively. Moreover, $(\lambda-\lambda_{0})^{-\nu_{i}}$ for $i=1,\ldots,k$ are called the pole elementary divisors of $G(\lambda)$ at $\lambda_{0}$ , while $(\lambda-\lambda_{0})^{\nu_{i}}$ for $i=u,\ldots,r$ are called the zero elementary divisors of $G(\lambda)$ at $\lambda_{0}.$ Finally, the pole (zero) algebraic multiplicity of $\lambda_{0}$ is the sum of its pole (zero) partial multiplicities, and the pole (zero) geometric multiplicity of $\lambda_{0}$ is the number of its pole (zero) partial multiplicities.

If $G(\lambda)$ is a polynomial matrix then the polynomials $(\lambda-\lambda_{0})^{\nu_{i}}$ with $\nu_{i}\neq 0$ are simply called elementary divisors of $G(\lambda)$ at $\lambda_{0},$ and the nonzero integers $\nu_{i}\neq 0$ are all positive and are called partial multiplicities of $G(\lambda)$ at $\lambda_{0}.$

Definition 2.5.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ . Let $\mu_{1}\leq\cdots\leq\mu_{\ell}<0=\mu_{{\ell}+1}=\cdots=\mu_{t-1}<\mu_{t}\leq\cdots\leq\mu_{r}$ be the invariant orders at $\infty$ of $G(\lambda)$ . Then $\infty$ is said to be a pole of $G(\lambda)$ with partial multiplicities $-\mu_{\ell},\ldots,-\mu_{1},$ and a zero of $G(\lambda)$ with partial multiplicities $\mu_{t},\ldots,\mu_{r}.$ In particular, the integers $-\mu_{\ell},\ldots,-\mu_{1}$ and $\mu_{t},\ldots,\mu_{r}$ are called the pole and zero partial multiplicities of $G(\lambda)$ at $\infty,$ respectively.

Some modern references, see for instance [1, 18, 30], also consider (finite) eigenvalues of rational matrices, a concept that is not mentioned at all in classical references of rational matrices. According to these modern references, we introduce the following definition.

Definition 2.6.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ be a rational matrix. A finite eigenvalue of $G(\lambda)$ is any $\lambda_{0}\in\mathbb{F}$ such that $\mathop{\rm rank}\nolimits G(\lambda_{0})<\mathop{\rm rank}\nolimits G(\lambda),$ 555Note that here $\mathop{\rm rank}\nolimits G(\lambda)$ denotes the normal rank of $G(\lambda),$ while $\mathop{\rm rank}\nolimits G(\lambda_{0})$ is the rank of the constant matrix $G(\lambda_{0}).$ with $G(\lambda_{0})\in\mathbb{F}^{p\times m}.$ That is, $\lambda_{0}$ is a finite zero of $G(\lambda)$ but not a pole.

Observe that if $G(\lambda)\in\mathbb{F}(\lambda)^{p\times p}$ is regular, an eigenvalue of $G(\lambda)$ is any $\lambda_{0}\in\mathbb{F}$ such that there exists a nonzero vector $x\in\mathbb{F}^{p}$ satisfying $G(\lambda_{0})x=0$ with $G(\lambda_{0})\in\mathbb{F}^{p\times p},$ which is the standard definition of REP (Rational Eigenvalue Problem).

As a consequence of [4, Theorem 2.3] (see [3, Section 2] for more details) we can also present the Smith–McMillan form of a rational matrix in a nonempty subset of $\mathbb{F}$ , say $\Sigma$ . Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ with normal rank $r$ . Then $G(\lambda)$ is equivalent in $\Sigma$ to a matrix of the form

[TABLE]

where, for $i=1,\ldots,r$ , $\frac{\epsilon_{i}(\lambda)}{\psi_{i}(\lambda)}$ are nonzero irreducible rational functions, $\epsilon_{i}(\lambda)$ and $\psi_{i}(\lambda)$ are monic (leading coefficient equal to 1) polynomials which are either constants or whose roots are in $\Sigma$ and $\epsilon_{1}(\lambda)\mid\cdots\mid\epsilon_{r}(\lambda)$ while $\psi_{r}(\lambda)\mid\cdots\mid\psi_{1}(\lambda)$ , where $\mid$ stands for divisibility. We refer to (3) as the Smith–McMillan form in $\Sigma$ of $G(\lambda)$ . When we take $\Sigma=\mathbb{F}$ , we obtain the (finite) Smith–McMillan form of $G(\lambda)$ , i.e., the classical Smith–McMillan form of $G(\lambda)$ . In this case, if $G(\lambda)$ is polynomial then $\psi_{1}(\lambda)=\cdots=\psi_{r}(\lambda)=1,$ $\epsilon_{1}(\lambda),\ldots,\epsilon_{r}(\lambda)$ are the invariant polynomials of $G(\lambda)$ , and (3) is called the Smith normal form of $G(\lambda)$ .

Notice that the Smith–McMillan form of a rational matrix in a nonempty set $\Sigma\subseteq\mathbb{F}$ is invariant under multiplication by regular rational matrices in $\Sigma,$ i.e., under equivalence in $\Sigma.$ Analogously, the Smith–McMillan form at $\infty$ is invariant under multiplication by biproper matrices, i.e., under equivalence at $\infty.$

The next result shows that the equivalence of rational matrices in nonempty sets is a local property.

Proposition 2.7.

Let $\Sigma\subseteq\mathbb{F}$ be nonempty. Two rational matrices of the same size are equivalent in $\Sigma$ if and only if they are equivalent at each $\lambda_{0}\in\Sigma.$

Proof.

If two rational matrices are equivalent in $\Sigma$ then, by Definitions 2.3 and 2.2, it is straightforward that they are equivalent at each $\lambda_{0}\in\Sigma$ . For the converse, suppose that $G(\lambda)$ and $H(\lambda)$ are equivalent at each $\lambda_{0}\in\Sigma.$ Then, $G(\lambda)$ and $H(\lambda)$ have the same local Smith–McMillan forms at each $\lambda_{0}\in\Sigma.$ In particular, $G(\lambda)$ and $H(\lambda)$ have the same pole and zero elementary divisors at each $\lambda_{0}\in\Sigma.$ Let us consider $M_{G}(\lambda)$ and $M_{H}(\lambda)$ as the global Smith–McMillan forms of $G(\lambda)$ and $H(\lambda),$ respectively. Thus, there exist unimodular matrices $U_{i}^{G}(\lambda),$ $U_{i}^{H}(\lambda)$ for $i=1,2,$ such that $G(\lambda)=U_{1}^{G}(\lambda)M_{G}(\lambda)U_{2}^{G}(\lambda)$ , $H(\lambda)=U_{1}^{H}(\lambda)M_{H}(\lambda)U_{2}^{H}(\lambda)$ , and we can write

[TABLE]

where $f_{i}(\lambda)$ are rational functions which are either equal to one or have poles and zeros in $\Sigma,$ while $g_{i}(\lambda)$ and $h_{i}(\lambda)$ are rational functions that do not have neither poles nor zeros in $\Sigma.$ Let us define $R(\lambda):=\operatorname{diag}\left(\dfrac{h_{1}(\lambda)}{g_{1}(\lambda)},\ldots,\dfrac{h_{r}(\lambda)}{g_{r}(\lambda)},I_{m-r}\right).$ Hence, $M_{H}(\lambda)=M_{G}(\lambda)R(\lambda).$ Therefore, we deduce that $H(\lambda)=U_{1}^{H}(\lambda)U_{1}^{G}(\lambda)^{-1}G(\lambda)U_{2}^{G}(\lambda)^{-1}R(\lambda)U_{2}^{H}(\lambda),$ and $G(\lambda)$ and $H(\lambda)$ are equivalent in $\Sigma$ since the matrices $U_{1}^{H}(\lambda)U_{1}^{G}(\lambda)^{-1}$ and $U_{2}^{G}(\lambda)^{-1}R(\lambda)U_{2}^{H}(\lambda)$ are regular in $\Sigma.$ ∎

3 Polynomial system matrices minimal in subsets of $\mathbb{F}$ and at infinity

Polynomial system matrices are a classical tool for studying rational matrices. They were introduced by Rosenbrock and are analyzed in detail in [28]. Among them, minimal polynomial system matrices have been used in many problems dealing with rational matrices because they allow to extract all the information about finite poles and zeros. Recently, they have played a fundamental role in developing a rigorous theory of linearizations and strong linearizations of rational matrices [5]. In this section, we extend the concept of minimal polynomial system matrices from the classical global scenario to a local one. Some of the definitions in this section can also be found in [6] expressed in an abstract algebraic language.

3.1 Polynomial system matrices minimal in subsets of $\mathbb{F}$

In this section we introduce polynomial system matrices of rational matrices that are locally minimal, and study their properties. Consider the fact that any rational matrix $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ can be written as

[TABLE]

for some polynomial matrices $A(\lambda)\in\mathbb{F}[\lambda]^{n\times n},$ $B(\lambda)\in\mathbb{F}[\lambda]^{n\times m}$ , $C(\lambda)\in\mathbb{F}[\lambda]^{p\times n}$ and $D(\lambda)\in\mathbb{F}[\lambda]^{p\times m}$ with $A(\lambda)$ nonsingular if $n>0$ (see [28]). Then the matrix polynomial

[TABLE]

is called a polynomial system matrix of $G(\lambda)$ [28]. That is, $G(\lambda)$ is the Schur complement of $A(\lambda)$ in $P(\lambda)$ . In that case, $A(\lambda)$ is called the state matrix of $P(\lambda)$ and $G(\lambda)$ is the transfer function matrix of $P(\lambda).$ If $n=0,$ we assume that the matrices $A(\lambda),$ $B(\lambda)$ and $C(\lambda)$ are empty, and $P(\lambda)=G(\lambda)=D(\lambda)$ is a polynomial matrix. We emphasize that the definition of polynomial system matrix of a rational matrix includes a specific partition. Sometimes in this paper a certain polynomial matrix is partitioned in different ways giving rise to different polynomial system matrices of (possibly) different rational matrices. In such cases, we often use expressions as “ $P(\lambda)$ is a polynomial system matrix of $G(\lambda)$ with state matrix $A(\lambda)$ ” in order to avoid ambiguities, where the words “of $G(\lambda)$ ” may be omitted because $P(\lambda)$ and $A(\lambda)$ determine $G(\lambda)$ . In the case $n=0$ mentioned above, we will use “ $P(\lambda)$ is a polynomial system matrix with empty state matrix”. We stress that although in (4) the state matrix is in the $(1,1)$ -block, it might be a different submatrix of $P(\lambda)$ . In general, the fundamental property defining a polynomial system matrix is that the rational matrix is the Schur complement of the state matrix.

We remark that the relation between the normal ranks of $P(\lambda)$ and its transfer function matrix $G(\lambda)$ is

[TABLE]

since we can write $P(\lambda)$ as

[TABLE]

Next, we introduce two of the main definitions of this work.

Definition 3.1 (Polynomial system matrix minimal at a point in $\mathbb{F}$ ).

Let $\lambda_{0}\in\mathbb{F}.$ The polynomial system matrix $P(\lambda)$ in (4), with $n>0,$ is said to be minimal at $\lambda_{0}$ if

[TABLE]

Remark 3.2.

If $P(\lambda)$ is a polynomial system matrix as in (4), with $n>0,$ then

[TABLE]

since $A(\lambda)$ is nonsingular. Thus $P(\lambda)$ is minimal at $\lambda_{0}$ if and only if $\lambda_{0}$ is neither an eigenvalue of $\begin{bmatrix}A(\lambda)\\ C(\lambda)\end{bmatrix}$ nor of $\begin{bmatrix}A(\lambda)&B(\lambda)\end{bmatrix}.$

Definition 3.3 (Polynomial system matrix minimal in a subset of $\mathbb{F}$ ).

Let $\Sigma\subseteq\mathbb{F}$ be nonempty. The polynomial system matrix $P(\lambda)$ in (4), with $n>0,$ is minimal in $\Sigma$ if $P(\lambda)$ is minimal at each point $\lambda_{0}\in\Sigma.$

Observe that Definitions 3.1 and 3.3 extend to points and subsets of $\mathbb{F}$ the classical definition of minimal, or with least order, polynomial system matrices introduced in [28]. Rosenbrock’s definition coincides with Definition 3.3 when $\Sigma=\mathbb{F}.$

Remark 3.4.

For convenience, if $n=0$ in (4), we adopt the agreement that $P(\lambda)$ is minimal at every point $\lambda_{0}\in\mathbb{F}.$

In the next example, we illustrate Definition 3.3 with a rational matrix and a polynomial system matrix taken from the recent reference [29] dealing with numerical algorithms for solving NLEPs via rational approximation. We advance that we will use the matrices in Example 3.5 several times for illustrating different concepts introduced in this paper as well as for establishing a first connection between the theory developed in this paper and NLEPs. In this respect, we emphasize that [29] does not mention at all polynomial system matrices, and that the same happens with references [18, 30].

Example 3.5.

Let $G(\lambda)$ be a rational matrix of the form

[TABLE]

with $A_{0},B_{0},\ldots,B_{s}\in\mathbb{C}^{p\times p},$ $\sigma_{1},\ldots,\sigma_{s}\in\mathbb{C},$ and $\sigma_{i}\neq\sigma_{j}$ if $i\neq j$ . Let us consider the linear polynomial matrix

[TABLE]

These matrices are introduced in [29] to tackle a NLEP $T(\lambda)v=0,$ in a certain region $\Omega\subseteq\mathbb{C},$ where the matrix $T(\lambda)$ is of the form $T(\lambda)=-B_{0}+\lambda A_{0}+f_{1}(\lambda)A_{1}+\cdots+f_{q}(\lambda)A_{q},$ with $A_{0},A_{1},\ldots,A_{q}\in\mathbb{C}^{p\times p}$ and $f_{i}:\Omega\subseteq\mathbb{C}\longrightarrow\mathbb{C},$ $i=1,\ldots,q,$ being scalar functions nonlinear in the variable $\lambda$ and holomorphic in $\Omega.$ For solving a NLEP of this form, the nonlinear matrix $T(\lambda)$ is approximated in $\Omega$ by a rational matrix $G(\lambda)$ as in (6), and $P(\lambda)$ is considered to linearize $G(\lambda).$ It is easy to see that $P(\lambda)$ is, in fact, a linear polynomial system matrix of $G(\lambda),$ by setting the matrix $\operatorname{diag}((\lambda-\sigma_{1})I,\ldots,(\lambda-\sigma_{s})I)$ as state matrix $A(\lambda)$ in (4). Moreover, without any assumption, $P(\lambda)$ is minimal in $\Sigma:=\mathbb{C}\setminus\{\sigma_{1},\ldots,\sigma_{s}\}.$ In particular, and according to [29], $\Omega$ is a subset of $\Sigma.$ Therefore, $P(\lambda)$ is minimal in the target set $\Omega.$ For completeness, notice that a polynomial system matrix as $P(\lambda)$ is minimal in $\mathbb{C}$ if and only if all the matrices $B_{1},\ldots,B_{s}$ are nonsingular. We also emphasize that the form of the rational matrix $G(\lambda)$ in (6) is very particular because it is the sum of a linear polynomial matrix and strictly proper rational matrices with linear denominators, which simplifies considerably working with it from different perspectives. We will consider later more complicated examples.

The next result provides the pole and zero elementary divisors of a rational matrix $G(\lambda)$ at any finite point $\lambda_{0}\in\mathbb{F}$ from any polynomial system matrix of $G(\lambda)$ minimal at $\lambda_{0}.$ This result is the counterpart of [28, Chapter 3, Theorem 4.1] for polynomial system matrices minimal at a finite point instead of polynomial system matrices of least order.

Theorem 3.6.

Let $\lambda_{0}\in\mathbb{F}.$ Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and let

[TABLE]

be a polynomial system matrix minimal at $\lambda_{0}$ whose transfer function matrix is $G(\lambda).$ Then the elementary divisors of $A(\lambda)$ at $\lambda_{0}$ are the pole elementary divisors of $G(\lambda)$ at $\lambda_{0},$ and the elementary divisors of $P(\lambda)$ at $\lambda_{0}$ are the zero elementary divisors of $G(\lambda)$ at $\lambda_{0}.$

Proof.

Let us consider the Smith normal form of $\begin{bmatrix}A(\lambda)&B(\lambda)\end{bmatrix}.$ Namely,

[TABLE]

with $U(\lambda)$ and $V(\lambda)$ unimodular matrices. Observe that $S(\lambda)\in\mathbb{F}[\lambda]^{n\times n}$ is invertible as a rational matrix since $\mathop{\rm rank}\nolimits\begin{bmatrix}A(\lambda)&B(\lambda)\end{bmatrix}=n.$ We set $H_{1}(\lambda):=S(\lambda)^{-1}U(\lambda).$ Since $P(\lambda)$ is minimal at $\lambda_{0},$ $S(\lambda)$ has no zeros at $\lambda_{0}.$ Therefore, $H_{1}(\lambda)$ is regular at $\lambda_{0}.$ Moreover, $\begin{bmatrix}H_{1}(\lambda)A(\lambda)&H_{1}(\lambda)B(\lambda)\end{bmatrix}$ is a polynomial matrix, as it is equal to $\begin{bmatrix}I_{n}&0\end{bmatrix}V(\lambda)^{-1},$ has full row rank, and has no zeros in $\mathbb{F}.$ Now, let us consider the Smith normal form of the polynomial matrix $\begin{bmatrix}H_{1}(\lambda)A(\lambda)\\ -C(\lambda)\end{bmatrix}.$ Namely,

[TABLE]

with $\widetilde{U}(\lambda)$ and $\widetilde{V}(\lambda)$ unimodular matrices. Observe that $\widetilde{S}(\lambda)\in\mathbb{F}[\lambda]^{n\times n}$ is invertible as a rational matrix since $H_{1}(\lambda)$ is invertible and $\mathop{\rm rank}\nolimits\begin{bmatrix}A(\lambda)\\ C(\lambda)\end{bmatrix}=n.$ We set $H_{2}(\lambda):=\widetilde{V}(\lambda)\widetilde{S}(\lambda)^{-1}.$ Moreover, the matrix $\begin{bmatrix}H_{1}(\lambda)A(\lambda)H_{2}(\lambda)\\ -C(\lambda)H_{2}(\lambda)\end{bmatrix}$ is also polynomial, as it is equal to $\widetilde{U}(\lambda)^{-1}\begin{bmatrix}I_{n}\\ 0\end{bmatrix}$ , has full column rank, and has no zeros in $\mathbb{F}.$ Since $P(\lambda)$ is minimal at $\lambda_{0}$ and $H_{1}(\lambda)$ is regular at $\lambda_{0},$ $\widetilde{S}(\lambda)$ has not zeros at $\lambda_{0}.$ Therefore, $H_{2}(\lambda)$ is regular at $\lambda_{0}.$ Let us define now the polynomial system matrix

[TABLE]

We claim that $\widetilde{P}(\lambda)$ is a minimal polynomial system matrix in $\mathbb{F}$ or in the classical sense of Rosenbrock [28]. For that, it remains to prove that the matrix

[TABLE]

has full row rank for all $\lambda\in\mathbb{F}.$ Let us suppose that there exists $\lambda_{1}\in\mathbb{F}$ such that $\mathop{\rm rank}\nolimits Z(\lambda_{1})<n.$ On the one hand, we know that

[TABLE]

since the Smith normal form of $\begin{bmatrix}H_{1}(\lambda)A(\lambda)&H_{1}(\lambda)B(\lambda)\end{bmatrix}$ is equal to $\begin{bmatrix}I_{n}&0\end{bmatrix}$ and $\widetilde{V}(\lambda)$ is unimodular. On the other hand, we have that

[TABLE]

which is a contradiction. Therefore, $\widetilde{P}(\lambda)$ is a minimal polynomial system matrix. Its transfer function matrix is $G(\lambda).$ Then, by [28, Chapter 3, Theorem 4.1], we know that the zero elementary divisors of $G(\lambda)$ are the elementary divisors of $\widetilde{P}(\lambda),$ and that the pole elementary divisors of $G(\lambda)$ are the elementary divisors of $H_{1}(\lambda)A(\lambda)H_{2}(\lambda).$ Finally, the result follows by taking into account that the matrices $P(\lambda)$ and $\widetilde{P}(\lambda)$ are equivalent at $\lambda_{0},$ and that the matrices $A(\lambda)$ and $H_{1}(\lambda)A(\lambda)H_{2}(\lambda)$ are also equivalent at $\lambda_{0},$ since $H_{1}(\lambda)$ and $H_{2}(\lambda)$ are both regular at that point. ∎

Theorem 3.6 can be extended to any subset of $\mathbb{F}$ in a natural way, by applying this theorem to every point of that subset.

Theorem 3.7.

Let $\Sigma\subseteq\mathbb{F}$ be nonempty. Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and let

[TABLE]

be a polynomial system matrix minimal in $\Sigma$ whose transfer function matrix is $G(\lambda).$ Then the elementary divisors of $A(\lambda)$ in $\Sigma$ are the pole elementary divisors of $G(\lambda)$ in $\Sigma,$ and the elementary divisors of $P(\lambda)$ in $\Sigma$ are the zero elementary divisors of $G(\lambda)$ in $\Sigma.$

Example 3.8.

If Theorem 3.7 is applied to the matrices $G(\lambda)$ and $P(\lambda)$ and the set $\Sigma$ in Example 3.5, we obtain immediately that (without any hypothesis) the eigenvalues of $P(\lambda)$ in $\Sigma$ coincide exactly with the zeros of $G(\lambda)$ in $\Sigma$ , with exactly the same multiplicities (geometric, algebraic and partial). Observe also that all the zeros of $G(\lambda)$ in $\Sigma$ are, in fact, eigenvalues of $G(\lambda)$ because the only potential poles of $G(\lambda)$ are $\sigma_{1},\ldots,\sigma_{s}$ . This result is stronger than Lemma 3.1 and Corollary 3.2 in [29] from two perspectives: [29] deals with determinants and, so, only gives information on algebraic multiplicities, and the requests in [29] impose the additional hypothesis that $A_{0}$ is nonsingular. Note that, under the assumption that all the matrices $B_{1},\ldots,B_{s}$ are nonsingular, we obtain that $P(\lambda)$ (and $A(\lambda)$ ) allows us to obtain the complete information on finite poles and zeros (including all the multiplicities) of $G(\lambda)$ in $\mathbb{C}.$

3.2 Polynomial system matrices minimal at infinity

Theorems 3.6 and 3.7 characterize polynomial system matrices that contain the information of the invariant orders at finite points of their transfer functions. The extension of these results for including the information at infinity is an old problem that has been considered in classical papers as, for instance, in [36, 37]. However, a satisfactory solution has been found, so far, only for polynomial system matrices with state matrix $A(\lambda)$ being a linear polynomial matrix and the other blocks $B(\lambda),$ $C(\lambda),$ $D(\lambda)$ being constant matrices. In other cases, recovering the information at infinity requires to embed the polynomial system matrix into a larger matrix. In this section, we propose a new approach for obtaining a counterpart of Theorem 3.6 at infinity. This approach is motivated by the recent work [5], but presents relevant differences with respect to [5], and is based on the use of “reversals” and local equivalences of rational matrices.

In order to develop our counterpart of Theorem 3.6 at infinity, first, we introduce the notion of $g$ -reversal of a rational matrix in Definition 3.9, where $g$ is any integer. In this definition we will use, for a particular value of $g,$ the well-known fact that any rational matrix $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ can be uniquely written as

[TABLE]

where $Q(\lambda)\in\mathbb{F}[\lambda]^{p\times m}$ is a polynomial matrix and $G_{sp}(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ is a strictly proper rational matrix. The equation (8) follows from the Euclidean division for polynomials applied to each entry of $G(\lambda).$ The matrices $Q(\lambda)$ and $G_{sp}(\lambda)$ are called the polynomial part and the strictly proper part of $G(\lambda)$ , respectively. A polynomial matrix $Q(\lambda)$ is said to have degree $d$ if $d$ is the largest exponent of the variable $\lambda$ of its entries with nonzero coefficient. In such a case, $d$ is denoted by $\deg(Q(\lambda)).$

Definition 3.9 ( $g$ -reversal of a rational matrix).

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ be a rational matrix, and let $g$ be an integer. We define the $g$ -reversal of $G(\lambda)$ as the rational matrix

[TABLE]

Let $G(\lambda)$ be expressed as in (8). If $g=\deg(Q(\lambda))$ whenever $G(\lambda)$ is not strictly proper, or $g=0$ if $G(\lambda)$ is strictly proper, then the $g$ -reversal is called the reversal of $G(\lambda)$ and it is often denoted by just $\operatorname{rev}G(\lambda).$

Note that if $Q(\lambda)$ in (8) is a constant matrix, including the zero matrix, then $\operatorname{rev}G(\lambda)=G\left(1/\lambda\right)$ . Definition 3.9 extends the definition of $g$ -reversal for polynomial matrices (see, for instance, [9, Definition 2.12]). However, we emphasize that in the definition of $g$ -reversal of a polynomial matrix considered previously in the literature, $g$ is always taken larger than or equal to the degree of the polynomial matrix, while in Definition 3.9 we only ask for $g$ to be an integer.

Given a polynomial system matrix $P(\lambda)$ as in (4), we have that

[TABLE]

where $d$ is the degree of $P(\lambda),$ is also a polynomial matrix. Moreover, $\operatorname{rev}_{d}A(\lambda)$ is nonsingular since $A(\lambda)$ is nonsingular. Therefore, $\operatorname{rev}P(\lambda)$ is also a polynomial system matrix. We now introduce Definition 3.10 about minimality at infinity of a polynomial system matrix.

Definition 3.10 (Polynomial system matrix minimal at infinity).

The polynomial system matrix $P(\lambda)$ in (4) is minimal at $\infty$ if $\operatorname{rev}P(\lambda)$ is minimal at $0.$

Example 3.11.

The polynomial system matrix $P(\lambda)$ with transfer function matrix $G(\lambda)$ in Example 3.5 is minimal at $\infty$ since

[TABLE]

is, obviously, minimal at $0.$

Remark 3.12.

A polynomial system matrix $P(\lambda)$ as in (4), with $\deg(P(\lambda))=d$ and $n>0,$ is minimal at $\infty$ if and only if

[TABLE]

More precisely, let $A_{d},$ $B_{d},$ $C_{d}$ and $D_{d}$ be the matrix coefficients of $\lambda^{d}$ in $A(\lambda),$ $B(\lambda),$ $C(\lambda)$ and $D(\lambda),$ respectively. Then the fact of $P(\lambda)$ being minimal at $\infty$ is equivalent to

[TABLE]

Notice that if $d=0$ then $P(\lambda)$ is a constant polynomial system matrix, and $A_{0}$ must be invertible. Therefore, in this case, the rank condition above is automatically satisfied, and $P(\lambda)$ is minimal at $\infty.$

Theorem 3.13 is essentially the counterpart of Theorem 3.6 at infinity. We state it in terms of reversals and their elementary divisors at [math] as we only have defined elementary divisors for finite points. The implications of Theorem 3.13 on the structure at infinity are made explicit in Theorem 3.15.

Theorem 3.13.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and let

[TABLE]

be a polynomial system matrix of degree $d$ minimal at $\infty$ whose transfer function matrix is $G(\lambda).$ Then the elementary divisors of $\operatorname{rev}_{d}A(\lambda)$ at [math] are the pole elementary divisors of $\operatorname{rev}_{d}G(\lambda)$ at $0,$ and the elementary divisors of $\operatorname{rev}P(\lambda)$ at [math] are the zero elementary divisors of $\operatorname{rev}_{d}G(\lambda)$ at $0.$

Proof.

It can be easily proved that the transfer function matrix of $\operatorname{rev}P(\lambda)$ is $\operatorname{rev}_{d}G(\lambda).$ The theorem then follows by applying Theorem 3.6, since $\operatorname{rev}P(\lambda)$ is minimal at $0.$ ∎

Once we have obtained the elementary divisors of the $d$ -reversal of a rational matrix at [math], from one of its polynomial system matrices of degree $d$ minimal at $\infty,$ we can then obtain its invariant orders at infinity as we state in Theorem 3.15. For proving that, we use Lemma 3.14.

Lemma 3.14.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ with $\mathop{\rm rank}\nolimits G(\lambda)=r,$ and let $g$ be an integer. Let $e_{1},\ldots,e_{r}$ be the invariant orders of $\operatorname{rev}_{g}G(\lambda)$ at $0,$ and let $q_{1},\ldots,q_{r}$ be the invariant orders at infinity of $G(\lambda).$ Then

[TABLE]

Proof.

From the local Smith–McMillan form at infinity of $G(\lambda),$ there exist biproper rational matrices $B_{1}(\lambda)$ and $B_{2}(\lambda)$ such that

[TABLE]

Let us perform the transformation $\lambda\longmapsto 1/\lambda$ on the variable of the equation above. Thus,

[TABLE]

By [4, Lemma 6.9], $B_{1}(1/\lambda)$ and $B_{2}(1/\lambda)$ are regular at $0.$ We now multiply the previous equation by $\lambda^{g},$ and we get that $q_{i}+g$ for $i=1,\ldots,r$ are the invariant orders of $\operatorname{rev}_{g}G(\lambda)$ at $0.$ ∎

Theorem 3.15.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ with $\mathop{\rm rank}\nolimits G(\lambda)=r$ and let

[TABLE]

be a polynomial system matrix of degree $d$ minimal at $\infty$ whose transfer function matrix is $G(\lambda).$ Let $e_{1}\leq\cdots\leq e_{s}$ be the partial multiplicities of $\operatorname{rev}_{d}A(\lambda)$ at [math] and let $\widetilde{e}_{1}\leq\cdots\leq\widetilde{e}_{u}$ be the partial multiplicities of $\operatorname{rev}P(\lambda)$ at $0.$ Then the invariant orders at infinity $q_{1}\leq q_{2}\leq\cdots\leq q_{r}$ of $G(\lambda)$ are

[TABLE]

Proof.

By Theorem 3.13, we know that $e_{i}$ and $\widetilde{e}_{j}$ with $i=1,\ldots,s$ and $j=1,\ldots,u$ are the pole and zero partial multiplicities of $\operatorname{rev}_{d}G(\lambda)$ at $0,$ respectively. Thus, the invariant orders of $\operatorname{rev}_{d}G(\lambda)$ at [math] are $-e_{s}\leq-e_{s-1}\leq\cdots\leq-e_{1}<\underbrace{0=\cdots=0}_{r-s-u}<\widetilde{e}_{1}\leq\cdots\leq\widetilde{e}_{u}.$ Then the use of Lemma 3.14 completes the proof. ∎

Example 3.16.

By combining Theorem 3.15 and Example 3.11, we see that $P(\lambda),$ in Example 3.5, contains the complete information about the invariant orders at $\infty$ of $G(\lambda)$ (without imposing any hypothesis). Note that, in this case, $d=1$ and that the $1$ -reversal of the state matrix, i.e., $\operatorname{rev}_{1}A(\lambda)=\operatorname{diag}((1-\lambda\sigma_{1})I,\ldots,(1-\lambda\sigma_{s})I)$ , has no partial multiplicities at [math]. This result on the relationship between the infinite structure of $G(\lambda)$ and the reversal of $P(\lambda)$ is not mentioned in [29]. In this context, it is worth emphasizing that modern references on NLEPs and their rational approximations do not pay attention to the structure at $\infty$ , while such structure plays an important role in many classic references of linear system theory and control [19, 20, 36, 37].

For polynomial system matrices that are minimal at infinity and, also, at every finite point, we state Definition 3.17 about strong minimality. This definition has already been introduced in [13, Definition 3.3]. However, in [13] the definition is given in terms of eigenvalues instead of minimality at every point, but both definitions are equivalent.

Definition 3.17 (Strongly minimal polynomial system matrix).

The polynomial system matrix $P(\lambda)$ in (4) is strongly minimal if it is minimal at each point of $\mathbb{F}\cup\{\infty\}.$

We emphasize that, as a consequence of Theorems 3.6 and 3.15, strongly minimal polynomial system matrices contain all the information about the invariant orders of their transfer function matrices, both at finite points and at infinity.

4 Local linearizations of rational matrices

In practice, one is often interested in studying the pole and zero structure of rational matrices not in the whole space $\mathbb{F}\cup\{\infty\}$ but in a particular region (see [17, 18, 21, 29]). For instance, this happens when a rational eigenvalue problem arises from approximating a nonlinear eigenvalue problem, since the approximation is usually reliable only in a target region not containing poles. As a consequence, the eigenvalues (those zeros that are not poles) of the approximating rational eigenvalue problem need to be computed only in that region. In this scenario, one can use local linearizations of the corresponding rational matrix which contain the information about the poles and zeros in the target region, but might not in the whole space $\mathbb{F}\cup\{\infty\}.$ In addition, they do not satisfy, in general, the conditions of the strong linearizations of rational matrices introduced in [5]. Thereby local linearizations provide extra flexibility in solving nonlinear eigenvalue problems.

In this section, we give separately the definitions of linearizations of rational matrices in subsets of $\mathbb{F}$ and at infinity, study their properties and establish connections with the linearizations introduced in [5]. These linearizations will be useful in order to study the pole and zero structure of rational matrices in different sets containing or not infinity. In particular, and as an application of these definitions, we will study in Section 6 the structure of the linearizations that appear in [18].

4.1 Linearizations in subsets of $\mathbb{F}$

In this subsection we introduce the definition of linearization of a rational matrix in a set not containing infinity and study some of its properties. We start by giving the definition of linearization at a finite point.

Definition 4.1 (Linearization at a point in $\mathbb{F}$ ).

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and let $\lambda_{0}\in\mathbb{F}.$ Let

[TABLE]

be a linear polynomial system matrix and let

[TABLE]

be its transfer function matrix. $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ at $\lambda_{0}$ if the following conditions hold:

(a)

$\mathcal{L}(\lambda)$ * is minimal at $\lambda_{0}$ , and*

(b)

there exist nonnegative integers $s_{1},s_{2}$ satisfying $s_{1}-s_{2}=q-p=r-m,$ and rational matrices $R_{1}(\lambda)\in\mathbb{F}(\lambda)^{(p+s_{1})\times(p+s_{1})}$ and $R_{2}(\lambda)\in\mathbb{F}(\lambda)^{(m+s_{1})\times(m+s_{1})}$ regular at $\lambda_{0}$ such that

[TABLE]

Remark 4.2.

Notice that, in Definition 4.1, the following two cases are allowed:

$\widehat{G}(\lambda)=G(\lambda)$ . Then we just have to check condition $\rm(a)$ , since condition $\rm(b)$ is satisfied by setting $R_{1}(\lambda)=I_{p}$ , $R_{2}(\lambda)=I_{m},$ and $s_{1}=s_{2}=0$ . 2. 2.

$n=0.$ Then it is not necessary to take into account condition $\rm(a)$ (it is automatically satisfied by the agreement in Remark 3.4) and, therefore, we just have to check condition $\rm(b)$ with $\widehat{G}(\lambda)=D_{1}\lambda+D_{0}=\mathcal{L}(\lambda)$ .

We remark these extreme cases since they are important in applications, and make Definition 4.1 very general.

We now extend, in a natural way, the notion of linearization at a finite point to linearization in subsets of $\mathbb{F}.$

Definition 4.3 (Linearization in a subset of $\mathbb{F}$ ).

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and let $\Sigma\subseteq\mathbb{F}$ be nonempty. A linear polynomial system matrix $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ in $\Sigma$ if $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ at each point $\lambda_{0}\in\Sigma.$

Since linearizations of rational matrices are, in particular, polynomial system matrices, their definition includes a specific partition. Thus, a fixed linear polynomial matrix (also called a matrix pencil) may be partitioned in different ways giving rise to different linearizations of the same or of different rational matrices, or in different subsets. To deal with different partitions, we will use expressions as “ $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ in $\Sigma$ with state matrix $A_{1}\lambda+A_{0}$ ” when it is necessary for avoiding any ambiguity. The expression “ $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ in $\Sigma$ with empty state matrix” will cover the case $n=0$ in (10).

In condition (11), one can always take $s_{1}=0$ or $s_{2}=0,$ according to $p\geq q$ and $m\geq r$ or $q\geq p$ and $r\geq m$ , respectively. This is a consequence of the local Smith–McMillan forms of $\operatorname{diag}(G(\lambda),I_{s_{1}})$ and $\operatorname{diag}(\widehat{G}(\lambda),I_{s_{2}})$ being equivalent to each other at $\lambda_{0}$ . In the rest of the results of this subsection, we will consider $s:=s_{1}\geq 0$ and $s_{2}=0,$ since it corresponds to the most interesting situation in applications.

Remark 4.4.

If we have a linearization of $G(\lambda)$ in a set $\Sigma$ then, for each point $\mu\in\Sigma$ , there exist rational matrices $R_{1}^{\mu}(\lambda)$ and $R_{2}^{\mu}(\lambda)$ regular at $\mu$ such that $R_{1}^{\mu}(\lambda)\operatorname{diag}(G(\lambda),I_{s})R_{2}^{\mu}(\lambda)\allowbreak=\widehat{G}(\lambda).$ In principle, for different values of $\mu\in\Sigma,$ the rational matrices $R_{1}^{\mu}(\lambda)$ (respectively, $R_{2}^{\mu}(\lambda)$ ) may be different from each other, that is, $R_{1}^{\mu}(\lambda)$ (resp., $R_{2}^{\mu}(\lambda)$ ) depends on $\mu.$ However, Proposition 2.7 implies that the existence of $R_{1}^{\mu}(\lambda)$ and $R_{2}^{\mu}(\lambda)$ for each $\mu\in\Sigma$ is equivalent to the existence of two rational matrices $R_{1}(\lambda)$ and $R_{2}(\lambda)$ both regular in $\Sigma$ (and independent of $\mu$ ) such that $R_{1}(\lambda)\operatorname{diag}(G(\lambda),I_{s})R_{2}(\lambda)=\widehat{G}(\lambda)$ .

Remark 4.5.

When $\Sigma=\mathbb{F},$ in Definition 4.3, condition (11) is satisfied with $R_{1}(\lambda)$ and $R_{2}(\lambda)$ unimodular matrices. Therefore, a linearization in $\mathbb{F},$ or at every point of $\mathbb{F},$ is a linearization in the sense of [5, Definition 3.2] and vice versa.

The next result gives the relation between the invariant orders at a finite point of a rational matrix $G(\lambda)$ and those of a rational matrix of the form $\operatorname{diag}(G(\lambda),I_{s}),$ with $s>0$ . It is motivated by equation (11).

Lemma 4.6.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ , $\lambda_{0}\in\mathbb{F}$ and let $\nu_{1}\leq\cdots\leq\nu_{k}<0=\nu_{k+1}=\cdots=\nu_{u-1}<\nu_{u}\leq\cdots\leq\nu_{r}$ be the invariant orders of $G(\lambda)$ at $\lambda_{0}.$ Consider $\widetilde{G}(\lambda):=\operatorname{diag}(G(\lambda),I_{s})$ with $s>0.$ Then the invariant orders of $\widetilde{G}(\lambda)$ at $\lambda_{0}$ are $\widetilde{\nu}_{1}\leq\cdots\leq\widetilde{\nu}_{k}<0=\widetilde{\nu}_{k+1}=\cdots=\widetilde{\nu}_{u+s-1}<\widetilde{\nu}_{u+s}\leq\cdots\leq\widetilde{\nu}_{r+s},$ where $\widetilde{\nu}_{i}=\nu_{i}$ for $i=1,\ldots,k,$ and $\widetilde{\nu}_{j+s}=\nu_{j}$ for $j={u,\ldots,r}.$

Proof.

Let $M(\lambda):=\operatorname{diag}\left((\lambda-\lambda_{0})^{\nu_{1}},\ldots,(\lambda-\lambda_{0})^{\nu_{r}},0_{(p-r)\times(m-r)}\right)$ be the local Smith–McMillan form of $G(\lambda)$ at $\lambda_{0}.$ Then, $G(\lambda)=R_{1}(\lambda)M(\lambda)R_{2}(\lambda)$ for some rational matrices $R_{1}(\lambda)$ and $R_{2}(\lambda)$ regular at $\lambda_{0}.$ Moreover, $\widetilde{G}(\lambda)=\operatorname{diag}\left(R_{1}(\lambda),I_{s}\right)\operatorname{diag}\left(M(\lambda),I_{s}\right)\operatorname{diag}\allowbreak\left(R_{2}(\lambda),I_{s}\right).$ Therefore, since the matrices $\operatorname{diag}\left(R_{1}(\lambda),I_{s}\right)$ and $\operatorname{diag}\left(R_{2}(\lambda),I_{s}\right)$ are regular at $\lambda_{0},$ the local Smith–McMillan form of $\widetilde{G}(\lambda)$ at $\lambda_{0}$ is $\operatorname{diag}\left(M(\lambda),I_{s}\right)$ up to a permutation. ∎

Corollary 4.7 and Theorem 4.8 follow from Lemma 4.6. These results state the spectral information that one can obtain from local linearizations. More precisely, Theorem 4.8 is a spectral characterization of local linearizations in the spirit of [5, Theorem 3.10].

Corollary 4.7.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ , $\lambda_{0}\in\mathbb{F}$ and let

[TABLE]

be a linear polynomial system matrix minimal at $\lambda_{0}.$ Let $\widehat{G}(\lambda)$ be the transfer function matrix of $\mathcal{L}(\lambda).$ Then $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ at $\lambda_{0}$ if and only if the following two conditions hold:

(a)

$\mbox{\rm rank}\,\widehat{G}(\lambda)=\mbox{\rm rank}\,G(\lambda)+s$ , and 2. (b)

$G(\lambda)$ * and $\widehat{G}(\lambda)$ have exactly the same pole and zero elementary divisors at $\lambda_{0}.$ *

Proof.

If $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ at $\lambda_{0}$ then (a) and (b) are satisfied by Lemma 4.6, since $\operatorname{diag}(G(\lambda),I_{s})$ and $\widehat{G}(\lambda)$ are equivalent at $\lambda_{0}.$ For the converse, suppose that $\nu_{1}\leq\cdots\leq\nu_{k}<0=\nu_{k+1}=\cdots=\nu_{u-1}<\nu_{u}\leq\cdots\leq\nu_{r}$ are the invariant orders of $G(\lambda)$ at $\lambda_{0}.$ From (a) and (b), the Smith–McMillan form at $\lambda_{0}$ of $\widehat{G}(\lambda)$ must be $\operatorname{diag}\left((\lambda-\lambda_{0})^{\nu_{1}},\ldots,(\lambda-\lambda_{0})^{\nu_{u-1}},I_{s},(\lambda-\lambda_{0})^{\nu_{u}},\ldots,(\lambda-\lambda_{0})^{\nu_{r}},0_{(p-r)\times(m-r)}\right)$ . Observe that this is also the Smith–McMillan form at $\lambda_{0}$ of $\operatorname{diag}(G(\lambda),I_{s})$ , as proved in the previous lemma. Thus, $\operatorname{diag}(G(\lambda),I_{s})$ and $\widehat{G}(\lambda)$ are equivalent at $\lambda_{0}$ . ∎

Theorem 4.8 (Spectral characterization of linearizations at a point in $\mathbb{F}$ ).

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ , $\lambda_{0}\in\mathbb{F}$ and let

[TABLE]

be a linear polynomial system matrix minimal at $\lambda_{0}.$ Then $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ at $\lambda_{0}$ if and only if the following three conditions hold:

(a)

$\mbox{\rm rank}\,\mathcal{L}(\lambda)=\mbox{\rm rank}\,G(\lambda)+n+s$ , 2. (b)

the pole elementary divisors of $G(\lambda)$ at $\lambda_{0}$ are the elementary divisors of $A_{1}\lambda+A_{0}$ at $\lambda_{0},$ and 3. (c)

the zero elementary divisors of $G(\lambda)$ at $\lambda_{0}$ are the elementary divisors of $\mathcal{L}(\lambda)$ at $\lambda_{0}.$

Proof.

Let $\widehat{G}(\lambda)$ be the transfer function matrix of $\mathcal{L}(\lambda)$ . By (5), $\mbox{\rm rank}\,\widehat{G}(\lambda)=\mbox{\rm rank}\,\mathcal{L}(\lambda)-n$ . Moreover, by Theorem 3.6, the pole elementary divisors of $\widehat{G}(\lambda)$ at $\lambda_{0}$ are the elementary divisors of $A_{1}\lambda+A_{0}$ at $\lambda_{0}$ , and the zero elementary divisors of $\widehat{G}(\lambda)$ at $\lambda_{0}$ are the elementary divisors of $\mathcal{L}(\lambda)$ at $\lambda_{0}$ . The result follows from Corollary 4.7. ∎

It is immediate to obtain counterparts of Corollary 4.7 and Theorem 4.8 for linear polynomial system matrices minimal in sets $\Sigma\subseteq\mathbb{F}$ and for linearizations in $\Sigma.$ We omit to state such results for brevity.

The following proposition is a straightforward consequence of the definition of linearization in a subset of $\mathbb{F}$ by taking $s_{1}=s_{2}=0,$ $R_{1}(\lambda)=I_{p},$ $R_{2}(\lambda)=I_{m}$ and $G(\lambda)=\widehat{G}(\lambda),$ i.e., it corresponds to case 1 in Remark 4.2. However, we emphasize this result since it gives a sufficient condition that is easy to verify in order to ensure that a linear polynomial system matrix is a linearization of a rational matrix.

Proposition 4.9.

Let $\Sigma\subseteq\mathbb{F}$ be nonempty. Let

[TABLE]

be a linear polynomial system matrix and let $\widehat{G}(\lambda)$ be its transfer function matrix. If $\mathcal{L}(\lambda)$ is minimal in $\Sigma$ then $\mathcal{L}(\lambda)$ is a linearization of $\widehat{G}(\lambda)$ in $\Sigma.$

In plain words, any linear polynomial system matrix $\mathcal{L}(\lambda)$ is a linearization of its transfer function matrix in the sets where $\mathcal{L}(\lambda)$ is minimal.

Example 4.10.

Consider the matrices $G(\lambda)$ and $P(\lambda)$ and the set $\Sigma$ in Example 3.5, that were originally introduced in [29]. By combining the discussion in Example 3.5 with Proposition 4.9, we immediately obtain that $P(\lambda)$ is a linearization of $G(\lambda)$ in $\Sigma.$ With a bit more effort, it is also easy to obtain the following stronger result: $P(\lambda)$ is a linearization of $G(\lambda)$ in $\mathbb{C}\setminus\Pi$ where $\Pi:=\{\sigma_{i}:B_{i}\text{ is singular for }1\leq i\leq s\}.$

As mentioned in Example 3.5, the form of the rational matrix $G(\lambda)$ in (6) is very particular since its polynomial part and the denominators in the strictly proper part are linear. Thus, we finish this section by discussing in Example 4.11 a rational matrix with non linear polynomial part and with a general state space realization of the strictly proper part. For such general representation of rational matrices, an influential companion-like pencil associated to it was introduced in [30]. We will analyze this pencil from three different perspectives.

Example 4.11.

It is well-known that any rational matrix can be written in the form:

[TABLE]

By assuming $D_{q}\neq 0$ with $q\geq 2,$ from the expression above we define the pencil

[TABLE]

This pencil was introduced in [30] for regular rational matrices and is a particular case of the pencils considered in [5, Theorem 5.11] (modulo some permutations). In fact, [5, Theorem 5.11] proves that if $L(\lambda)$ is considered as a polynomial system matrix with state matrix $\lambda I-A,$ and $L(\lambda)$ is minimal in $\mathbb{F},$ then $L(\lambda)$ is a strong linearization of $G(\lambda)$ in the sense of [5, Definition 3.4] (we will revise this in subsection 4.23). Thus, under these conditions, $L(\lambda)$ contains all the information about the poles and zeros of $G(\lambda).$

We now consider $L(\lambda)$ from other two points of view different from the one in [5]. They will correspond to the two extreme cases described in Remark 4.2. First, we consider the following regular submatrix of $L(\lambda),$ obtained by removing the first block row and the penultimate block column:

[TABLE]

and we see $L(\lambda)$ as a polynomial system matrix with state matrix $A(\lambda).$ That is, once the state matrix is chosen, the other matrices in (4) are $D(\lambda):=D_{0},$ $C(\lambda):=[-\lambda D_{q}-D_{q-1}\allowbreak\quad-D_{q-2}\quad\cdots\quad-D_{1}\quad C]$ and $B(\lambda)^{T}:=[0\quad\cdots\quad 0\quad\lambda I_{m}\quad B^{T}]^{T}.$ With such partition of $L(\lambda),$ it is easy to see that the transfer function matrix of $L(\lambda)$ is precisely $G(\lambda),$ i.e., $D(\lambda)+C(\lambda)A(\lambda)^{-1}B(\lambda)=G(\lambda).$ For that, just take into account that the two last block columns of $A(\lambda)^{-1}$ are $[-\lambda^{q-2}I_{m}\,\,\cdots\,\,-\lambda I_{m}\quad-I_{m}\quad 0]^{T}$ and $[0\,\,\cdots\,\,0\quad(\lambda I_{n}-A)^{-T}]^{T}$ . Then, Proposition 4.9 guarantees that, without any extra hypothesis, $L(\lambda)$ is a linearization of $G(\lambda)$ in $\Omega:=\mathbb{F}\setminus\Lambda,$ where $\Lambda:=\{\lambda\in\mathbb{F}:\lambda\text{ is an eigenvalue of }A\}.$ With a bit more effort, it is also easy to see that if $\mathop{\rm rank}\nolimits\begin{bmatrix}\lambda_{0}I_{n}-A\\ C\end{bmatrix}=\mathop{\rm rank}\nolimits\begin{bmatrix}\lambda_{0}I_{n}-A&B\end{bmatrix}=n,$ for all $\lambda_{0}\in\Lambda,$ then $L(\lambda)$ is minimal in $\mathbb{F}$ and, thus, is a linearization of $G(\lambda)$ in $\mathbb{F}$ with state matrix $A(\lambda)$ in (15). Observe that, if we do not impose any hypothesis of minimality in $\Lambda,$ and $L(\lambda)$ is just a linearization in $\Omega,$ then we can not guarantee that $L(\lambda)$ has any information about the poles of $G(\lambda)$ since they are necessarily contained in $\Lambda.$ Moreover, the set $\Lambda$ might contain eigenvalues of $G(\lambda).$ This is not a problem in REPs coming from approximating NLEPs [18, 21, 29] because, in such cases, the target set is outside $\Lambda.$ However, it is in classical applications of rational matrices [19].

The second point of view is to consider $L(\lambda)$ as a linearization of $G(\lambda)$ in $\Omega$ with empty state matrix. To this purpose, we define the following rational matrices regular at $\Omega$ :

[TABLE]

Then, we check that $L(\lambda)V(\lambda)=U(\lambda)\operatorname{diag}(G(\lambda),I_{n},I_{m(q-1)}),$ which means that $L(\lambda)$ and $\operatorname{diag}(G(\lambda),I_{n},I_{m(q-1)})$ are equivalent in $\Omega$ and, so, that $L(\lambda)$ is a linearization of $G(\lambda)$ in $\Omega$ with empty state matrix (recall Remark 4.2(2)).

The two approaches described in Example 4.11 for viewing $L(\lambda)$ in (14) as a linearization of $G(\lambda)$ in $\Omega$ can be extended with more effort to many other of the pencils described in [5, Theorem 5.11]. We postpone these developments to future research to keep this paper concise.

4.2 Linearizations at infinity and in sets containing infinity

Our definition of linearization of a rational matrix at infinity is based on the notion of $g$ -reversal of a rational matrix introduced in Definition 3.9.

Definition 4.12 (Linearization at infinity of grade $g$ ).

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}.$ Let

[TABLE]

be a linear polynomial system matrix and let

[TABLE]

be its transfer function matrix. Let $g$ be an integer. $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ at $\infty$ of grade $g$ if the following conditions hold:

(a)

$\operatorname{rev}\mathcal{L}(\lambda)$ * is minimal at $0,$ and*

(b)

there exist nonnegative integers $s_{1},s_{2},$ with $s_{1}-s_{2}=q-p=r-m,$ and rational matrices $Q_{1}(\lambda)\in\mathbb{F}(\lambda)^{(p+s_{1})\times(p+s_{1})}$ and $Q_{2}(\lambda)\in\mathbb{F}(\lambda)^{(m+s_{1})\times(m+s_{1})}$ regular at [math] such that

[TABLE]

where $\ell=\deg(\mathcal{L}(\lambda)).$

Observe that Definition 4.12 allows, for completeness, the possibility of $\ell=\deg(\mathcal{L}(\lambda))$ being equal to $0.$ We admit that this case has a very limited interest in applications, since it corresponds to $\mathcal{L}(\lambda)$ and $\operatorname{rev}_{\ell}\widehat{G}(\lambda)=\widehat{G}(\lambda)$ being constant matrices. However, it includes linearizations at $\infty$ of rational matrices $G(\lambda)$ such that, for some integer $g,$ $\operatorname{rev}_{g}G(\lambda)$ has all its invariant orders at zero equal to zero. Moreover, notice that, in any case, $\operatorname{rev}\mathcal{L}(\lambda)$ is also a linear polynomial system matrix since $\operatorname{rev}_{\ell}(A_{1}\lambda+A_{0})$ is nonsingular. We then have the following characterization of linearizations at infinity.

Proposition 4.13.

A linear polynomial system matrix $\mathcal{L}(\lambda)$ as in (30) is a linearization of a rational matrix $G(\lambda)$ at $\infty$ of grade $g$ if and only if $\operatorname{rev}\mathcal{L}(\lambda)$ is a linearization of $\operatorname{rev}_{g}G(\lambda)$ at $0.$

Proof.

The proposition follows from the fact that $\operatorname{rev}_{\ell}\widehat{G}(\lambda)$ with $\ell=\deg(\mathcal{L}(\lambda))$ is the transfer function matrix of $\operatorname{rev}\mathcal{L}(\lambda).$ Then we make use of Definition 4.1. ∎

Conditions $\rm(a)$ and $\rm(b)$ in Definition 4.12 can be stated in a different way as we show in Remarks 4.14 and 4.15, respectively.

Remark 4.14.

As a particular case of what is discussed in Remark 3.12, condition $(a)$ in Definition 4.12 is equivalent to

[TABLE]

if $\mathcal{L}(\lambda)$ is nonconstant, i.e., if $\ell=1.$ If $\mathcal{L}(\lambda)$ is constant, i.e., $\ell=0.$ condition $(a)$ is automatically satisfied since $\mathcal{L}(\lambda)$ is a polynomial system matrix and, therefore, $A_{0}$ is invertible.

Remark 4.15.

By [4, Lemma 6.9], a rational matrix $Q(\lambda)$ is regular at [math] if and only if $Q(1/\lambda)$ is biproper. Therefore, condition $(b)$ in Definition 4.12 is equivalent to the matrices $\operatorname{diag}((1/\lambda)^{g}G(\lambda),I_{s_{1}})$ and $\operatorname{diag}((1/\lambda)^{\ell}\widehat{G}(\lambda),I_{s_{2}})$ being equivalent at infinity according to Definition 2.3. More precisely, a linear polynomial system matrix $\mathcal{L}(\lambda)$ as in (30) is a linearization of a rational matrix $G(\lambda)$ at $\infty$ of grade $g$ if and only if

(a)

$\mathcal{L}(\lambda)$ is minimal at $\infty,$ and

(b)

there exist nonnegative integers $s_{1},s_{2},$ with $s_{1}-s_{2}=q-p=r-m,$ and biproper matrices $B_{1}(\lambda)\in\mathbb{F}(\lambda)^{(p+s_{1})\times(p+s_{1})}$ and $B_{2}(\lambda)\in\mathbb{F}(\lambda)^{(m+s_{1})\times(m+s_{1})}$ such that

[TABLE]

We state in Theorem 4.16 a characterization of linearizations at infinity analogous to the one in Theorem 4.8 for linearizations at finite points. In this characterization, we consider the most usual situation $s_{1}:=s\geq 0$ and $s_{2}=0,$ assuming $q\geq p$ and $r\geq m.$ The proof of Theorem 4.16 is omitted since it follows immediately from Theorem 4.8 and Proposition 4.13.

Theorem 4.16 (Spectral characterization of linearizations at infinity).

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and let

[TABLE]

be a linear polynomial system matrix such that $\operatorname{rev}\mathcal{L}(\lambda)$ is minimal at [math] and let $\ell=\deg(\mathcal{L}(\lambda)).$ Then $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ at $\infty$ of grade $g$ if and only if the following three conditions hold:

(a)

$\mbox{\rm rank}\,\mathcal{L}(\lambda)=\mbox{\rm rank}\,G(\lambda)+n+s$ , 2. (b)

the pole elementary divisors of $\operatorname{rev}_{g}G(\lambda)$ at [math] are the elementary divisors of $\operatorname{rev}_{\ell}(A_{1}\lambda+A_{0})$ at $0,$ and 3. (c)

the zero elementary divisors of $\operatorname{rev}_{g}G(\lambda)$ at [math] are the elementary divisors of $\operatorname{rev}\mathcal{L}(\lambda)$ at $0.$

Next, we study in Proposition 4.17 how to recover the invariant orders at infinity of rational matrices from linearizations at infinity of grade $g.$

Proposition 4.17.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ with $\mathop{\rm rank}\nolimits G(\lambda)=r,$ and let

[TABLE]

be a linearization at infinity of grade $g$ of $G(\lambda)$ with $\ell=\deg(\mathcal{L}(\lambda)).$ Let $e_{1}\leq\cdots\leq e_{t}$ be the partial multiplicities of $\operatorname{rev}_{\ell}(A_{1}\lambda+A_{0})$ at [math], and let $\widetilde{e}_{1}\leq\cdots\leq\widetilde{e}_{u}$ be the partial multiplicities of $\operatorname{rev}\mathcal{L}(\lambda)$ at [math]. Then the invariant orders at infinity $q_{1}\leq q_{2}\leq\cdots\leq q_{r}$ of $G(\lambda)$ are

[TABLE]

Proof.

This proof is analogous to the one for Theorem 3.15. It follows just from combining Theorem 4.16 and Lemma 3.14. ∎

The following result is the counterpart of Proposition 4.9 but for linearizations at infinity. It shows when a linear polynomial system matrix is a linearization at infinity of its transfer function matrix. The proof is immediate and, therefore, omitted.

Proposition 4.18.

Let

[TABLE]

be a linear polynomial system matrix and let $\widehat{G}(\lambda)$ be its transfer function matrix. Then the following statements hold:

(a)

If $\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}\\ C_{1}\end{bmatrix}=\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}&B_{1}\end{bmatrix}=n$ then $\mathcal{L}(\lambda)$ is a linearization of $\widehat{G}(\lambda)$ at $\infty$ of grade $1.$

(b)

If $\mathcal{L}(\lambda)$ is constant then $\mathcal{L}(\lambda)$ is a linearization of $\widehat{G}(\lambda)$ at $\infty$ of grade $0.$

Example 4.19.

Consider the matrices in Example 3.5. By Proposition 4.18, the linear polynomial system matrix $P(\lambda)$ is a linearization of $G(\lambda)$ at $\infty$ of grade $1.$

Example 4.20.

Consider the matrices in Example 4.11. Let us view $L(\lambda)$ as a polynomial system matrix with state matrix $A(\lambda)$ in (15). With such partition, $G(\lambda)$ is the transfer function matrix of $L(\lambda).$ Then, by Proposition 4.18, $L(\lambda)$ is a linearization of $G(\lambda)$ at $\infty$ of grade $1$ if $D_{q}$ has full column rank. However, the condition $\mathop{\rm rank}\nolimits D_{q}=m$ is very restrictive, since it implies also $\mathop{\rm rank}\nolimits D(\lambda)=m.$ Moreover, the structure of $G(\lambda)$ at $\infty$ is, in such a case, trivial because it is very easy to see that the $m$ invariant orders at $\infty$ of $G(\lambda)$ are all equal to $-q.$ This is consistent with Proposition 4.17, because if $\mathop{\rm rank}\nolimits D_{q}=m$ then $\operatorname{rev}L(0)$ has full column rank and, thus, $\operatorname{rev}L(\lambda)$ does not have partial multiplicities at zero. Moreover, as $A(\lambda)$ is the pencil in (15), then it is easy to see that $\operatorname{rev}A(\lambda)$ has $m$ partial multiplicities at zero all equal to $q-1.$

Observe that, if we consider $A(\lambda)$ in (15) as state matrix of $L(\lambda),$ $\operatorname{rev}L(\lambda)$ is minimal at [math] if and only if $\mathop{\rm rank}\nolimits D_{q}=m.$ Thus, this hypothesis can not be avoided under such choice of state matrix. However, it is important to emphasize that if $L(\lambda)$ is viewed as a polynomial system matrix with empty state matrix then $L(\lambda)$ is a linearization of $G(\lambda)$ at $\infty$ of grade $q,$ without imposing any hypothesis. We postpone the proof of this result to Example 5.12.

A linear polynomial system matrix that satisfies Definition 4.3 in $\mathbb{F}$ and Definition 4.12, for a certain grade $g,$ allows us to recover the complete information about the poles and zeros of the corresponding rational matrix, finite and at infinity. This is due to Theorem 4.8 and Proposition 4.17. This important case leads us to introduce the following definition.

Definition 4.21 ( $g$ -strong linearization).

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and let $g$ be an integer. A linear polynomial system matrix $\mathcal{L}(\lambda)$ is said to be a strong linearization of grade $g$ , or a $g$ -strong linearization, of $G(\lambda)$ if $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ in $\mathbb{F}$ and also at $\infty$ of grade $g.$

Example 4.22.

Consider again the matrices in Example 3.5. Then the linear polynomial system matrix $P(\lambda)$ is a $1$ -strong linearization of $G(\lambda)$ if and only if all the matrices $B_{1},\ldots,B_{s}$ are nonsingular.

4.3 Comparison with another definition of strong linearization

Recently, another definition of “strong linearization” of a rational matrix $G(\lambda)$ has been presented in [5, Definition 3.4]. In contrast to Definition 4.21, that definition does not make any reference to a “grade $g$ ”, but the linear polynomial system matrices satisfying [5, Definition 3.4] also allow us to recover the information about poles and zeros of $G(\lambda),$ including those at infinity. Therefore, it is convenient to establish a relation between [5, Definition 3.4] and Definition 4.21. This is the purpose of Proposition 4.23. Before stating and proving that proposition, we introduce some comments. Let us consider a linear polynomial system matrix

[TABLE]

with transfer function matrix $\widehat{G}(\lambda),$ and let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ be a rational matrix written as in (8). We recall that, according to [5, Remark 3.5], $\mathcal{L}(\lambda)$ is a strong linearization of $G(\lambda)$ in the sense of [5, Definition 3.4] if the following statements hold:

(a)

$\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ in $\mathbb{F}$ and,

(b)

$A_{1}$ is invertible if $n>0,$ and there exist integers $s_{1},$ $s_{2}\geq 0$ and rational matrices $Q_{1}(\lambda)\in\mathbb{F}(\lambda)^{(p+s_{1})\times(p+s_{1})}$ and $Q_{2}(\lambda)\in\mathbb{F}(\lambda)^{(m+s_{1})\times(m+s_{1})}$ regular at [math] such that

[TABLE]

As we stated in Remark 4.5, condition $(a)$ is equivalent to $\mathcal{L}(\lambda)$ being a linearization of $G(\lambda)$ in the sense of [5, Definition 3.2]. For condition $(b)$ notice that, if $n>0$ and $\deg(\mathcal{L}(\lambda))=1$ , in Definition 4.21 we do not require $A_{1}$ to be invertible but $\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}\\ C_{1}\end{bmatrix}=\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}&B_{1}\end{bmatrix}=n,$ according to Remark 4.14. Observe, in addition, that these rank conditions are satisfied if $A_{1}$ is invertible. Moreover, in contrast to (34), in (31) we consider $\operatorname{rev}_{\ell}\widehat{G}(\lambda)$ instead of $\operatorname{rev}\widehat{G}(\lambda),$ where $\ell=\deg(\mathcal{L}(\lambda))$ , and $\operatorname{rev}_{g}G(\lambda)$ instead of $\operatorname{rev}G(\lambda),$ for an integer $g.$ In this way, Definition 4.12 looks for $\operatorname{rev}\mathcal{L}(\lambda)$ to be a linearization at [math] of the $g$ -reversal of $G(\lambda)$ , because the transfer function matrix of $\operatorname{rev}\mathcal{L}(\lambda)$ is $\operatorname{rev}_{\ell}\widehat{G}(\lambda).$ Note that, $\operatorname{rev}\widehat{G}(\lambda)$ is the transfer function matrix of $\operatorname{rev}\mathcal{L}(\lambda)$ if and only if $\widehat{G}(\lambda)$ is not strictly proper and the degree of the polynomial part of $\widehat{G}(\lambda)$ is equal to the degree of $\mathcal{L}(\lambda)$ . Thus, condition (34) is different from (31) in some cases. Nevertheless, as we will see in Proposition 4.23, in most cases strong linearizations of $G(\lambda)$ in the sense of [5, Definition 3.4] are $g$ -strong linearizations of $G(\lambda)$ of a certain grade $g.$

Proposition 4.23.

Let $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m},$ and let

[TABLE]

be a strong linearization of $G(\lambda)$ according to [5, Definition 3.4]. Let $G(\lambda)$ be expressed uniquely as in (8), and let $g_{G}:=\deg(Q(\lambda))$ if $G(\lambda)$ is not strictly proper and $g_{G}:=0$ otherwise. Then the following statements hold:

(a)

If $n=0$ or $D_{1}+C_{1}A_{1}^{-1}B_{1}\neq 0,$ then $\mathcal{L}(\lambda)$ is a $g_{G}$ -strong linearization of $G(\lambda)$ .

(b)

If $D_{1}+C_{1}A_{1}^{-1}B_{1}=0,$ $q=p,$ and $r=m,$ then $\mathcal{L}(\lambda)$ is a $(g_{G}+1)$ -strong linearization of $G(\lambda)$ .

(c)

If $D_{1}+C_{1}A_{1}^{-1}B_{1}=0,$ and $q\neq p$ or $r\neq m,$ then $\mathcal{L}(\lambda)$ is not a $g$ -strong linearization of $G(\lambda)$ for any integer $g.$

Proof.

We remark that this proof is somewhat technical and that can be skipped without affecting the understanding of the rest of the paper. We will use throughout the proof that $\operatorname{rev}G(\lambda)=\operatorname{rev}_{g_{G}}G(\lambda)$ without mentioning it explicitly. Let $\mathcal{L}(\lambda)$ be a strong linearization of $G(\lambda)$ in the sense of [5, Definition 3.4] and let $\widehat{G}(\lambda)$ be the transfer function matrix of $\mathcal{L}(\lambda)$ . Then $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ in $\mathbb{F}.$ Moreover, if $n>0,$ $A_{1}$ is invertible, which implies $\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}\\ C_{1}\end{bmatrix}=\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}&B_{1}\end{bmatrix}=n.$ Then, it only remains to study the different cases that may occur in condition (34) in order $\mathcal{L}(\lambda)$ to satisfy (31), that is, in order to be a $g$ -strong linearization of $G(\lambda)$ for some integer $g.$

We consider first the trivial case $n=0.$ In this case, $G(\lambda)$ is a polynomial matrix and $\widehat{G}(\lambda)=\mathcal{L}(\lambda)=D_{1}\lambda+D_{0}.$ Therefore, $\operatorname{rev}\widehat{G}(\lambda)=\operatorname{rev}_{\ell}\widehat{G}(\lambda),$ where $\ell=\deg(\mathcal{L}(\lambda)).$ Thus, $\mathcal{L}(\lambda)$ satisfies (31) with $g=g_{G}$ , and it is a $g_{G}$ -strong linearization of $G(\lambda).$

In the rest of the proof, we assume $n>0$ , which implies $\ell=\deg(\mathcal{L}(\lambda))=1$ . In this case, $\widehat{G}(\lambda)$ can be written as $\widehat{G}(\lambda)=\lambda(D_{1}+C_{1}A_{1}^{-1}B_{1})+\widehat{G}_{pr}(\lambda),$ where $\widehat{G}_{pr}(\lambda)$ is a proper rational matrix. Therefore, $\operatorname{rev}\widehat{G}(\lambda)=\operatorname{rev}_{\widehat{g}}\widehat{G}(\lambda),$ where $\widehat{g}=1$ if $D_{1}+C_{1}A_{1}^{-1}B_{1}\neq 0$ and $\widehat{g}=0$ otherwise. Then, we have two different cases. If $D_{1}+C_{1}A_{1}^{-1}B_{1}\neq 0$ then $\widehat{g}=\ell=1,$ and, therefore, $\mathcal{L}(\lambda)$ is a $g_{G}$ -strong linearization of $G(\lambda).$ If $D_{1}+C_{1}A_{1}^{-1}B_{1}=0$ then $\widehat{g}=0,$ and there are two different sub-cases:

$q=p$ and $r=m,$ that is, $G(\lambda)$ and $\widehat{G}(\lambda)$ have the same size. So, in (34), we have $s_{1}=s_{2}=:s.$ Then the invariant orders at [math] of $\operatorname{diag}(\operatorname{rev}_{g_{G}}G(\lambda),I_{s})$ are equal to those of $\operatorname{diag}(\operatorname{rev}_{0}\widehat{G}(\lambda),I_{s}),$ which implies that the invariant orders at [math] of $\operatorname{rev}_{g_{G}}G(\lambda)$ are also equal to those of $\operatorname{rev}_{0}\widehat{G}(\lambda).$ Multiplication by $\lambda$ of $\operatorname{rev}_{g_{G}}G(\lambda)$ and $\operatorname{rev}_{0}\widehat{G}(\lambda)$ yields that $\operatorname{rev}_{g_{G}+1}G(\lambda)$ and $\operatorname{rev}_{1}\widehat{G}(\lambda)$ have the same invariant orders at $0.$ The same happens with $\operatorname{diag}(\operatorname{rev}_{g_{G}+1}G(\lambda),I_{s})$ and $\operatorname{diag}(\operatorname{rev}_{1}\widehat{G}(\lambda),I_{s}).$ Thus, there exist $\widetilde{Q}_{1}(\lambda)$ and $\widetilde{Q}_{2}(\lambda)$ rational matrices regular at [math] such that $\widetilde{Q}_{1}(\lambda)\operatorname{diag}(\operatorname{rev}_{g_{G}+1}G(\lambda),I_{s})\widetilde{Q}_{2}(\lambda)=\operatorname{diag}(\operatorname{rev}_{1}\widehat{G}(\lambda),\allowbreak I_{s}),$ which proves according to (31) that $\mathcal{L}(\lambda)$ is a $(g_{G}+1)$ -strong linearization of $G(\lambda).$

2.

$q\neq p$ or $r\neq m,$ that is, $G(\lambda)$ and $\widehat{G}(\lambda)$ have different sizes and $s_{1}\neq s_{2}.$ In this case, there does not exist any integer $g$ such that the invariant orders at [math] of $\operatorname{diag}(\operatorname{rev}_{g}G(\lambda),I_{s_{1}})$ are equal to the invariant orders at [math] of $\operatorname{diag}(\operatorname{rev}_{1}\widehat{G}(\lambda),\allowbreak I_{s_{2}}).$ As a consequence, $\mathcal{L}(\lambda)$ is not a $g$ -strong linearization of $G(\lambda)$ for any grade $g,$ since (31) is never satisfied. In order to prove this, note that by (34), $\mathop{\rm rank}\nolimits G(\lambda)\neq\mathop{\rm rank}\nolimits\widehat{G}(\lambda),$ and the invariant orders at zero of $\operatorname{diag}(\operatorname{rev}_{g_{G}}G(\lambda),I_{s_{1}})$ are equal to those of $\operatorname{diag}(\operatorname{rev}_{0}\widehat{G}(\lambda),I_{s_{2}}).$ Moreover, $\widehat{G}(\lambda)$ is proper since $D_{1}+C_{1}A_{1}^{-1}B_{1}=0.$ Therefore, all the invariant orders at [math] of $\operatorname{rev}_{0}\widehat{G}(\lambda)=\widehat{G}(1/\lambda)$ are nonnegative. So, the same happens to $\operatorname{rev}_{g_{G}}G(\lambda).$ Then, $\operatorname{diag}(\operatorname{rev}_{1}\widehat{G}(\lambda),\allowbreak I_{s_{2}})$ has $s_{2}$ invariant orders at [math] equal to zero, and its remaining invariant orders at [math] are $\mathop{\rm rank}\nolimits\widehat{G}(\lambda)$ positive numbers. In contrast, if $g>g_{G},$ then $\operatorname{diag}(\operatorname{rev}_{g}G(\lambda),I_{s_{1}})$ has $s_{1}$ invariant orders at [math] equal to zero, and its remaining invariant orders at [math] are $\mathop{\rm rank}\nolimits G(\lambda)$ positive numbers. If $g\leq g_{G},$ notice that the largest invariant order at [math] of $\operatorname{diag}(\operatorname{rev}_{g}G(\lambda),I_{s_{1}})$ is less than the largest of $\operatorname{diag}(\operatorname{rev}_{1}\widehat{G}(\lambda),I_{s_{2}}).$

∎

We emphasize that, as far as we know, no explicit examples of strong linearizations in the sense of [5] with $n>0,$ $q\neq p$ or $r\neq m,$ and $D_{1}+C_{1}A_{1}^{-1}B_{1}=0$ have been constructed so far in the literature. Thus, in plain words, Proposition 4.23 states that strong linearizations according to [5] are particular cases of $g$ -strong linearizations according to Definition 4.21, except for a very particular instance.

In the rest of this section, we first revise important examples of strong linearizations in [5] from the perspective of Definition 4.21. Then, in Example 4.26, we provide a $g$ -strong linearization that is not a strong linearization in the sense of [5]. This example illustrates that the local approach followed in this paper yields, apart from the flexibility of constructing local linearizations, a concept of “global” strong linearization, more general than that of [5].

Example 4.24.

Let $G(\lambda)$ be a rational matrix written as in (8), i.e., $G(\lambda)=Q(\lambda)+G_{sp}(\lambda),$ with $\deg(Q(\lambda))>1.$ Then, the strong block minimal bases linearizations constructed in [5, Theorem 5.11] are $\deg(Q(\lambda))$ -strong linearizations of $G(\lambda),$ according to Definition 4.21. Note that, with the notation in Proposition 4.23, these linearizations satisfy $D_{1}+C_{1}A_{1}^{-1}B_{1}\neq 0,$ since $D_{1}\neq 0,$ and $C_{1}=B_{1}=0.$

Example 4.25.

Let us now consider a rational matrix $G(\lambda)$ written as in (8) with $\deg(Q(\lambda))\leq 1$ or $Q(\lambda)=0,$ and let $G_{sp}(\lambda)=C(\lambda I_{n}-A)^{-1}B$ be a minimal state-space realization of $G_{sp}(\lambda).$ Then, the following strong linearization

[TABLE]

is considered in [5] (paragraph below equation $(28)$ ). In this case, with the notation in Proposition 4.23, we have $n>0,$ $q=p,$ $r=m,$ and $C_{1}A_{1}^{-1}B_{1}=0.$ Then $D_{1}+C_{1}A_{1}^{-1}B_{1}=0$ if $g_{G}=0,$ or $D_{1}+C_{1}A_{1}^{-1}B_{1}\neq 0$ if $g_{G}=1.$ In any case, $L(\lambda)$ is a $1$ -strong linearization by Proposition 4.23. Observe that in this example $G(\lambda)$ is the transfer function of $L(\lambda).$ Thus, the fact that $L(\lambda)$ is a $1$ -strong linearization can also be obtained directly from Propositions 4.9 and 4.18.

Finally, we discuss the announced example of a linear polynomial system matrix that is a strong linearization in the sense of Definition 4.21 but not in the sense of [5, Definition 3.4].

Example 4.26.

Let us consider the rational matrix

[TABLE]

It can be easily proved that

[TABLE]

is a linear polynomial system matrix of $G(\lambda).$ Moreover, note that $\mathcal{L}(\lambda)$ is minimal for all $\lambda_{0}\in\mathbb{F}.$ Therefore, by Proposition 4.9, $\mathcal{L}(\lambda)$ is a linearization of $G(\lambda)$ in $\mathbb{F}.$ By Proposition 4.18, $\mathcal{L}(\lambda)$ is also a linearization of $G(\lambda)$ at $\infty$ of grade $1$ since $\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}\\ C_{1}\end{bmatrix}=\mathop{\rm rank}\nolimits\begin{bmatrix}A_{1}&B_{1}\end{bmatrix}=2.$ Thus, $\mathcal{L}(\lambda)$ is a $1$ -strong linearization of $G(\lambda),$ according to Definition 4.21. However, $\mathcal{L}(\lambda)$ is not a strong linearization according to [5, Definition 3.4] since $A_{1}$ is singular. Nevertheless, we can recover easily the invariant orders at $\infty$ from $\mathcal{L}(\lambda)$ by applying Proposition 4.17 with $g=1.$ For this purpose, note that $\operatorname{rev}\mathcal{L}(\lambda)$ does not have elementary divisors at $0,$ since $\operatorname{rev}\mathcal{L}(\lambda)$ is regular at $0.$ Moreover, the only elementary divisor at [math] of $A_{1}+A_{0}\lambda$ is $\lambda.$ Therefore, the invariant orders at infinity of $G(\lambda)$ are $-2$ and $-1$ by Proposition 4.17. The invariant orders of $G(\lambda)$ at any finite point can be recovered from $\mathcal{L}(\lambda)$ by using Theorem 4.8. It is worthwhile to emphasize that the grade of $\mathcal{L}(\lambda)$ as a strong linearization of $G(\lambda)$ is different from the degree of the polynomial part of $G(\lambda).$ Observe that this also happens in Example 4.25 when $Q(\lambda)$ is a constant matrix.

5 Block full rank pencils

In this section, we introduce a wide family of pencils that give us the information about the zeros of rational matrices locally. More precisely, they are linearizations with empty state matrix of rational matrices in some subsets of $\mathbb{F}$ , as well as at $\infty$ under some conditions. These pencils will be called block full rank pencils, since they generalize the block minimal bases pencils introduced in [10, Definition 3.1]. The definition of block full rank pencils is motivated by the fact that most of the linearizations for rational approximations of NLEPs that have been constructed so far are pencils of this type. The key results in this section are Theorems 5.4 and 5.8, which will be applied in the following section to establish rigorously and very easily the properties of the linearizations used in [18]. Note that, according to Theorem 4.8, the results in this section are not useful for studying, or computing, the finite poles of rational matrices because the considered linearizations have empty state matrix. This may be a drawback in certain situations, but we emphasize again that it is not in the development of algorithms for solving large-scale NLEPs via rational approximations [17, 18, 21, 29]. This is due to the fact that, in those cases, the poles of the rational matrix are known, since they are chosen for constructing the approximation, and/or are located outside the target set.

The theory we develop for block full rank pencils is based on the results for block minimal bases pencils from [10]. It is also possible to develop directly such theory at the cost of proving some preliminary lemmas. However, we think that our approach has the advantages of establishing a connection between both families of pencils and of emphasizing the relevance of these families in the study of rational and polynomial matrices.

Next, we present a few definitions and results from [10] for making easier the reading of this section. The notion of (strong) block minimal bases pencil is recalled in Definition 5.1. It relies on the concept of minimal bases of rational subspaces, which are certain polynomial bases of such subspaces defined in [14, 19]. As in [10], we will say for brevity that a polynomial matrix $K(\lambda)\in\mathbb{F}[\lambda]^{p\times m}$ (with $p<m$ ) is a minimal basis if its rows form a minimal basis of the rational subspace they span. One of the most useful characterizations of minimal bases (see [14, Main Theorem] or [10, Theorem 2.2]) is that $K(\lambda)\in\mathbb{F}[\lambda]^{p\times m}$ is a minimal basis if and only if $K(\lambda_{0})$ has full row rank for all $\lambda_{0}\in\mathbb{F}$ and $K(\lambda)$ is row reduced, i.e., its highest row degree coefficient matrix has full row rank (see [10, Definition 2.1]). Moreover, a minimal basis $N(\lambda)\in\mathbb{F}[\lambda]^{q\times m}$ is said to be dual to $K(\lambda)$ if $p+q=m$ and $K(\lambda)N(\lambda)^{T}=0$ [10, Definition 2.5].

Definition 5.1.

[10, Definition 3.1]* ((Strong) block minimal bases pencil). A block minimal bases pencil is a linear polynomial matrix over $\mathbb{F}$ with the following structure*

[TABLE]

where $K_{1}(\lambda)$ and $K_{2}(\lambda)$ are both minimal bases. In addition, if $K_{1}(\lambda)$ (respectively $K_{2}(\lambda)$ ) is a minimal basis with all its row degrees equal to $1$ and with the row degrees of a minimal basis $N_{1}(\lambda)$ (respectively $N_{2}(\lambda)$ ) dual to $K_{1}(\lambda)$ (respectively $K_{2}(\lambda)$ ) all equal, then $L(\lambda)$ is called a strong block minimal bases pencil. Moreover, given a polynomial matrix $P(\lambda),$ it is said that $L(\lambda)$ is associated with $P(\lambda)$ if $N_{2}(\lambda)M(\lambda)N_{1}(\lambda)^{T}=P(\lambda).$

Theorem 3.3 in [10] uses the standard definitions of linearizations and strong linearizations of polynomial matrices (see, for instance, [9]) to prove the most important property of a (strong) block minimal bases pencil $L(\lambda)$ as in (36), namely, $L(\lambda)$ is a (strong) linearization of the polynomial matrix $P(\lambda)=N_{2}(\lambda)M(\lambda)N_{1}(\lambda)^{T}$ for any $N_{1}(\lambda)$ and $N_{2}(\lambda)$ minimal bases dual to $K_{1}(\lambda)$ and $K_{2}(\lambda)$ , respectively. In the strong case, this result considers $P(\lambda)$ as a polynomial matrix with grade $g_{P}:=1+\deg(N_{1}(\lambda))+\deg(N_{2}(\lambda))$ . We can state [10, Theorem 3.3] in the language of this paper through Definitions 4.3 and 4.21 as “a block minimal bases pencil $L(\lambda)$ is a linearization of $P(\lambda)$ in $\mathbb{F}$ with empty state matrix and a strong block minimal bases pencil $L(\lambda)$ is a $g_{P}$ -strong linearization of $P(\lambda)$ with empty state matrix”. In order to see this, recall that the “empty state matrix” condition implies that the minimality condition is automatically satisfied (see Remarks 3.4 and 4.2) and that $\widehat{G}(\lambda)=L(\lambda)$ in the definitions cited above.

Next, we relax to a minimum the conditions on $K_{1}(\lambda)$ and $K_{2}(\lambda)$ in (36) for defining a wider family of pencils that includes block minimal bases pencils as a particular case.

Definition 5.2.

(Block full rank pencil)* A block full rank pencil is a linear polynomial matrix over $\mathbb{F}$ with the following structure*

[TABLE]

where $K_{1}(\lambda)$ and $K_{2}(\lambda)$ are pencils with full row normal rank.

Note that Definition 5.2 includes the cases when $K_{1}(\lambda)$ or $K_{2}(\lambda)$ are empty matrices, that is, when $L(\lambda)$ has only one block row or only one block column, respectively.

We introduce some auxiliary concepts and results before establishing the most important properties of block full rank pencils in Theorems 5.4 and 5.8. We will say that a rational matrix $R(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ has full row rank in $\Sigma\subseteq\mathbb{F}$ if, for all $\lambda_{0}\in\Sigma$ , $R(\lambda_{0})\in\mathbb{F}^{p\times m}$ , i.e., $R(\lambda)$ is defined or bounded at $\lambda_{0}$ , and $\mathop{\rm rank}\nolimits R(\lambda_{0})=p$ . Observe that this implies that $R(\lambda)$ has no poles in $\Sigma$ . The following lemma connects rational matrices with full row rank in $\Sigma$ with minimal bases, and establishes other properties that will be used later.

Lemma 5.3.

Let $R(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ be a rational matrix with full row normal rank and let $T(\lambda)\in\mathbb{F}[\lambda]^{p\times m}$ be a minimal basis of the row space of $R(\lambda)$ . Then the following statements hold:

(a)

There exists a unique regular rational matrix $S(\lambda)\in\mathbb{F}(\lambda)^{p\times p}$ such that $R(\lambda)=S(\lambda)T(\lambda)$ . 2. (b)

$R(\lambda)$ * has full row rank in $\Sigma\subseteq\mathbb{F}$ if and only if $S(\lambda)$ in (a) is regular in $\Sigma$ .* 3. (c)

$R(\lambda)$ * is a polynomial matrix if and only if $S(\lambda)$ in (a) is a polynomial matrix.* 4. (d)

If $R(\lambda)$ is a matrix pencil, then $S(\lambda)$ in (a) and $T(\lambda)$ are both matrix pencils.

Proof.

Part (a). Each row of $S(\lambda)$ is uniquely defined because its entries are the unique rational coefficients that allow us to express the corresponding row of $R(\lambda)$ as a unique linear combination of the rows of $T(\lambda)$ . Moreover, $S(\lambda)$ must be regular since, otherwise, there would exist a nonzero vector $y(\lambda)\in\mathbb{F}(\lambda)^{p\times 1}$ such that $y(\lambda)^{T}S(\lambda)=0$ . So, $y(\lambda)^{T}R(\lambda)=0$ , which contradicts that $\mathop{\rm rank}\nolimits R(\lambda)=p$ .

Part (b). It is obvious that if $S(\lambda)$ is regular in $\Sigma$ , then $R(\lambda)$ has full row rank in $\Sigma$ , because $T(\lambda)$ is defined in $\Sigma$ , as $T(\lambda)$ is a polynomial matrix, and $T(\lambda)$ has full row rank in $\Sigma$ , since $T(\lambda)$ is a minimal basis. The proof of the converse implication starts by proving that if $R(\lambda)$ has full row rank in $\Sigma$ , then $S(\lambda)$ is defined in $\Sigma$ . To see this, note that the Smith form of $T(\lambda)$ is $[I_{p}\;\;0]$ , because $T(\lambda)$ is a minimal basis and, therefore, does not have finite zeros. Thus, there exist unimodular matrices $U(\lambda)$ and $V(\lambda)$ such that $T(\lambda)=U(\lambda)\,[I_{p}\;\;0]\,V(\lambda)$ , and $R(\lambda)V(\lambda)^{-1}=[S(\lambda)U(\lambda)\;\;0]$ . This shows that $C(\lambda):=S(\lambda)U(\lambda)$ is defined in $\Sigma$ , because $R(\lambda)$ and $V(\lambda)^{-1}$ are both defined in $\Sigma$ ( $R(\lambda)$ by hypothesis and $V(\lambda)^{-1}$ because is unimodular and so a polynomial matrix). Therefore, $S(\lambda)=C(\lambda)U(\lambda)^{-1}$ is defined in $\Sigma$ . This implies that we can write $R(\lambda_{0})=S(\lambda_{0})T(\lambda_{0})$ for each $\lambda_{0}\in\Sigma$ , which in turns implies that $S(\lambda_{0})$ is invertible because $R(\lambda_{0})$ has full row rank.

Part (c). It follows directly from [14, Main Theorem, part 4].

Part (d). From [14, Main Theorem, part 4], we have that

[TABLE]

where $\mbox{row}_{i}\,(R(\lambda))$ denotes the $i$ th row of $R(\lambda)$ and the maximum is taken over the nonzero entries $s_{ij}(\lambda)$ of $S(\lambda)$ . Since all the rows of $T(\lambda)$ are different from zero, (38) implies that $\deg(s_{ij}(\lambda))\leq 1$ for each nonzero entry of $S(\lambda)$ . Moreover, each column of $S(\lambda)$ has at least one nonzero entry, because $S(\lambda)$ is regular, which, combined with (38), implies that $\deg(\mbox{row}_{j}\,(T(\lambda)))\leq 1$ , for each $j=1,\ldots,p$ . ∎

The last concepts we need before stating and proving the main Theorem 5.4 are those of rational basis and dual rational bases. A rational matrix $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ (with $p<m$ ) is said to be a rational basis if it is a basis of the rational subspace spanned by its rows, i.e., if it has full row normal rank. Two rational bases $G(\lambda)\in\mathbb{F}(\lambda)^{p\times m}$ and $H(\lambda)\in\mathbb{F}(\lambda)^{q\times m}$ are said to be dual if $p+q=m$ , and $G(\lambda)\,H(\lambda)^{T}=0$ . We are finally ready for presenting the main result of this section.

Theorem 5.4.

Let $L(\lambda)$ be a block full rank pencil as in (37) and let $N_{1}(\lambda)$ and $N_{2}(\lambda)$ be any rational bases dual to $K_{1}(\lambda)$ and $K_{2}(\lambda)$ , respectively. Let $\Omega\subseteq\mathbb{F}$ be nonempty. If $K_{i}(\lambda)$ and $N_{i}(\lambda)$ have full row rank in $\Omega$ , for $i=1,2,$ then $L(\lambda)$ is a linearization with empty state matrix of the rational matrix $G(\lambda)=N_{2}(\lambda)M(\lambda)N_{1}(\lambda)^{T}$ in $\Omega$ .

Proof.

In order to simplify the notation, throughout this proof we do not specify the sizes of different identity matrices and all of them are denoted by $I$ . Let $\widetilde{K}_{1}(\lambda),\widetilde{K}_{2}(\lambda),\widetilde{N}_{1}(\lambda)$ and $\widetilde{N}_{2}(\lambda)$ be minimal bases of the row spaces of $K_{1}(\lambda)$ , $K_{2}(\lambda)$ , $N_{1}(\lambda)$ and $N_{2}(\lambda)$ , respectively. Then, Lemma 5.3 implies that there exist regular rational matrices $S_{1}(\lambda)$ , $S_{2}(\lambda)$ , $W_{1}(\lambda)$ and $W_{2}(\lambda)$ such that

[TABLE]

Moreover, $\widetilde{K}_{1}(\lambda),\widetilde{K}_{2}(\lambda),S_{1}(\lambda)$ and $S_{2}(\lambda)$ are all matrix pencils. Then, $L(\lambda)$ can be factorized as follows,

[TABLE]

where the first and third factors are regular in $\Omega$ . Note that the factor in the middle is a block minimal bases pencil associated with the polynomial matrix $\widetilde{N}_{2}(\lambda)M(\lambda)\widetilde{N}_{1}(\lambda)^{T}$ , since the regularity of $S_{i}(\lambda)$ and $W_{i}(\lambda)$ implies that $\widetilde{K}_{i}(\lambda)$ and $\widetilde{N}_{i}(\lambda)$ are dual minimal bases for $i=1,2$ . Then, there exist unimodular matrices $U(\lambda)$ and $V(\lambda)$ such that

[TABLE]

where $U(\lambda)\operatorname{diag}(W_{2}(\lambda)^{-1},I)$ and $\operatorname{diag}(W_{1}(\lambda)^{-T},I)V(\lambda)$ are regular in $\Omega$ . From combining (39) and (51), we obtain that $L(\lambda)$ and $\operatorname{diag}(G(\lambda),I)$ are equivalent in $\Omega$ . This proves that $L(\lambda)$ is a linearization with empty state matrix of $G(\lambda)$ in $\Omega$ according to Definitions 4.1 and 4.3, since the minimality condition is automatically satisfied if the state matrix is empty. ∎

Remark 5.5.

In the scenario of Theorem 5.4, Theorem 4.8 guarantees that the elementary divisors of $L(\lambda)$ in $\Omega$ coincide exactly with the zero elementary divisors of $G(\lambda)$ in $\Omega$ . Moreover, it is clear from the expression $G(\lambda)=N_{2}(\lambda)M(\lambda)N_{1}(\lambda)^{T}$ that $G(\lambda)$ does not have poles in $\Omega,$ since the matrices $N_{i}(\lambda)$ must be defined in $\Omega$ but they are not defined at the poles of $G(\lambda)$ . Thus, $G(\lambda)$ has only eigenvalues in $\Omega$ , and all the information about them, i.e., geometric, algebraic and partial multiplicities, is contained in $L(\lambda)$ .**

Remark 5.6.

If in Theorem 5.4, $K_{1}(\lambda)$ (resp. $K_{2}(\lambda)$ ) is an empty matrix, we can take any rational matrix $N_{1}(\lambda)\in\mathbb{F}(\lambda)^{s_{1}\times s_{1}}$ (resp. $N_{2}(\lambda)\in\mathbb{F}(\lambda)^{s_{2}\times s_{2}}$ ) regular in $\Omega$ , where $s_{1}$ (resp. $s_{2}$ ) is the number of colums (resp. rows) of $M(\lambda)$ . The standard choices are $N_{1}(\lambda)=I_{s_{1}}$ and $N_{2}(\lambda)=I_{s_{2}}$ .

Remark 5.7.

Under the conditions of Theorem 5.4, we will say for brevity that “ $L(\lambda)$ is a block full rank pencil associated with $G(\lambda)$ in $\Omega$ ”. We emphasize that this “association” is not one-to-one because there are infinitely many rational bases $N_{1}(\lambda)$ and $N_{2}(\lambda)$ dual to $K_{1}(\lambda)$ and $K_{2}(\lambda)$ .

Next, we present sufficient conditions for a block full rank pencil to be a linearization of $G(\lambda)=N_{2}(\lambda)M(\lambda)N_{1}(\lambda)^{T}$ at $\infty$ of a certain grade $g$ . In order to avoid cases with limited interest in applications that complicate the statement, in Theorem 5.8 we assume $\deg(L(\lambda))=1$ .

Theorem 5.8.

Let $L(\lambda)$ be a block full rank pencil as in (37) with $\deg(L(\lambda))=1$ and let $N_{1}(\lambda)$ and $N_{2}(\lambda)$ be rational bases dual to $K_{1}(\lambda)$ and $K_{2}(\lambda)$ , respectively. If, for $i=1,2$ , $\operatorname{rev}_{1}K_{i}(\lambda)$ has full row rank at zero, and there exists an integer number $t_{i}$ such that $\operatorname{rev}_{t_{i}}N_{i}(\lambda)$ has full row rank at zero, then $L(\lambda)$ is a linearization with empty state matrix of the rational matrix $G(\lambda)=N_{2}(\lambda)M(\lambda)N_{1}(\lambda)^{T}$ at $\infty$ of grade $1+t_{1}+t_{2}$ .

Proof.

Note that

[TABLE]

is a block full rank pencil. Moreover, for $i=1,2$ , $\operatorname{rev}_{t_{i}}N_{i}(\lambda)$ has full row normal rank, and $K_{i}(\lambda)\,N_{i}(\lambda)^{T}=0$ implies $(\operatorname{rev}_{1}K_{i}(\lambda))\,(\operatorname{rev}_{t_{i}}N_{i}(\lambda))^{T}=0$ . Therefore, $\operatorname{rev}_{t_{i}}N_{i}(\lambda)$ is a rational basis dual to $\operatorname{rev}_{1}K_{i}(\lambda)$ . Then, Theorem 5.4 applied to $\operatorname{rev}L(\lambda)$ proves that $\operatorname{rev}L(\lambda)$ is a linearization with empty state matrix at zero of

[TABLE]

which combined with Proposition 4.13 proves the result. ∎

As a consequence of Theorems 5.4 and 5.8, we obtain Corollary 5.9. Although it follows immediately from them, we state it since it generalizes the structure of most of the linearizations of rational approximations of NLEPs that appear in the literature. Moreover, it is very useful in order to characterize easily some pencils as linearizations of rational matrices locally and to obtain the information about the zeros of such rational matrices in subsets not containing poles.

Corollary 5.9.

Let

[TABLE]

be a $p\times m$ rational matrix written in terms of some matrix pencils $A_{i}-\lambda B_{i}\in\mathbb{F}[\lambda]^{p\times n_{i}}$ and rational matrices $R_{i}(\lambda)\in\mathbb{F}(\lambda)^{n_{i}\times m}$ . Define

[TABLE]

and assume that $N_{1}(\lambda)$ has full row normal rank. Let $L(\lambda)=\left[\begin{array}[]{c}M(\lambda)\\ \hdashline[2pt/2pt]K_{1}(\lambda)\end{array}\right]$ be a block full rank pencil of degree $1$ with only one block column and such that $K_{1}(\lambda)$ and $N_{1}(\lambda)$ are dual rational bases. Let $\Omega\subseteq\mathbb{F}$ be nonempty. Then the following statements hold:

(a)

If $K_{1}(\lambda)$ and $N_{1}(\lambda)$ have full row rank in $\Omega$ then $L(\lambda)$ is a linearization with empty state matrix of $R(\lambda)$ in $\Omega.$

(b)

If $\operatorname{rev}_{1}K_{1}(\lambda)$ has full row rank at $0,$ and there exists an integer $t$ such that $\operatorname{rev}_{t}N_{1}(\lambda)$ has full row rank at [math], then $L(\lambda)$ is a linearization with empty state matrix of $R(\lambda)$ at $\infty$ of grade $1+t.$

Remark 5.10.

We emphasize that in some relevant applications the rational matrices $R_{i}(\lambda)$ of Corollary 5.9 are just of the form $R_{i}(\lambda)=r_{i}(\lambda)I_{m},$ where $r_{i}(\lambda)$ are scalar rational functions, and/or most of the pencils $A_{i}-\lambda B_{i}$ are constant matrices or a linear scalar function times a constant matrix. Moreover, in some other applications a low rank structure is present in $R(\lambda)$ , that is, some of the terms in $R(\lambda)$ have a rank much smaller than $\min\{p,m\}$ , and the corresponding rational matrices are written in the form $R_{i}(\lambda)=r_{i}(\lambda)R_{i}$ , where $R_{i}\in\mathbb{F}^{n_{i}\times m}$ is a constant matrix with $n_{i}\ll m.$

In the next two examples, we revisit the pencils introduced in Examples 3.5 and 4.11 from the perspective of the block full rank pencils. These examples illustrate how the theory of block full rank pencils may simplify the analysis of the properties of important linearizations of rational matrices when one is not interested on the information about the poles.

Example 5.11.

Let us consider the rational matrix $G(\lambda)$ and the pencil $P(\lambda)$ in Example 3.5. We partition $P(\lambda)$ as follows:

[TABLE]

Observe that, in the above partition, we are considering a permuted version of the structure of the pencil $L(\lambda)$ in Corollary 5.9. Note now that $K_{1}(\lambda)$ has full row rank in $\mathbb{C}$ , and

[TABLE]

is a rational basis dual to $K_{1}(\lambda)$ with full row rank in $\Sigma:=\mathbb{C}\setminus\{\sigma_{1},\ldots,\sigma_{s}\}$ . Then, by Corollary 5.9 $\rm(a)$ , $P(\lambda)$ is a linearization with empty state matrix of $G(\lambda)$ in $\Sigma.$ Moreover, note that $\operatorname{rev}_{1}K_{1}(\lambda)$ and $\operatorname{rev}_{0}N_{1}(\lambda)=\left[\begin{array}[]{ccccc}\frac{\lambda}{\lambda\sigma_{1}-1}I&\frac{\lambda}{\lambda\sigma_{2}-1}I&\ldots&\frac{\lambda}{\lambda\sigma_{s}-1}I&I\end{array}\right]$ both have full row rank at [math]. Thus, by Corollary 5.9 $\rm(b)$ , $P(\lambda)$ is a linearization with empty state matrix of $G(\lambda)$ at $\infty$ of grade $1.$

Example 5.12.

Let us consider the rational matrix $G(\lambda)$ and the pencil $L(\lambda)$ in Example 4.11. We now consider the following partition of $L(\lambda)$ :

[TABLE]

Since $K_{1}(\lambda)$ has full row normal rank, $L(\lambda)$ has the structure of the block full rank pencil in Corollary 5.9. Observe that

[TABLE]

is a rational basis dual to $K_{1}(\lambda)$ and that $K_{1}(\lambda)$ and $N_{1}(\lambda)$ have both full row rank in $\Omega:=\{\lambda\in\mathbb{F}:\lambda\text{ is not eigenvalue of }A\}.$ Thus, Corollary 5.9 $\rm(a)$ implies that $L(\lambda)$ is a linearization with empty state matrix of $G(\lambda)$ in $\Omega.$ This example, together with Example 4.11, illustrates a very important fact that we have already mentioned: the same pencil can be viewed as a linearization with different state matrices. Moreover, different views may require different conditions, may lead to different sets where the pencil is a linearization, and may differ in the difficulty to get the results. For instant, when the developments in this example are compared with the direct application of the definition of linearization presented in the second approach in Example 4.11 through the matrices $V(\lambda)$ and $U(\lambda)$ in (22) and (29), respectively, we can conclude that the “block full rank pencil” view leads to the same results in a much simpler way. We have experimented the simplicity of the “block full rank pencil” approach in many other examples.

Finally, note that the pencil in (52) satisfies that $\operatorname{rev}_{1}K_{1}(\lambda)$ has full row rank at [math] and that $N_{1}(\lambda)$ in (53) satisfies that $\operatorname{rev}_{q-1}N_{1}(\lambda)$ has also full row rank at [math]. Thus, Corollary 5.9 $\rm(b)$ implies that $L(\lambda)$ is a linearization with empty state matrix of $G(\lambda)$ at $\infty$ of grade $q$ . By comparing this result with the result in Example 4.20, we see that considering $L(\lambda)$ as a block full rank pencil leads to much stronger results on the structure at infinity than considering $L(\lambda)$ as a polynomial system matrix with state matrix $A(\lambda)$ in (15). In the former case, we do not need any extra hypothesis in order $L(\lambda)$ to be a linearization at infinity, while in the latter the condition $\mathop{\rm rank}\nolimits D_{q}=m$ is needed.

As previously announced, the results in this section will be used in Section 6. In addition, in a future work [12], we will extend them. More precisely, we will define block full rank linearizations of rational matrices with non empty state matrix that, therefore, will contain information about the poles. Moreover, we will apply these results to establish rigorously and very easily the properties of the linearizations introduced in [21].

6 Application of the local linearization theory to NLEIGS pencils

In this section we study in depth the pencils introduced in the influential reference [18]. This reference presents one of the first systematic approaches for solving large scale NLEPs. The approach in [18] consists essentially of three steps: (1) the matrix defining the NLEP is approximated by a rational matrix $Q_{N}(\lambda)$ via Hermite’s interpolation in a certain compact target set $\Sigma\subset\mathbb{C}$ where the eigenvalues of interest are located; (2) the obtained rational matrix is linearized by using a certain pencil $L_{N}(\lambda)$ ; and (3) a highly structured rational Krylov method is applied to the pencil to compute the eigenvalues of $Q_{N}(\lambda)$ in $\Sigma$ . For brevity of exposition, and also for recognizing the key contribution of [18], we will call NLEIGS pencils to the pencils introduced in this reference. The main goal of this section is to replace the vague usage of the word “linearization” in [18] by a number of rigorous results on NLEIGS pencils which, combined with the results in Sections 4 and 5, establish the precise properties enjoyed with respect to eigenvalues (and poles) of the NLEIGS pencils. We remark that NLEIGS pencils $L_{N}(\lambda)$ were the initial motivation for developing the results of this paper, since $L_{N}(\lambda)$ is not a linearization of the rational matrix $Q_{N}(\lambda)$ , according to the definitions of linearization and strong linearization presented in [5] or [1].

Since we are interested in rational matrices and their linearizations, all the delicate details about how the rational matrices $Q_{N}(\lambda)$ are constructed as approximations of the original NLEPs are omitted. Such details can be found in [18]. As in the rest of the paper, the results in this section are valid and are stated in any algebraically closed field $\mathbb{F}$ that does not include infinity. Note, however, that reference [18] considers only the complex field and that this restriction is important in the approximation phase of the NLEP. Moreover, although [18] deals with regular rational matrices $Q_{N}(\lambda)$ , we will not impose such condition initially in our developments.

Reference [18] uses two families of rational matrices, and corresponding pencils, depending on whether or not a certain low rank structure is present in the original NLEP. We will refer to them as the NLEIGS basic problem and the NLEIGS low rank structured problem, respectively. The NLEIGS pencils corresponding to each of these two cases will be studied from two perspectives giving rise to the four subsections included in this section. These two perspectives are considering NLEIGS pencils as block full rank pencils and, thus, as linearizations with empty state matrices, and considering them as polynomial system matrices with transfer function matrices equivalent to $Q_{N}(\lambda)$ everywhere except at a point $\xi_{N}$ . Both perspectives allow us to state in a rigorous sense that NLEIGS pencils are linearizations of $Q_{N}(\lambda)$ , but the one based on block full rank pencils is much simpler, does not require any hypothesis and covers fully the applications of interest in [18]. In contrast, the polynomial system matrix perspective provides more information on $Q_{N}(\lambda)$ but at the cost of extra hypotheses which are not imposed in [18] and that require considerable effort to check.

6.1 The NLEIGS basic problem from the point of view of block full rank pencils

The families of rational matrices considered in [18] are defined in terms of the following parameters: a list of nodes $(\sigma_{0},\sigma_{1},\ldots,\sigma_{N-1})$ in $\mathbb{F}$ , a list of nonzero poles $(\xi_{1},\xi_{2},\ldots,\xi_{N})$ in $\mathbb{F}\,\cup\,\{\infty\}$ , and a list of nonzero scaling parameters $(\beta_{0},\beta_{1},\ldots,\beta_{N})$ in $\mathbb{F}$ . It is important to bear in mind that [18] assumes that the poles are all distinct from the nodes. However, we do not assume such property, except in a few results where it will be explicitly stated. With these parameters, the following sequence of rational scalar functions is defined:

[TABLE]

Let us now define the linear scalar functions

[TABLE]

for $i=1,\ldots,N$ , and $j=0,\ldots,N-1.$ Then, the rational functions $b_{i}(\lambda)$ satisfy the simple recursion

[TABLE]

which will be useful in the sequel. Note that the rational functions $b_{i}(\lambda)$ could not be proper, since for any infinite pole $\xi_{i}=\infty$ the corresponding factor $1-\lambda/\xi_{i}$ is just equal to $1$ , and, therefore, $b_{i}(\lambda)$ has a nonconstant polynomial part.

With all this information, we are in the position of introducing the first family of rational matrices considered in [18], whose elements are defined as

[TABLE]

where $D_{0},\ldots,D_{N}\in\mathbb{F}^{m\times m}$ are constant matrices.

In this section, the nodes $(\sigma_{0},\ldots,\allowbreak\sigma_{N-1})$ , the poles $(\xi_{1},\ldots,\xi_{N})$ , the scaling parameters $(\beta_{0},\ldots,\beta_{N})$ and the matrices $D_{0},\allowbreak\ldots,D_{N}$ are arbitrary parameters that allow us to define the considered family of rational matrices. However, in [18] these parameters are carefully chosen in such a way that $Q_{N}(\lambda)$ approximates satisfactorily the matrix defining the NLEP to be solved in the target set $\Sigma\subset\mathbb{F}$ containing the desired eigenvalues of the NLEP. In this scenario, it is important to stress that the poles $(\xi_{1},\ldots,\xi_{N})$ are always chosen outside the region of interest $\Sigma$ [18, p. A2852], which implies that all the zeros of $Q_{N}(\lambda)$ located in $\Sigma$ are eigenvalues of $Q_{N}(\lambda)$ . Thus, the REP associated with $Q_{N}(\lambda)$ is an explicit example of a problem with a property that has been mentioned before in this paper, i.e., the poles are known and located outside the region of interest and, then, it is not needed to compute them. Note, however, the following subtlety: though it is clear that the finite poles of $Q_{N}(\lambda)$ are included in the list $(\xi_{1},\ldots,\xi_{N})$ , it is easy to construct examples of matrices as in (56) for which some of the finite numbers in $(\xi_{1},\ldots,\xi_{N})$ are not poles due to some cancellations. Thus, all the finite numbers in $(\xi_{1},\ldots,\xi_{N})$ are not necessarily finite poles of $Q_{N}(\lambda)$ and, even more, the partial multiplicities of such poles are not immediately visible from (56). Despite these comments, we will call the numbers $(\xi_{1},\ldots,\xi_{N})$ poles, following the usage in [18].

In order to solve the REP $Q_{N}(\lambda)\,y=0$ , the authors of [18] solve the generalized eigenvalue problem corresponding to the pencil

[TABLE]

where

[TABLE]

and

[TABLE]

In [18] the use of $L_{N}(\lambda)$ for solving the REP associated to $Q_{N}(\lambda)$ is supported by [18, Theorem 3.2], which states that $L_{N}(\lambda)$ is a strong linearization of the rational matrix $Q_{N}(\lambda)$ without specifying the exact meaning of “strong linearization” in this rational context. Moreover, the proof of [18, Theorem 3.2] consists of a reference to [2, Theorem 3.1], which is a paper dealing with strong linearizations of polynomial matrices in the classical sense of [16]. However, as a consequence of the results in Section 5, it is very easy to prove that $L_{N}(\lambda)$ is always a linearization of $Q_{N}(\lambda)$ in a set including the region of interest in [18], as well as at infinity. This is proved in Theorem 6.1, where the nomenclature introduced in Remark 5.7 is used.

Theorem 6.1.

Let $Q_{N}(\lambda)$ be the rational matrix in (56) and $L_{N}(\lambda)$ be the pencil in (57). Let $\mathcal{P}_{N}$ and $i_{N}$ be, respectively, the set of finite poles and the number of infinite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{N})$ . Then, the following statements hold:

(a)

$L_{N}(\lambda)$ * partitioned as in (57) is a block full rank pencil with only one block column associated with $Q_{N}(\lambda)$ in $\mathbb{F}\setminus\mathcal{P}_{N}$ .* 2. (b)

$L_{N}(\lambda)$ * is a linearization with empty state matrix of $Q_{N}(\lambda)$ in $\mathbb{F}\setminus\mathcal{P}_{N}$ .* 3. (c)

$L_{N}(\lambda)$ * is a linearization with empty state matrix of $Q_{N}(\lambda)$ at $\infty$ of grade $i_{N}$ .*

Proof.

It is immediate to check that

[TABLE]

is a rational basis dual to $K_{N}(\lambda)$ . Note also that $K_{N}(\lambda)$ and $N_{N}(\lambda)$ have both full row rank in $\mathbb{F}\setminus\mathcal{P}_{N}$ . In addition, an easy direct computation proves $M_{N}(\lambda)N_{N}(\lambda)^{T}=Q_{N}(\lambda)$ . Thus, parts (a) and (b) follow from Theorem 5.4. Observe that parts (a) and (b) can also be proved from Corollary 5.9, since the structures of $Q_{N}(\lambda)$ , $L_{N}(\lambda)$ and $N_{N}(\lambda)$ are particular cases of those described in that corollary.

In order to prove part (c), note first that $\mbox{rev}_{1}\,K_{N}(\lambda)$ has full row rank at zero. We now consider the rational matrix $\mbox{rev}_{i_{N}-1}\,N_{N}(\lambda)=\lambda^{i_{N}-1}N_{N}\left(\frac{1}{\lambda}\right)$ , which is of the form

[TABLE]

where the entries $*$ are defined at $0.$ Denote by $i_{N-1}$ the number of infinite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{N-1})$ . Then, $b_{N-1}\left(\frac{1}{\lambda}\right)=\frac{1}{\lambda^{i_{N-1}}}c(\lambda),$ for a certain rational function $c(\lambda)$ with $c(0)\neq 0.$ Thus, we obtain that $\mbox{rev}_{i_{N}-1}\,N_{N}(\lambda)$ has full row rank at $0,$ taking into account that $i_{N-1}=i_{N}$ if $\xi_{N}\neq\infty,$ and $i_{N-1}=i_{N}-1$ if $\xi_{N}=\infty.$ Then, part (c) follows from Theorem 5.8. ∎

Combining Theorems 6.1 and 4.8, we get that $L_{N}(\lambda)$ contains all the information about the finite eigenvalues of $Q_{N}(\lambda)$ in $\mathbb{F}\setminus\mathcal{P}_{N}$ , including all type of multiplicities (algebraic, geometric and partial). Moreover, Proposition 4.17 allows us to recover the complete pole-zero structure of $Q_{N}(\lambda)$ at $\infty$ from the eigenvalue structure at [math] of $\operatorname{rev}L_{N}(\lambda)$ , just by noting that, in this case, $t=0$ in Proposition 4.17 since we are taking an empty state matrix. We stress that all these results hold for any rational matrix $Q_{N}(\lambda)$ either regular or singular. However, no information is provided on the finite poles of $Q_{N}(\lambda),$ and some of them could also be zeros. As explained above, this is not an issue in [18], since $\mathcal{P}_{N}$ is outside the target set $\Sigma$ . Nevertheless, at the cost of imposing extra hypotheses, we will solve this problem in Section 6.3 for completeness and also because it is of interest for the theory of REPs.

Remark 6.2.

Let $d_{N}(\lambda)$ and $d_{N-1}(\lambda)$ be the denominators of $b_{N}(\lambda)$ and $b_{N-1}(\lambda)$ in (54), respectively. Then, under the hypothesis $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ , $L_{N}(\lambda)$ is a strong block minimal bases pencil (recall Definition 5.1) associated with the polynomial matrix $d_{N}(\lambda)Q_{N}(\lambda)$ . This follows easily from the facts that $K_{N}(\lambda)$ in (57) is a minimal basis with all its row degrees equal to one, that

[TABLE]

is a minimal basis dual to $K_{N}(\lambda)$ with all its row degrees equal to $N-1$ , and that $M_{N}(\lambda)\widehat{N}_{N}(\lambda)^{T}=d_{N}(\lambda)Q_{N}(\lambda)$ . Thus, using the results stated in the paragraph after Definition 5.1, we get that $L_{N}(\lambda)$ is a $N$ -strong linearization of the polynomial matrix $d_{N}(\lambda)Q_{N}(\lambda)$ with empty state matrix. Since $d_{N}(\lambda)Q_{N}(\lambda)$ and $Q_{N}(\lambda)$ are equivalent in $\mathbb{F}\setminus\mathcal{P}_{N}$ , we obtain again the result in Theorem 6.1(b) through a different path which requires to use extra hypotheses. **

6.2 The NLEIGS low rank problem from the point of view of block full rank pencils

The second family of rational matrices considered in [18] comes from approximating NLEPs, $A(\lambda)x=0$ , such that the associated matrix $A(\lambda)$ is the sum of a polynomial matrix plus a matrix of the form $\sum_{i=1}^{n}C_{i}f_{i}(\lambda)$ , where the constant matrices $C_{i}$ have much smaller rank than the size of $A(\lambda)$ and $f_{i}(\lambda)$ are scalar nonlinear functions of $\lambda$ . This type of NLEPs arise in several applications [17] and are approximated in [18, eq. (6.2)] by a family of rational matrices of the form

[TABLE]

where $b_{0}(\lambda),\ldots,b_{N}(\lambda)$ are the scalar rational functions in (54), $\widetilde{D}_{0},\ldots,\widetilde{D}_{p}\in\mathbb{F}^{m\times m}$ , $\widetilde{L}_{p+1},\ldots,\widetilde{L}_{N}\in\mathbb{F}^{m\times r}$ and $\widetilde{U}\in\mathbb{F}^{m\times r}$ are constant matrices, and $r\ll m$ . For the functions in (55), let us consider the simpler notation $h_{i}:=h_{i}(\lambda)$ and $g_{i}:=g_{i}(\lambda)$ . Then, in order to solve the REP $\widetilde{Q}_{N}(\lambda)y=0$ efficiently by taking advantage of the low rank structure of $Q_{N}(\lambda),$ the following pencil is introduced in [18, Sec. 6.4]:

[TABLE]

where

[TABLE]

and

[TABLE]

A result analogous to Theorem 6.1 can be proved for the pencil $\widetilde{L}_{N}(\lambda)$ and the matrix $\widetilde{Q}_{N}(\lambda)$ . This is accomplished in Theorem 6.3. We remark, nevertheless, that the result concerning the linearizations at $\infty$ is weaker in Theorem 6.3 than in Theorem 6.1. This is an unavoidable consequence of the used approach and the low rank structure of $\widetilde{Q}_{N}(\lambda)$ .

Theorem 6.3.

Let $\widetilde{Q}_{N}(\lambda)$ be the rational matrix in (59) and $\widetilde{L}_{N}(\lambda)$ be the pencil in (60). Let $\mathcal{P}_{N}$ and $i_{N}$ be, respectively, the set of finite poles and the number of infinite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{N})$ . Then, the following statements hold:

(a)

$\widetilde{L}_{N}(\lambda)$ * partitioned as in (60) is a block full rank pencil with only one block column associated with $\widetilde{Q}_{N}(\lambda)$ in $\mathbb{F}\setminus\mathcal{P}_{N}$ .* 2. (b)

$\widetilde{L}_{N}(\lambda)$ * is a linearization with empty state matrix of $\widetilde{Q}_{N}(\lambda)$ in $\mathbb{F}\setminus\mathcal{P}_{N}$ .* 3. (c)

If, in addition, the poles $\xi_{p+1},\xi_{p+2},\ldots,\xi_{N-1}$ are all finite, then $\widetilde{L}_{N}(\lambda)$ is a linearization with empty state matrix of $\widetilde{Q}_{N}(\lambda)$ at $\infty$ of grade $i_{N}$ .

Proof.

The proof is similar to that of Theorem 6.1 with some differences coming from the presence of the low rank term in $\widetilde{Q}_{N}(\lambda)$ . It is immediate to check that

[TABLE]

is a rational basis dual to $\widetilde{K}_{N}(\lambda)$ , that $\widetilde{K}_{N}(\lambda)$ and $\widetilde{N}_{N}(\lambda)$ have both full row rank in $\mathbb{F}\setminus\mathcal{P}_{N}$ and that $\widetilde{M}_{N}(\lambda)\widetilde{N}_{N}(\lambda)^{T}=\widetilde{Q}_{N}(\lambda)$ . Thus, parts (a) and (b) follow from Theorem 5.4.

In order to prove part (c), note first that $\mbox{rev}_{1}\,\widetilde{K}_{N}(\lambda)$ has full row rank at zero as a consequence of the fact that the poles $\xi_{p+1},\xi_{p+2},\ldots,\xi_{N-1}$ are all finite. We now consider the rational matrix $\mbox{rev}_{i_{N}-1}\,\widetilde{N}_{N}(\lambda)=\lambda^{i_{N}-1}\widetilde{N}_{N}\left(\frac{1}{\lambda}\right)$ , which is of the form

[TABLE]

where the entries $*$ are defined at $0.$ Denote by $i_{p}$ the number of infinite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{p})$ . Then, $b_{p}\left(\frac{1}{\lambda}\right)=\frac{1}{\lambda^{i_{p}}}\tilde{c}(\lambda)$ for a certain rational function $\tilde{c}(\lambda)$ with $\tilde{c}(0)\neq 0.$ Taking into account that the poles $\xi_{p+1},\xi_{p+2},\ldots,\xi_{N-1}$ are all finite, we have that $i_{p}=i_{N}$ if $\xi_{N}\neq\infty,$ and $i_{p}=i_{N}-1$ if $\xi_{N}=\infty.$ Therefore, $\mbox{rev}_{i_{N}-1}\,\widetilde{N}_{N}(\lambda)$ has full row rank at [math] because $\tilde{c}(0)\neq 0$ . Thus, part (c) follows from Theorem 5.8. ∎

A discussion similar to the one in the last paragraph of Section 6.1 can be developed on the basis of Theorem 6.3. The details are omitted for brevity. The open problem corresponding to the information of the finite poles will be solved in Section 6.4.

6.3 The NLEIGS basic problem from the point of view of polynomial system matrices

As discussed previously, the approach presented in Section 6.1 to the NLEIGS pencil $L_{N}(\lambda)$ in (57) considers $L_{N}(\lambda)$ as a linearization with empty state matrix and, thus, it does not provide any information on the finite poles of $Q_{N}(\lambda)$ . In order to get this information, we need to identify a convenient square regular submatrix $A_{N}(\lambda)$ of $L_{N}(\lambda)$ that may be used as a state matrix. The block structure of $L_{N}(\lambda)$ makes it not possible to find such a matrix $A_{N}(\lambda)$ in a way that it includes the information of all the potential poles $(\xi_{1},\ldots,\xi_{N})$ . This is related with the comment included in [18, p. A2849] on the fact that $\xi_{N}$ plays a special role and that it is convenient to choose $\xi_{N}=\infty$ . In what follows we will not assume that $\xi_{N}=\infty$ , though the obtained results are simpler and stronger under such assumption, but we will focus on getting information on the finite poles in $(\xi_{1},\ldots,\xi_{N-1})$ . With this spirit, we consider the following partition of $L_{N}(\lambda)$ in (57), where $A_{N}(\lambda)$ will play the role of the state matrix,

[TABLE]

and the rest of the blocks are easily described from the blocks in (57) as follows: $B_{N}(\lambda)$ is the first block column of $K_{N}(\lambda)$ , $-C_{N}(\lambda)$ is obtained by removing the first block of $M_{N}(\lambda)$ and $A_{N}(\lambda)$ is obtained by removing the first block column of $K_{N}(\lambda)$ .

The next technical lemma reveals which is the transfer function matrix of $L_{N}(\lambda)$ , with the partition above, and establishes necessary and sufficient conditions for $L_{N}(\lambda)$ to be minimal in the whole field $\mathbb{F}$ . Of course, the conditions in Lemma 6.4(b) come from imposing that $\left[\begin{array}[]{cc}B_{N}(\lambda_{0})&A_{N}(\lambda_{0})\end{array}\right]\in\mathbb{F}^{m(N-1)\times mN}$ and $\left[\begin{array}[]{cc}-C_{N}(\lambda_{0})^{T}&A_{N}(\lambda_{0})^{T}\end{array}\right]^{T}\in\mathbb{F}^{mN\times m(N-1)}$ have, respectively, full row and column rank for any $\lambda_{0}\in\mathbb{F}$ , but have an important advantage with respect to these direct conditions for minimality. More precisely, the conditions in Lemma 6.4(b) require to evaluate the rational matrix $R_{N}(\lambda)$ of size $m\times m$ , which for practical problems is much smaller than $m(N-1)\times mN$ .

Lemma 6.4.

Let us consider the pencil $L_{N}(\lambda)$ in (57) as a polynomial system matrix with state matrix $A_{N}(\lambda)$ , where $A_{N}(\lambda)$ is defined through the partition (62), and let $Q_{N}(\lambda)$ be the rational matrix in (56). Then the following statements hold:

(a)

The transfer function matrix of $L_{N}(\lambda)$ is $\beta_{0}\left(1-\frac{\lambda}{\xi_{N}}\right)Q_{N}(\lambda).$ 2. (b)

Let us define the rational matrix $R_{N}(\lambda):=(Q_{N}(\lambda)-b_{0}(\lambda)D_{0})/b_{N}(\lambda)$ , whose explicit expression is

[TABLE]

let $\mathcal{P}_{N-1}$ be the set of finite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{N-1})$ , and assume $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ . Then, $L_{N}(\lambda)$ is minimal in $\mathbb{F}$ if and only if the matrix $R_{N}(\xi_{k})\in\mathbb{F}^{m\times m}$ is nonsingular for all $\xi_{k}\in\mathcal{P}_{N-1}$ .

Proof.

Part (a). According to (62), the transfer function matrix of $L_{N}(\lambda)$ is $D_{N}(\lambda)+C_{N}(\lambda)A_{N}(\lambda)^{-1}B_{N}(\lambda)$ . The computation of this transfer function is very easy because $B_{N}(\lambda)=\left[\begin{array}[]{cccc}-h_{0}(\lambda)I_{m}&0&\cdots&0\end{array}\right]^{T}$ , which implies that only the first block column of $A_{N}(\lambda)^{-1}$ is needed. It is immediate to check that this first block column is

[TABLE]

The rest of the proof of part (a) is just an elementary and short algebraic manipulation.

Part (b). The proof is elementary but long. Thus, it is postponed to A. ∎

We emphasize that Lemma 6.4(a) holds for any rational matrix $Q_{N}(\lambda)$ expressed as in (56) without imposing any extra condition. Moreover, the constant matrix $A_{N}(\lambda_{0})$ is invertible for any $\lambda_{0}\in\mathbb{F}\setminus\mathcal{P}_{N-1}$ and, so, $L_{N}(\lambda)$ is minimal in $\mathbb{F}\setminus\mathcal{P}_{N-1}$ . Combining these results with the fact that $Q_{N}(\lambda)$ and $\beta_{0}\left(1-\frac{\lambda}{\xi_{N}}\right)Q_{N}(\lambda)$ are equivalent in $\mathbb{F}$ if $\xi_{N}=\infty$ or in $\mathbb{F}\setminus\{\xi_{N}\}$ if $\xi_{N}$ is finite, we immediately obtain from Definitions 4.1 and 4.3 that $L_{N}(\lambda)$ is a linearization of $Q_{N}(\lambda)$ with state matrix $A_{N}(\lambda)$ in $\mathbb{F}\setminus\mathcal{P}_{N}$ , which is a result analogous to Theorem 6.1(b). This approach, of course, does not give any information on the finite poles of $Q_{N}(\lambda)$ , because the finite eigenvalues of $A_{N}(\lambda)$ coincide with $\mathcal{P}_{N-1}$ . Such information is obtained from the next result, which is the main result of this section and is a corollary of Lemma 6.4.

Theorem 6.5.

Let $Q_{N}(\lambda)$ be the rational matrix in (56), $L_{N}(\lambda)$ be the pencil in (57), $A_{N}(\lambda)$ be the submatrix of $L_{N}(\lambda)$ in (62), and $R_{N}(\lambda)$ be the rational matrix in (63). Consider $\mathcal{P}_{N-1}$ the set of finite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{N-1})$ , and assume $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ . If $R_{N}(\xi_{k})\in\mathbb{F}^{m\times m}$ is nonsingular for every $\xi_{k}\in\mathcal{P}_{N-1}$ , then $L_{N}(\lambda)$ is a linearization of $Q_{N}(\lambda)$ with state matrix $A_{N}(\lambda)$ in $\mathbb{F}$ , if $\xi_{N}=\infty$ , or in $\mathbb{F}\setminus\{\xi_{N}\}$ , if $\xi_{N}$ is finite.

Proof.

Under the hypotheses of Theorem 6.5, $L_{N}(\lambda)$ is minimal in $\mathbb{F}$ . Moreover, its transfer function matrix, i.e., $\beta_{0}\left(1-\frac{\lambda}{\xi_{N}}\right)Q_{N}(\lambda)$ is equivalent to $Q_{N}(\lambda)$ in $\mathbb{F}$ , if $\xi_{N}=\infty$ , or in $\mathbb{F}\setminus\{\xi_{N}\}$ , if $\xi_{N}$ is finite. The result follows immediately from Definitions 4.1 and 4.3 with $s_{1}=s_{2}=0$ . ∎

We emphasize that the hypotheses that the constant matrices $R_{N}(\xi_{k})$ in Theorem 6.5 are nonsingular are not mentioned at all in [18], but, fortunately, are generic, in the sense that they are satisfied by almost all regular rational matrices $Q_{N}(\lambda)$ expressed as in (56).

Remark 6.6.

Under the conditions of Theorem 6.5, the pole elementary divisors of $Q_{N}(\lambda)$ in $\mathbb{F}$ , if $\xi_{N}=\infty$ , or in $\mathbb{F}\setminus\{\xi_{N}\}$ , if $\xi_{N}$ is finite, are the elementary divisors of $A_{N}(\lambda)$ , as a consequence of Theorem 4.8. These elementary divisors can be very easily determined as follows: first express $A_{N}(\lambda)=\widehat{A}_{N}(\lambda)\otimes I_{m}$ ; second note that if $\widehat{S}_{N}(\lambda)$ is the Smith form of $\widehat{A}_{N}(\lambda)$ , then $\widehat{S}_{N}(\lambda)\otimes I_{m}$ is the Smith form of $A_{N}(\lambda)$ ; third, use the fact that $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ , to prove that the greatest common divisor of all $(N-2)\times(N-2)$ minors of $\widehat{A}_{N}(\lambda)$ is equal to $1$ , which implies, according to [15, Ch. VI], that there is only one invariant polynomial of $\widehat{S}_{N}(\lambda)$ different from $1$ and that is equal to

[TABLE]

where $c\in\mathbb{F}$ is a constant that makes $p(\lambda)$ monic. Finally, we get that $A_{N}(\lambda)$ has $m$ invariant polynomials different from $1$ all equal to $p(\lambda)$ . This allows us to obtain easily the finite elementary divisors of $A_{N}(\lambda)$ and, thus, the finite pole elementary divisors of $Q_{N}(\lambda)$ (in $\mathbb{F}$ if $\xi_{N}=\infty$ , or in $\mathbb{F}\setminus\{\xi_{N}\}$ if $\xi_{N}$ is finite). In particular, they are of the form $(\lambda-\xi_{i})^{\nu_{i}}$ and, in order to obtain the partial multiplicities $\nu_{i},$ we have to take into account possible repetitions in $(\xi_{1},\ldots,\xi_{N-1})$ . Observe that the infinite $\xi_{i}$ for $i=1,\ldots,N-1$ do not contribute at all to the finite pole elementary divisors of $Q_{N}(\lambda).$ Moreover, if $\xi_{N}=\infty$ , then we can state the compact and simple result that the $m$ denominators of the global Smith–McMillan form of $Q_{N}(\lambda)$ are all equal to $p(\lambda)$ . However, with this choice of state matrix, there is no way of obtaining information on the pole structure of $\xi_{N}$ when it is finite. This is the reason why, even if $L_{N}(\lambda)$ is minimal in $\mathbb{F}$ , $L_{N}(\lambda)$ is not a linearization of $Q_{N}(\lambda)$ in $\mathbb{F}.$

6.4 The NLEIGS low rank problem from the point of view of polynomial system matrices

The results in this section are the counterpart for $\widetilde{Q}_{N}(\lambda)$ in (59) and $\widetilde{L}_{N}(\lambda)$ in (60) of those presented in Section 6.3 for $Q_{N}(\lambda)$ and $L_{N}(\lambda)$ . For brevity, we avoid in this section to introduce auxiliary comments similar to the corresponding ones in Section 6.3 and just some relevant differences are remarked. The motivation of this section is to obtain from $\widetilde{L}_{N}(\lambda)$ information about the finite poles of $\widetilde{Q}_{N}(\lambda)$ . For this purpose, we consider the following partition of $\widetilde{L}_{N}(\lambda)$ in (60), where $\widetilde{A}_{N}(\lambda)$ will play the role of the state matrix,

[TABLE]

and the rest of the blocks are easily described from the blocks in (60) as follows: $\widetilde{B}_{N}(\lambda)$ is the first block column of $\widetilde{K}_{N}(\lambda)$ , $-\widetilde{C}_{N}(\lambda)$ is obtained by removing the first block of $\widetilde{M}_{N}(\lambda)$ , and $\widetilde{A}_{N}(\lambda)$ is obtained by removing the first block column of $\widetilde{K}_{N}(\lambda)$ .

The next lemma is the counterpart of Lemma 6.4. Note that the low rank structure in $\widetilde{Q}_{N}(\lambda)$ complicates the minimality conditions in part (b) of Lemma 6.7, which are expressed in terms of matrices of size $(2m+r)\times(m+r)$ .

Lemma 6.7.

Let us consider the pencil $\widetilde{L}_{N}(\lambda)$ in (60) as a polynomial system matrix with state matrix $\widetilde{A}_{N}(\lambda)$ , where $\widetilde{A}_{N}(\lambda)$ is defined through the partition (64), and let $\widetilde{Q}_{N}(\lambda)$ be the rational matrix in (59). Then the following statements hold:

(a)

The transfer function matrix of $\widetilde{L}_{N}(\lambda)$ is $\beta_{0}\left(1-\frac{\lambda}{\xi_{N}}\right)\widetilde{Q}_{N}(\lambda).$ 2. (b)

Let us define the rational matrices

[TABLE]

and

[TABLE]

Let $\mathcal{P}_{N-1}$ be the set of finite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{N-1})$ , and assume that $\mathop{\rm rank}\nolimits\widetilde{U}=r$ and that $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ . Then, $\widetilde{L}_{N}(\lambda)$ is minimal in $\mathbb{F}$ if and only if the matrix $\widetilde{R}_{N}(\xi_{k})\in\mathbb{F}^{(2m+r)\times(m+r)}$ has full column rank for all $\xi_{k}\in\mathcal{P}_{N-1}$ .

Proof.

Part (a). The proof is similar to that of Lemma 6.4(a) with some differences coming from the presence of the low rank term in $\widetilde{Q}_{N}(\lambda)$ . According to (64), the transfer function matrix of $\widetilde{L}_{N}(\lambda)$ is $\widetilde{D}_{N}(\lambda)+\widetilde{C}_{N}(\lambda)\widetilde{A}_{N}(\lambda)^{-1}\widetilde{B}_{N}(\lambda)$ . The computation of this matrix is very easy because, again, $\widetilde{B}_{N}(\lambda)=\left[\begin{array}[]{cccc}-h_{0}I_{m}&0&\cdots&0\end{array}\right]^{T}$ , and only the first block column of $\widetilde{A}_{N}(\lambda)^{-1}$ is needed, which, in this case, is equal to

[TABLE]

Part (b). The proof is elementary but long. Thus, it is postponed to B. ∎

Remark 6.8.

If, in addition to $\mathop{\rm rank}\nolimits\widetilde{U}=r$ and $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ , we assume that $\xi_{1}=\cdots=\xi_{p}=\infty$ , then the necessary and sufficient conditions for minimality in Lemma 6.7(b) can be considerably simplified, since we get as an immediate corollary of Lemma 6.7(b) that “ $\widetilde{L}_{N}(\lambda)$ is minimal in $\mathbb{F}$ if and only if the matrix $\widetilde{R}_{N}^{(2)}(\xi_{k})\in\mathbb{F}^{m\times r}$ has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ ”. Note that the hypothesis $\xi_{1}=\cdots=\xi_{p}=\infty$ implies that the “no-low rank” term $\sum_{i=0}^{p}b_{i}(\lambda)\widetilde{D}_{i}$ of $\widetilde{Q}_{N}(\lambda)$ is a polynomial matrix, as often happens in NLEPs [18].

Observe also that if $\widehat{R}_{N}(\lambda)$ is the $(m+r)\times(m+r)$ matrix obtained from $\widetilde{R}_{N}(\lambda)$ in (65) by removing the second block row, then under the assumptions $\mathop{\rm rank}\nolimits\widetilde{U}=r$ and $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ , we get, as another immediate corollary of Lemma 6.7(b), the following sufficient condition for minimality: “if $\widehat{R}_{N}(\xi_{k})\in\mathbb{F}^{(m+r)\times(m+r)}$ is invertible for every $\xi_{k}\in\mathcal{P}_{N-1}$ , then $\widetilde{L}_{N}(\lambda)$ is minimal in $\mathbb{F}$ ”. **

Theorem 6.9 is the main result in this section and is an easy corollary of Lemma 6.7. Its proof is omitted because is very similar to that of Theorem 6.5.

Theorem 6.9.

Let $\widetilde{Q}_{N}(\lambda)$ be the rational matrix in (59), $\widetilde{L}_{N}(\lambda)$ be the pencil in (60), $\widetilde{A}_{N}(\lambda)$ be the submatrix of $\widetilde{L}_{N}(\lambda)$ in (64), and $\widetilde{R}_{N}(\lambda)$ be the rational matrix in (65). Consider $\mathcal{P}_{N-1}$ the set of finite poles in the list $(\xi_{1},\xi_{2},\ldots,\xi_{N-1})$ . If $\mathop{\rm rank}\nolimits\widetilde{U}=r$ , $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ , $0\leq j\leq N-1$ , and $\widetilde{R}_{N}(\xi_{k})\in\mathbb{F}^{(2m+r)\times(m+r)}$ has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ , then $\widetilde{L}_{N}(\lambda)$ is a linearization of $\widetilde{Q}_{N}(\lambda)$ with state matrix $\widetilde{A}_{N}(\lambda)$ in $\mathbb{F}$ , if $\xi_{N}=\infty$ , or in $\mathbb{F}\setminus\{\xi_{N}\}$ , if $\xi_{N}$ is finite.

Finally, note that the conditions in Theorem 6.9 on the full column rank of the matrices $\widetilde{R}_{N}(\xi_{k})$ can be simplified as in Remark 6.8 under extra hypotheses.

7 Conclusions and future work

A theory of local linearizations of rational matrices has been carefully presented in this paper, by developing as starting point the extension of Rosenbrock’s minimal polynomial system matrices to a local scenario. Moreover, this theory has been applied to a number of pencils that have appeared recently in some influential papers on solving numerically NLEPs by combining rational approximations, linearizations of the resulting rational matrices, and efficient numerical algorithms for generalized eigenvalue problems adapted to the structure of such linearizations. It has been emphasized throughout the paper that the theory of local linearizations allows us to view these pencils, and to explain their properties, from rather different perspectives, which depend on the particular choice of the submatrix of the pencil to be considered as state matrix. In particular, we have seen that the choice of an empty state matrix is simple and adequate for those rational matrices and pencils arising in NLEPs, when the poles are already known from the approximation process. This has led us to define and analyze the very general family of block full rank pencils, as a template that covers many of the pencils, available in the literature, that linearize the rational approximations in the corresponding target set. We plan to extend these ideas in [12], where other ways to choose the state matrices will be explored. In addition, the results in this paper and also the new ones in [12] will be applied to the pencils defined in [21], as well as to other pencils. Finally, we also plan to study numerical properties of some of the linearizations analyzed in this work. In particular, given a linearization of the REP in a set, it is important to study the backward stability in terms of the structure of the rational matrix defining the REP when applying a numerical method to compute the eigenvalues of the linearization. In addition, we plan to investigate the conditioning of eigenvalues, that is, the sensitivity to perturbations, both in the original REP and its linearization, of a zero that is not a pole of the rational matrix.

Appendix A Proof of Lemma 6.4(b)

Let us consider $L_{N}(\lambda)$ partitioned as in (62) and as a polynomial system matrix with state matrix $A_{N}(\lambda)$ . Recall throughout the proof that the parameters $\beta_{0},\beta_{1},\ldots,\beta_{N}$ are all different from zero. Observe first that $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ and $0\leq j\leq N-1$ , implies that $\left[\begin{array}[]{cc}B_{N}(\lambda_{0})&A_{N}(\lambda_{0})\end{array}\right]$ has full row rank for any $\lambda_{0}\in\mathbb{F}$ . On the other hand, if we define

[TABLE]

then $Z_{N}(\lambda_{0})$ has full column rank for every $\lambda_{0}\in\mathbb{F}\setminus\mathcal{P}_{N-1}$ , because $A_{N}(\lambda_{0})$ is invertible in $\mathbb{F}\setminus\mathcal{P}_{N-1}$ . Therefore, combining the discussion above with Definition 3.3, we obtain that $L_{N}(\lambda)$ is minimal in $\mathbb{F}$ if and only if $Z_{N}(\xi_{k})$ has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ . The rest of the proof proceeds as follows: we will find a rational matrix $S_{N}(\lambda)$ such that is equivalent to $Z_{N}(\lambda)$ in $\mathcal{P}_{N-1}$ and has a simple structure that allows us to see that $S_{N}(\xi_{k})$ (and, so, $Z_{N}(\xi_{k})$ ) has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ if and only if $R_{N}(\xi_{k})$ is invertible for every $\xi_{k}\in\mathcal{P}_{N-1}$ , where $R_{N}(\lambda)$ is the rational matrix in (63).

For brevity, we use the notation $g_{i}:=g_{i}(\lambda)$ and $h_{i}:=h_{i}(\lambda)$ for the scalar functions in (55). In addition, $Z_{N}(\lambda)$ in (66) is partitioned as

[TABLE]

where

[TABLE]

Note that the matrix $Z_{21}(\lambda)$ is invertible in $\mathcal{P}_{N-1}$ and that the last block column of $Z_{21}(\lambda)^{-1}$ is

[TABLE]

Next, a sequence of equivalence transformations in $\mathcal{P}_{N-1}$ are applied to $Z_{N}(\lambda)$ . Such transformations are described by using the notation in (67) and (68), and the first one is

[TABLE]

The second transformation is designed to turn zero the second block row of $Z_{11}(\lambda)$ as follows

[TABLE]

The third transformation turns zero the block $g_{N-1}Y_{22}(\lambda)$ of $W_{N}(\lambda)$ and performs a convenient scalar multiplication in its first block row. Such transformation is

[TABLE]

where $R_{N}(\lambda)$ is the rational matrix in (63). The last transformation makes zero the first $N-2$ blocks of size $m\times m$ in the first block row of $X_{N}(\lambda)$ and yields the announced matrix $S_{N}(\lambda)$ equivalent to $Z_{N}(\lambda)$ in $\mathcal{P}_{N-1}$ . More precisely,

[TABLE]

The block $H(\lambda):=\left(\prod_{i=1}^{N-2}\frac{g_{i}}{h_{i}}\right)g_{N-1}I_{m}$ of $S_{N}(\lambda)$ satisfies $H(\xi_{k})=0$ for all $\xi_{k}\in\mathcal{P}_{N-1}$ . Therefore, $S_{N}(\xi_{k})$ (and, so, $Z_{N}(\xi_{k})$ ) has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ if and only if $R_{N}(\xi_{k})$ is invertible for all $\xi_{k}\in\mathcal{P}_{N-1}$ , and the result is proved.

Appendix B Proof of Lemma 6.7(b)

The first part of the proof is completely analogous to the first part of the proof of Lemma 6.4(b). So, some details are ommited. Let us consider $\widetilde{L}_{N}(\lambda)$ partitioned as in (64) and as a polynomial system matrix with state matrix $\widetilde{A}_{N}(\lambda)$ . Then the hypotheses $\mathop{\rm rank}\nolimits\widetilde{U}=r$ and $\xi_{i}\neq\sigma_{j}$ , $1\leq i\leq N$ and $0\leq j\leq N-1$ , imply that $\left[\begin{array}[]{cc}\widetilde{B}_{N}(\lambda_{0})&\widetilde{A}_{N}(\lambda_{0})\end{array}\right]$ has full row rank for any $\lambda_{0}\in\mathbb{F}$ . Also, if we define

[TABLE]

then $\widetilde{Z}_{N}(\lambda_{0})$ has full column rank for every $\lambda_{0}\in\mathbb{F}\setminus\mathcal{P}_{N-1}$ , because $\widetilde{A}_{N}(\lambda_{0})$ is invertible in $\mathbb{F}\setminus\mathcal{P}_{N-1}$ . Therefore, $\widetilde{L}_{N}(\lambda)$ is minimal in $\mathbb{F}$ if and only if $\widetilde{Z}_{N}(\xi_{k})$ has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ . In the rest of the proof we will find a rational matrix $\widetilde{S}_{N}(\lambda)$ such that is equivalent to $\widetilde{Z}_{N}(\lambda)$ in $\mathcal{P}_{N-1}$ and that allows us to see that $\widetilde{S}_{N}(\xi_{k})$ (and, so, $\widetilde{Z}_{N}(\xi_{k})$ ) has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ if and only if $\widetilde{R}_{N}(\xi_{k})$ in (65) has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ . We advance that this second part of the proof is considerably more involved than the corresponding part of the proof of Lemma 6.4(b), as a consequence of the presence in $\widetilde{Z}_{N}(\lambda)$ of two kinds of blocks, one kind corresponding to the “full rank” part of $\widetilde{Q}_{N}(\lambda)$ , i.e., the first summation in (59), and another kind corresponding to the “low rank” part of $\widetilde{Q}_{N}(\lambda)$ . Nevertheless, the equivalence transformations in $\mathcal{P}_{N-1}$ used in the sequel are similar to those in the proof of Lemma 6.4(b), and many details will be omitted for brevity. Recall that we use the notation in (55) omitting the dependence on $\lambda$ for simplicity, i.e., we write simply $g_{i}$ and $h_{j}$ .

The first two equivalence transformations in $\mathcal{P}_{N-1}$ that we perform affect only to the last $N-1-p$ block rows of $\widetilde{Z}_{N}(\lambda)$ , i.e., those containing $I_{r}$ matrices. Thus, in this part of the proof, it is convenient to partition $\widetilde{Z}_{N}(\lambda)$ as

[TABLE]

with $\widetilde{Z}_{N}^{(1)}(\lambda)$ comprising the first $p+1$ block rows of $\widetilde{Z}_{N}(\lambda)$ . In order to construct the first equivalence transformation, we pay attention to the following submatrix of $\widetilde{Z}_{N}^{(2)}(\lambda)$ ,

[TABLE]

which is invertible in $\mathcal{P}_{N-1}$ and has the same structure as $Z_{21}(\lambda)$ in (67). The last block column of $\widetilde{H}_{N}(\lambda)^{-1}$ has a structure similar to (68) and is denoted by $J(\lambda)$ . Then, the first two equivalence transformations are

[TABLE]

where

[TABLE]

with $e_{p}^{T}=[0\,\,\cdots\,\,0\,\,1]\in\mathbb{F}^{1\times p}$ . In order to describe the outcome of the next two transformations, we consider the following submatrix of $\widetilde{Z}_{N}^{(1)}(\lambda)$ :

[TABLE]

The next equivalence transformations in $\mathcal{P}_{N-1}$ are

[TABLE]

where $\widetilde{R}_{N}^{(2)}(\lambda)$ is the rational matrix appearing in (65). Observe that the structure of the last block row of $\widetilde{X}_{N}(\lambda)$ allows us to perform an equivalence transformation in $\mathcal{P}_{N-1}$ that turns the block $\left[\begin{array}[]{ccc}\frac{g_{N}}{h_{N-1}}\widetilde{L}_{p+1}&\cdots&\frac{g_{N}}{h_{N-1}}\widetilde{L}_{N-2}\end{array}\right]$ into [math] without changing the remaining blocks. The resulting matrix is called $\widehat{X}_{N}(\lambda)$ . Now, denote by $E_{21}(\lambda)$ the matrix obtained from $\widetilde{E}_{N}(\lambda)$ by removing its first block row and its last block column, and observe that $E_{21}(\lambda)$ is invertible in $\mathcal{P}_{N-1}$ and has the same structure as $Z_{21}(\lambda)$ in (67) with $N-2$ replaced by $p-1$ . The last block column of $E_{21}(\lambda)^{-1}$ is denoted by $\widetilde{Y}_{22}(\lambda)$ . With this information, the following equivalence transformations are

[TABLE]

where $I_{s}=I_{(p-2)m+(N-1-p)r}$ , and

[TABLE]

where $\widetilde{R}_{N}^{(1)}(\lambda)$ is the rational matrix appearing in (65). Finally, the announced matrix $\widetilde{S}_{N}(\lambda)$ is obtained from $\widehat{S}_{N}(\lambda)$ by using its third block row to transform the block $\left[\begin{array}[]{ccc}\!\!\frac{g_{N}}{h_{N-1}}\widetilde{D}_{1}&\cdots&\!\!\!\!\frac{g_{N}}{h_{N-1}}\widetilde{D}_{p-1}\end{array}\right]$ into [math] without changing the remaining blocks. The structure of $\widetilde{S}_{N}(\lambda)$ implies immediately that $\widetilde{S}_{N}(\xi_{k})$ has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ if and only if $\widetilde{R}_{N}(\xi_{k})$ in (65) has full column rank for every $\xi_{k}\in\mathcal{P}_{N-1}$ .

Bibliography38

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] R. Alam, N. Behera, Linearizations for rational matrix functions and Rosenbrock system polynomials , SIAM J. Matrix Anal. Appl. 37(1) (2016) 354–380.
2[2] A. Amiraslani, R. M. Corless, P. Lancaster, Linearization of matrix polynomials expressed in polynomial bases , IMA J. Numer. Anal. 29 (2009) 141–157.
3[3] A. Amparan, S. Marcaida, I. Zaballa, On the structure invariants of proper rational matrices with prescribed finite poles , Linear and Multilinear Algebra 61(11) (2013) 1464–1486.
4[4] A. Amparan, S. Marcaida, I. Zaballa, Finite and infinite structures of rational matrices: a local approach , Electron. J. Linear Algebra 30 (2015) 196–226.
5[5] A. Amparan, F. M. Dopico, S. Marcaida, I. Zaballa, Strong linearizations of rational matrices , SIAM J. Matrix Anal. Appl. 39(4) (2018) 1670–1700.
6[6] D. J. Cullen, Local system equivalence , Math. Systems Theory 19 (1986) 67-78.
7[7] R. Das, R. Alam, Recovery of minimal bases and minimal indices of rational matrices from Fiedler-like pencils , Linear Algebra Appl. 566 (2019) 34–60.
8[8] R. Das, R. Alam, Affine spaces of strong linearizations for rational matrices and the recovery of eigenvectors and minimal bases , Linear Algebra Appl. 569 (2019) 335–368.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Local Linearizations of Rational Matrices with Application to Rational Approximations of Nonlinear Eigenvalue Problems

Abstract

keywords:

1 Introduction

2 Preliminaries

Definition 2.1**.**

Definition 2.2**.**

Definition 2.3**.**

Definition 2.4**.**

Definition 2.5**.**

Definition 2.6**.**

Proposition 2.7**.**

Proof.

3 Polynomial system matrices minimal in subsets of F\mathbb{F}F and at infinity

3.1 Polynomial system matrices minimal in subsets of F\mathbb{F}F

Definition 3.1** (Polynomial system matrix minimal at a point in F\mathbb{F}F).**

Remark 3.2**.**

Definition 3.3** (Polynomial system matrix minimal in a subset of F\mathbb{F}F).**

Remark 3.4**.**

Example 3.5**.**

Theorem 3.6**.**

Proof.

Theorem 3.7**.**

Example 3.8**.**

3.2 Polynomial system matrices minimal at infinity

Definition 3.9** (ggg-reversal of a rational matrix).**

Definition 3.10** (Polynomial system matrix minimal at infinity).**

Example 3.11**.**

Remark 3.12**.**

Theorem 3.13**.**

Proof.

Lemma 3.14**.**

Proof.

Theorem 3.15**.**

Proof.

Example 3.16**.**

Definition 3.17** (Strongly minimal polynomial system matrix).**

4 Local linearizations of rational matrices

4.1 Linearizations in subsets of F\mathbb{F}F

Definition 4.1** (Linearization at a point in F\mathbb{F}F).**

Remark 4.2**.**

Definition 4.3** (Linearization in a subset of F\mathbb{F}F).**

Remark 4.4**.**

Remark 4.5**.**

Lemma 4.6**.**

Proof.

Corollary 4.7**.**

Proof.

Theorem 4.8** (Spectral characterization of linearizations at a point in F\mathbb{F}F).**

Proof.

Proposition 4.9**.**

Example 4.10**.**

Example 4.11**.**

4.2 Linearizations at infinity and in sets containing infinity

Definition 4.12** (Linearization at infinity of grade ggg).**

Proposition 4.13**.**

Proof.

Remark 4.14**.**

Remark 4.15**.**

Theorem 4.16** (Spectral characterization of linearizations at infinity).**

Proposition 4.17**.**

Proof.

Proposition 4.18**.**

Example 4.19**.**

Example 4.20**.**

Definition 4.21** (ggg-strong linearization).**

Example 4.22**.**

4.3 Comparison with another definition of strong linearization

Proposition 4.23**.**

Proof.

Example 4.24**.**

Example 4.25**.**

Example 4.26**.**

5 Block full rank pencils

Definition 2.1.

Definition 2.2.

Definition 2.3.

Definition 2.4.

Definition 2.5.

Definition 2.6.

Proposition 2.7.

3 Polynomial system matrices minimal in subsets of $\mathbb{F}$ and at infinity

3.1 Polynomial system matrices minimal in subsets of $\mathbb{F}$

Definition 3.1 (Polynomial system matrix minimal at a point in $\mathbb{F}$ ).

Remark 3.2.

Definition 3.3 (Polynomial system matrix minimal in a subset of $\mathbb{F}$ ).

Remark 3.4.

Example 3.5.

Theorem 3.6.

Theorem 3.7.

Example 3.8.

Definition 3.9 ( $g$ -reversal of a rational matrix).

Definition 3.10 (Polynomial system matrix minimal at infinity).

Example 3.11.

Remark 3.12.

Theorem 3.13.

Lemma 3.14.

Theorem 3.15.

Example 3.16.

Definition 3.17 (Strongly minimal polynomial system matrix).

4.1 Linearizations in subsets of $\mathbb{F}$

Definition 4.1 (Linearization at a point in $\mathbb{F}$ ).

Remark 4.2.

Definition 4.3 (Linearization in a subset of $\mathbb{F}$ ).

Remark 4.4.

Remark 4.5.

Lemma 4.6.

Corollary 4.7.

Theorem 4.8 (Spectral characterization of linearizations at a point in $\mathbb{F}$ ).

Proposition 4.9.

Example 4.10.

Example 4.11.

Definition 4.12 (Linearization at infinity of grade $g$ ).

Proposition 4.13.

Remark 4.14.

Remark 4.15.

Theorem 4.16 (Spectral characterization of linearizations at infinity).

Proposition 4.17.

Proposition 4.18.

Example 4.19.

Example 4.20.

Definition 4.21 ( $g$ -strong linearization).

Example 4.22.

Proposition 4.23.

Example 4.24.

Example 4.25.

Example 4.26.

Definition 5.1.

Definition 5.2.

Lemma 5.3.

Theorem 5.4.

Remark 5.5.

Remark 5.6.

Remark 5.7.

Theorem 5.8.

Corollary 5.9.

Remark 5.10.

Example 5.11.

Example 5.12.

Theorem 6.1.

Remark 6.2.

Theorem 6.3.

Lemma 6.4.

Theorem 6.5.

Remark 6.6.

Lemma 6.7.

Remark 6.8.

Theorem 6.9.