On the gap between deterministic and probabilistic joint spectral radii for discrete-time linear systems
Yacine Chitour, Guilherme Mazanti, Mario Sigalotti

TL;DR
This paper explores the relationship between deterministic and probabilistic measures of asymptotic behavior in discrete-time linear switched systems, aiming to characterize when these measures are equal.
Contribution
It provides a characterization of the conditions under which the deterministic joint spectral radius equals certain probabilistic spectral radii.
Findings
Identifies conditions for equality between deterministic and probabilistic spectral radii.
Analyzes the sets of matrices where such equalities hold.
Provides insights into the structure of systems with matching spectral radii.
Abstract
Given a discrete-time linear switched system associated with a finite set of matrices, we consider the measures of its asymptotic behavior given by, on the one hand, its deterministic joint spectral radius and, on the other hand, its probabilistic joint spectral radii for Markov random switching signals with transition matrix and a corresponding invariant probability . Note that is larger than or equal to for every pair . In this paper, we investigate the cases of equality of with either a single or with the supremum of over and we aim at characterizing the sets for which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the gap between deterministic and probabilistic joint spectral radii for discrete-time linear systems††thanks: This research was partially supported by the iCODE Institute, research project of the IDEX Paris-Saclay. The second author was partially supported by the public grant number ANR-10-CAMP-0151-02-FMJH as part of the “Programme des Investissements d’Avenir”.
Yacine Chitour, Guilherme Mazanti††footnotemark: , Mario Sigalotti††footnotemark: Université Paris-Saclay, CNRS, CentraleSupélec, Laboratoire des signaux et systèmes, 91190, Gif-sur-Yvette, France.Inria, France. Sorbonne Université, Université de Paris, CNRS, Inria, Laboratoire Jacques-Louis Lions, 75005 Paris, France.
Abstract
Given a discrete-time linear switched system associated with a finite set of matrices, we consider the measures of its asymptotic behavior given by, on the one hand, its deterministic joint spectral radius and, on the other hand, its probabilistic joint spectral radii for Markov random switching signals with transition matrix and a corresponding invariant probability . Note that is larger than or equal to for every pair . In this paper, we investigate the cases of equality of with either a single or with the supremum of over and we aim at characterizing the sets for which such equalities may occur.
Keywords. Linear switched systems, discrete time, joint spectral radius, Markov process.
2020 Mathematics Subject Classification. 93C30, 93C55, 37H15.
This paper was first published in Linear Algebra and its Applications, 613:24–45, 2021. With respect to the published version, this version provides an additional remark (Remark 2.10) and a more precise proof of Lemma 3.2. All modifications with respect to the published version are given in blue. The authors are very grateful to Matteo Della Rossa for pointing out the imprecisions in the previous version of the proof of Lemma 3.2.
1 Introduction
In this paper, we consider discrete-time switched linear systems of the form
[TABLE]
where and are positive integers, , is the set of the set of all maps , and is an -tuple of matrices with real coefficients.
Switched systems model the behavior of a continuous variable whose dynamics may change over time according to the value of a discrete variable . These models are useful for several applications, ranging from air traffic control, electronic circuits, and automotive engines to chemical processes and population models in biology. This wide field of applications, together with the interesting mathematical questions arising from their analysis, justify the extensive literature on switched systems, which have been studied from the point of view of both deterministic and random switching [28, 22, 29, 23, 7, 6]. A commonly used point of view on the switching signal , which we adopt in this paper, is to consider it as an uncertainty or perturbation acting on the system, the goal being thus to provide properties of the system independent of a particular choice of .
We are interested in describing the asymptotic behavior of . For a given , the asymptotic behavior of the corresponding non-autonomous linear system is measured by the quantity defined by
[TABLE]
Indeed, if and only if all trajectories of the non-autonomous system converge exponentially to the origin.
In order to capture the asymptotic behavior of , we must formulate some condition which is independent of the choice of . There exist two main approaches to proceed. The first one is deterministic and consists in considering the joint spectral radius of , defined as the supremum of over all . Since its introduction in [26] and after the seminal paper [13], it has been extensively studied in the computer science and control theory communities (see, e.g., the monograph [19]).
The other approach to handle the asymptotic behavior of is probabilistic and amounts to considering a probability measure on and hence as a random variable. We may then consider as a probabilistic joint spectral radius the expected value of with respect to the probability law , which we denote by . There exists a vast literature devoted to the properties of products of random matrices, and we refer the reader to [1, 5, 8] for more details. A major result in this field has been obtained in [16] and provides general conditions on under which on a set of probability .
The interest in considering and comes from the stability analysis of (1.1). Indeed, if and only if (1.1) is uniformly exponentially stable [19], whereas, under the conditions of [16], if and only if -almost every trajectory of (1.1) converges exponentially to the origin.
In this paper, we aim at understanding the relations between the deterministic and the probabilistic approaches. The deterministic measure of stability characterizes the worst possible behavior over all , while the probabilistic counterpart provides the average behavior for corresponding to the probability measure . As a consequence, the deterministic approach provides a more conservative estimate of the asymptotic behavior of the system than the probabilistic one, in the sense that
[TABLE]
A natural question is then to understand under which conditions on and the inequality in (1.3) is strict. Furthermore, for practical and modeling purposes, it is important to understand whether, given a family of probability measures , the strict inequality holds true. Regarding the first question, it is known that there always exists a measure such that equality holds in (1.3) (see, for instance, [24], where such measures are referred to as maximizing measures). At such a level of generality, a handy characterization of maximizing measures cannot be expected. This is why we restrict our attention to the family of probability measures on obtained from discrete-time shift-invariant Markov chains and reformulate the previous two questions as follows: under which conditions on do we have
- (Q1)
equality between and for a given ? 2. (Q2)
equality between and ?
Notice that the condition is related to the almost sure stability of the system uniformly with respect to the Markov process, a stability property first considered in [18] in the case of Markov chains with positive transition probabilities. Other stability notions have also been considered for (1.1), such as periodic stability, meaning stability for all periodic signals , or mean square stability. Several works explore relations between these different notions, see, e.g., [18, 14, 15, 12, 6, 9]. In particular, [12] establishes a probabilistic version of the finiteness conjecture, i.e., if (1.1) is periodically stable, then for every .
Another interesting fact is that the quantities and of for could be equivalently computed by replacing the norm in (1.2) by the spectral radius. In the deterministic case, this result is known as the Berger–Wang formula or also as the Joint Spectral Radius Theorem [19], and it has been extended to the Markovian setting in [20, 10].
In order to describe the main results of our paper, let us identify a measure with the pair , where is the transition matrix of the Markov chain corresponding to and is its (invariant) initial probability. In particular, we write for . Our main result concerning (a) (see Theorem 3.1) establishes that a necessary and sufficient condition for equality is that for every that corresponds to a cycle in the directed weighted graph determined by such that . The necessity follows from results provided in [24], whereas, for sufficiency, we consider first the particular case where is irreducible and is strongly connected (see Lemma 3.3). Irreducibility implies in particular the existence of a Barabanov norm for (see Definition 2.1), which is an important tool in our proof. We then generalize the result to the case of reducible (see Lemma 3.5) by a suitable block decomposition of the matrices in and the fact that and can be read on the diagonal blocks of the decomposed matrix (cf. [19, 17]). Finally, the general case for can be obtained by using a classical block decomposition of stochastic matrices.
The equivalence established in Theorem 3.1 can be further characterized in terms of simultaneous similarity of the matrices , , to orthogonal matrices, under some additional assumptions on and (Proposition 3.9). The latter characterization is based on the description of matrix semigroups with constant spectral radius from [25].
Our next main result, Theorem 3.13, concerns (b) and states that equality is equivalent to the existence of a family of pairwise distinct indices such that . This corresponds to the case where the worst behavior of the system is attained by a periodic with no repetition of indices on a period. This property is reminiscent of the finiteness property, except for the fact that, in the finiteness property, repetition of indices is allowed. We recall that the finiteness property is known to hold only for a proper subclass of -tuples [3, 4], contrarily to what had been earlier conjectured [21]. By applying a standard lifting argument of Markov chains of higher order to Markov chains of order one, we generalize the equivalence stated in Theorem 3.13 by providing the following characterization of the finiteness property: a -tuple satisfies the finiteness property if and only if there exist and a Markov chain of order whose corresponding probabilistic Lyapunov exponent is equal to (see Corollary 4.4). This, in turns, is equivalent to say that the finiteness property holds if and only if the set of maximizing measures contains the measure induced by some Markov chain of arbitrary order.
Acknowledgements. The authors are indebted with D. Chafaï for helpful discussions. They are also grateful to the anonymous reviewers of a preceding version of the manuscript for providing helpful comments and pointing out relevant literature.
2 Definitions, notations, and basic facts
Throughout the paper, and belong to , which is used to denote the set of positive integers. If and are positive integers, denotes the set of integers such that . For , denotes the smallest integer greater than or equal to , and we extend this notation componentwise to vectors and matrices. We use to denote a norm in as well as the corresponding induced norm on the space of matrices with real coefficients. We only consider in this paper norms in obtained as induced norms from . An -tuple is said to be irreducible if the only subspaces of invariant under all the matrices are and .
2.1 Deterministic joint spectral radius
Let be the discrete-time switched system defined in (1.1). The deterministic joint spectral radius of , introduced in [26], is defined by
[TABLE]
Since all norms in induced by norms in are submultiplicative and equivalent to each other, it immediately follows that does not depend on a specific choice of such a norm and that
[TABLE]
Notice that, for every and , we have
[TABLE]
where we use the definition of and the fact that for every square matrix and .
Definition 2.1** (Barabanov norm).**
Let be an -tuple of matrices with real coefficients. A norm on is said to be a Barabanov norm for if the following two conditions hold.
- (a)
For every , . 2. (b)
For every and , there exists such that .
The following basic result on Barabonov norms was proved in [2].
Proposition 2.2**.**
Let be an -tuple of matrices with real coefficients. If is irreducible, then it admits a Barabanov norm.
2.2 Probabilistic joint spectral radius
We now provide a probabilistic counterpart to . For that purpose, we collect some basic notions concerning transition matrices of Markov chains.
Definition 2.3**.**
Let be an matrix with nonnegative coefficients.
- (a)
is said to be stochastic if, for every , . 2. (b)
is said to be strongly connected if it is not similar via a permutation to an upper block triangular matrix. 3. (c)
For and , we say that is a -word if . The integer is called the length of the -word . We say that is a -cycle if . The index is called the starting index of the -cycle . 4. (d)
Let be a vector in with nonnegative coefficients. We say that is a -word (respectively, -cycle) if it is a -word (respectively, -cycle) and . 5. (e)
If is stochastic, a row vector is said to be an invariant probability for if for every , , and .
Remark 2.4**.**
In the context of discrete-time Markov chains in a finite state space with states, the transition matrix is the stochastic matrix where represents the probability to switch from the state to the state . Notice that is strongly connected if and only if its associated oriented graph is strongly connected. In the stochastic processes literature, the strong connectedness of is more often referred to as irreducibility. We choose to stick with the former to avoid ambiguities with the homonymous notion for -tuples of matrices. Notice also that the notions of strong connectedness, -cycles, and -words only depend on the adjacency matrix of the graph , while -cycles and -words depend on and .
Remark 2.5**.**
Recall that, by the Perron–Frobenius Theorem, a stochastic matrix always admits an invariant probability, which is unique and has positive entries if is strongly connected. In the latter case, the definitions of -word and -word coincide, as well as those of -cycle and -cycle.
We have the following classical decomposition result for stochastic matrices [27, §§1.2 and 4.2].
Proposition 2.6**.**
Let be a stochastic matrix. Then, up to a permutation in the set of indices , is given by
[TABLE]
where and, for , is a stochastic and strongly connected matrix.
Moreover, for , let be the unique invariant probability for and denote by the same symbol its canonical extension as a vector in according to the decomposition (2.3). Then every invariant probability can be uniquely decomposed as
[TABLE]
where and .
The next lemma, useful in the proof of some of our results, uses the previous decomposition to obtain that any -cycle has all its indices corresponding to a same diagonal block in (2.3).
Lemma 2.7**.**
Let be a stochastic matrix decomposed according to Proposition 2.6. For , let
[TABLE]
i.e., is the set of indices corresponding to the diagonal block in (2.3). Let be an invariant probability for . Then, for every -cycle , there exists such that are in .
Proof.
Notice that, by (2.4), if . Hence, since , there exists such that . Since , it follows by the block decomposition (2.3) that . The conclusion follows by an immediate inductive argument. ∎
We also introduce the following notation.
Definition 2.8**.**
Let be a stochastic matrix and be an -tuple of matrices with real coefficients.
- (a)
For every -word , we use to denote the matrix product . 2. (b)
For every , let be the matrix semigroup made of all matrix products associated with -cycles with starting index , i.e.,
[TABLE]
We also set
[TABLE]
We finally provide the definition of the probabilistic counterpart of for . Let be a stochastic matrix, be an invariant probability for , and an -tuple in . The probabilistic joint spectral radius is defined as
[TABLE]
where
[TABLE]
As in the deterministic case, does not depend on the specific choice of the norm and we have
[TABLE]
Remark 2.9**.**
The expectation in (2.5) is taken with respect to the random variable . The definition of probabilistic joint spectral radius provided here is a particular instance of a more general and comprehensive formulation based on symbolic dynamics; see, for instance, [12, 11, 24]. Notice also that it follows from the definition of -word that the summation in (2.6) can be restricted to -words of length .
Remark 2.10**.**
When dealing with probabilistic switching phenomena in discrete time, several works, such as [24, 17, 16, 11, 8, 1], deal with the probabilistic Lyapunov exponent defined by
[TABLE]
Our choice to use instead is motivated by the fact that the main goal of our paper is to compare the probabilistic behavior of (1.1) with the worst deterministic behavior provided by the classical joint spectral radius , whose definition in discrete-time (2.1) does not involve taking the logarithm of the norm of the matrix product. Working with also has the additional advantage of being able to handle the case without dealing with the singularity of the logarithm at [math].
Clearly, by Jensen’s inequality, we have , but this inequality may be strict in some cases. Indeed, for , , , , and , we easily compute that and .
We do have equality between and , however, under the assumption that defines an ergodic Markov chain, i.e., for some in the decompositions (2.3) and (2.4) in Proposition 2.6. Indeed, in this case, the main result of [16] implies that
[TABLE]
where denotes the probability measure on associated canonically with the transition matrix and the invariant probability . Using this fact, one deduces that , and, in addition, we also have the equality
[TABLE]
Notice that, in particular, is ergodic when is strongly connected and is its unique invariant measure.
Remark 2.11**.**
The deterministic joint spectral radius provides the worst asymptotic behavior of with respect to . By introducing the probability measure on associated canonically with the transition matrix and the invariant probability , the quantity defined in (2.5) can be interpreted as an asymptotic behavior averaged by . When is ergodic, thanks to (2.8), we have the stronger interpretation of as the -almost sure asymptotic behavior of .
It is immediate to see that, for every as above, we have , and then
[TABLE]
where the first supremum is taken over all invariant probabilities for and the second one over the pairs made of an stochastic matrix and an invariant probability for . We find it useful to introduce the notation
[TABLE]
Remark 2.12**.**
It follows from (2.7) that is upper semicontinuous. Moreover, the set of pairs consisting of an stochastic matrix and an invariant probability for is compact. As a consequence, the suprema in (2.10) can be replaced by maxima.
3 Equality between deterministic and probabilistic joint spectral radii
3.1 Equality between and
The goal of this section is to prove the following result characterizing equality between and .
Theorem 3.1**.**
Let be a stochastic matrix, be an invariant probability measure for , and . Then the following statements are equivalent:
- (a)
. 2. (b)
* for every -cycle .*
The fact that (a) implies (b) follows from the results in [24], as detailed in the following lemma.
Lemma 3.2**.**
Let be a stochastic matrix, be an invariant probability measure for , and . If , then for every -cycle .
Proof.
If , the result follows trivially from (2.2). We then assume ,
we decompose and according to Proposition 2.6, and we use in the sequel the same notations as in its statement. We also let be defined as in the statement of Lemma 2.7. Thanks to (2.4), (2.5), and (2.6), we have
[TABLE]
By (2.9), we have for every and, since for every and , we deduce from (3.1) and the equality that for every such that .
Let be a -cycle and note that, by Lemma 2.7, there exists such that are in , and thus, in particular, is also a -cycle. Moreover, such a necessarily satisfies .
Consider the -periodic switching signal corresponding to , defined by for all integers and . Endow with its usual product topology and denote by the Borel probability measure on corresponding to the Markov chain defined by . Note that, since is a -cycle, for every the set has positive measure, and thus is in the support of . Moreover, using also Remark 2.10, we have , and then is a maximizing measure of in the sense of [24], where the set of maximizing measures of is defined as the set of all Borel probability measures on invariant under the usual time shift and such that the corresponding probabilistic Lyapunov exponent coincides with . Hence belongs to the Mather set of (see [24, Theorem 2.3], where the Mather set of is defined as the union of the supports of all maximizing measures of ), and thus, by [24, Theorem 2.3(3)], we get
[TABLE]
Set . By (2.2), we have that . For every , there exist integers and such that . Since is -periodic, we have that
[TABLE]
If , then the right-hand side of the above inequality tends to [math] as , contradicting (3.2). Hence, we have necessarily . ∎
The proof that (b) implies (a) in Theorem 3.1 is decomposed in three steps. We first establish the result under the extra assumptions that is irreducible and is strongly connected (Lemma 3.3). We then obtain the conclusion under the sole additional assumption that is strongly connected (Lemma 3.5). Finally, we consider the general case in the third step.
Lemma 3.3**.**
Let be a stochastic strongly connected matrix, be irreducible, and be a Barabanov norm for . Then the following statements are equivalent:
- (a)
. 2. (b)
* for every -cycle .* 3. (c)
\mathopen{}\mathclose{{}\left\lVert A_{i_{k}}\dotsm A_{i_{1}}}\right\rVert_{\mathrm{B}}^{1/k}=\rho_{\mathrm{d}}(\mathcal{A})* for every -word .*
Proof.
The fact that (a) implies (b) is a particular case of Lemma 3.2. Moreover, it is immediate that (c) implies (a) thanks to (2.5), (2.6), and Remark 2.9. We are then left to prove that (b) implies (c).
Assume that (b) holds. Fix a -word . Since is strongly connected, there exist and (obtained by connecting to ) such that is a -cycle. Then, by (b),
[TABLE]
Since the spectral radius is a lower bound for any induced norm of a matrix, we obtain that
[TABLE]
Using the fact that is a Barabanov norm, we also have that
[TABLE]
By combining the previous inequalities, it follows that \mathopen{}\mathclose{{}\left\lVert A_{i_{k}}\dotsm A_{i_{1}}}\right\rVert_{\mathrm{B}}=\rho_{\mathrm{d}}(\mathcal{A})^{k}. ∎
Remark 3.4**.**
The proof of Lemma 3.3 only uses that is an extremal norm, i.e., it satisfies (a) in Definition 2.1. The irreducibility assumption on could then be replaced by its nondefectiveness (we refer the reader to [19, Section 2.1.2] for details). However, we prefer to state Lemma 3.3 in terms of irreducibility since this condition is easier to handle: it can be checked more directly and, up to a linear change of coordinates, a reducible can be put into block-triangular form with irreducible diagonal blocks. This block decomposition is a key argument in the proof of Lemma 3.5.
We now consider the case where is not necessarily irreducible. Here, a Barabanov norm for in general does not exist, and hence item (c) from Lemma 3.3 cannot be expected.
Lemma 3.5**.**
Let be a stochastic strongly connected matrix and . Then the following statements are equivalent:
- (a)
. 2. (b)
* for every -cycle .*
Proof.
Before giving the core of the argument, we start with a set of remarks. First, up to a linear change of coordinates, can be presented in block-triangular form as
[TABLE]
with irreducible for every . Remark that, on the one hand, according to [19, Proposition 1.5], we have and, on the other hand, it follows from [17, Theorem 1.1] and the strong connectedness of that . Moreover, for every -cycle , we have
[TABLE]
where the inequality comes from (2.2) and the equality results from the simple fact that the spectral radius of a block-triangular matrix is equal to the maximum of the spectral radii over the diagonal blocks.
Since (a) implies (b) by Lemma 3.2, we are left to prove the converse implication. Assume that (b) holds true. Then (a) holds trivially if . Otherwise, we can assume, with no loss of generality, that up to replacing by . By assumption and (3.3), for every -cycle , there exists such that \rho\mathopen{}\mathclose{{}\left(A_{i_{k}}^{(r)}\dotsm A_{i_{1}}^{(r)}}\right)=1.
We claim that can be chosen independently of the -cycle. We argue by contradiction, i.e., we assume that, for every , there exists a -cycle such that . Let be a -word such that and (with the convention that ). Then, for every ,
[TABLE]
For every , we apply (b) to the above product, and we deduce from (3.3) that there exists such that
[TABLE]
Let be an increasing sequence such that there exists with for every . Since is irreducible, there exists a Barabanov norm for . Then, for every , we have
[TABLE]
where the last inequality follows from the fact that is a Barabanov norm. Since , we have that \mathopen{}\mathclose{{}\left\lVert A^{(\overline{r})}(i^{\overline{r}})^{n_{q}}}\right\rVert_{\overline{r}}\xrightarrow[q\to\infty]{}0, hence the contradiction.
We thus have proved that there exists such that, for every -cycle ,
[TABLE]
On the other hand, by (2.2), we have \rho\mathopen{}\mathclose{{}\left(A_{i_{k}}^{(\overline{r})}\dotsm A_{i_{1}}^{(\overline{r})}}\right)\leq\rho_{\mathrm{d}}(\mathcal{A}^{(\overline{r})}). Since , we deduce that
[TABLE]
for every -cycle . Then, using Lemma 3.3, we obtain that
[TABLE]
and then (a) holds thanks to (2.9). ∎
We can conclude now the proof of Theorem 3.1.
Proof of Theorem 3.1.
Recall that, thanks to Lemma 3.2, we are only left to prove that (b) implies (a). We first decompose and according to Proposition 2.6 and use in the sequel the same notations as in its statement. Thanks to (2.5) and (2.6), we have
[TABLE]
For , let be the ordered -tuple made of the matrices such that . Notice that for every . Using (2.9) and the fact that is made of matrices from , we obtain that, for every ,
[TABLE]
Let be defined for as in Lemma 2.7 and let be such that . Thanks to Lemma 2.7, there exists a -cycle with in . Then, by (2.2), (3.5), and (b), we have
[TABLE]
In particular, and . Lemma 3.5 applied to and yields that . Hence , and, since this holds for every such that , it follows from (3.4) that , as required. ∎
Remark 3.6**.**
Theorem 3.1 and Lemmas 3.3 and 3.5 characterize equality between deterministic and probabilistic joint spectral radii in terms of -cycles and -cycles only, and hence only on \mathopen{}\mathclose{{}\left\lceil P}\right\rceil and (see Remark 2.4). In other words, equality in Theorem 3.1(a) depends only on the graph associated with the Markov chain and the possible choices of initial states, but not on the precise values of the non-zero initial and transition probabilities.
3.2 Geometric characterization of equality between and
It is clear from Theorem 3.1 that equality between and is possible only for restricted choices of . The goal of this section is to provide a more precise description of such choices of using results from [25], where the authors classify matrix semigroups of constant spectral radius. We start with the following proposition.
Proposition 3.7**.**
Let be a stochastic strongly connected matrix and be such that . Assume that there exists such that is irreducible. Then there exists an invertible matrix such that, for every -cycle starting at , either is singular or is orthogonal, where is the length of .
Proof.
We only have to provide an argument if there exists a -cycle starting at such that is invertible. In that case, from (2.2), , where is the length of . From Lemma 3.5, the set
[TABLE]
is a matrix semigroup with constant spectral radius. Since, moreover, this semigroup is also irreducible, the conclusion follows from [25, Theorem 2]. ∎
Remark 3.8**.**
As remarked in [25], the problem of classifying matrix semigroups with constant spectral radius is highly nontrivial when the semigroup contains singular matrices. By using additional results from [25], we may obtain, under the assumptions of Proposition 3.7, properties on that are weaker than orthogonality but apply to all matrices , and not only nonsingular ones. We refer the interested reader to [25, Theorem 3 and Corollary 6].
A limitation of Proposition 3.7 lies in the fact that, in general, given a stochastic and strongly connected matrix , it is a nontrivial task to verify the existence of an index such that is irreducible, even if is itself irreducible. However, this is true if one assumes in addition that contains only invertible matrices and that all diagonal elements of are positive, in which case we have the following proposition.
Proposition 3.9**.**
Let be a stochastic strongly connected matrix with positive diagonal entries and be irreducible with invertible. Then, for every , is irreducible. Moreover, if and only if there exists an invertible matrix such that, for every , is orthogonal.
Proof.
Let and consider the group generated by . We claim that . Indeed, since is strongly connected, there exists a -cycle starting at such that . Since , then and
[TABLE]
Similarly, since , then and
[TABLE]
An inductive reasoning based on the identity
[TABLE]
yields that for , as required.
To prove that is irreducible for every , assume by contradiction that there exists such that is reducible. Then the group is also reducible, however, since it contains , this contradicts the irreducibility of .
Since are invertible matrices, is positive and, with no loss of generality, we can assume that . If , then, applying Proposition 3.7 to , there exists a basis in which every is orthogonal. Hence, in this same basis, is also made of orthogonal matrices, yielding the conclusion. On the other hand, if there exists a basis in which are orthogonal, then for every -word , and the conclusion follows by Lemma 3.5. ∎
Remark 3.10**.**
Notice that, to obtain the second part of the conclusion of Proposition 3.9, it is enough that there exists such that is irreducible and the generated group contains all matrices . The assumption that has positive diagonal entries is used to guarantee the latter, and therefore it can be replaced by any other condition ensuring that belong to for some . For instance, assume that and for . For every -cycle with and for every , we can proceed as in the proof of the proposition to obtain that for every and use the identity
[TABLE]
to obtain that . Since is strongly connected, every matrix , , belongs to such a -cycle, hence the conclusion.
At the light of Remark 3.10, we may wonder whether the second part of the conclusion of Proposition 3.9 can be obtained under even weaker assumptions on the matrix , allowing for instance the presence of more than one diagonal element equal to zero, but requiring at least one non-zero element in the diagonal. The example given below shows that this is not possible.
Example 3.11**.**
Consider the case , ,
[TABLE]
Note that, in this case,
[TABLE]
The matrix is stochastic, strongly connected, and its unique invariant probability is . Moreover, is irreducible and are invertible. Denoting by the Euclidean norm in , we have , yielding that , and we easily check that by considering given by for every . Moreover, for any -word , there exist an integer and such that . Setting in the case and in the case , it is immediate to verify that , yielding that for every -word , and thus . However, is not similar to an orthogonal matrix, and hence the second conclusion of Proposition 3.9 does not hold. Notice moreover that, in this case, , , and , and thus is reducible for every .
Remark 3.12**.**
We now provide a description of all cases where equality holds between and under the assumption that is irreducible and made of two invertible matrices.
- (a)
If for with , by Remark 3.10, equality occurs if and only if there exists an invertible matrix such that and are orthogonal. 2. (b)
If , equality occurs if and only if . 3. (c)
If , equality occurs if and only if whenever , . 4. (d)
If for some , then equality is equivalent to . 5. (e)
If for some , then equality is equivalent to .
3.3 Equality between and
Based on the results obtained previously, we can now address the issue of characterizing the equality between and . Recall that the latter is defined as the maximum of over all pairs .
Theorem 3.13**.**
Let . Then the following statements are equivalent:
- (a)
. 2. (b)
There exist pairwise distinct such that
[TABLE]
Proof.
We start by proving that (a) implies (b). Recall that, by Remark 2.12, there exist a stochastic matrix and an invariant probability for such that . Using (a), we deduce that . It is clear that there exists a -cycle such that are pairwise distinct, and the conclusion follows from Theorem 3.1.
To prove that (b) implies (a), let be a stochastic matrix with for and . Set as the probability vector such that for . Then is invariant under and the set of -cycles is made of the shifts of and their powers. Moreover, for every such -cycle , we have
[TABLE]
Indeed, this follows from the fact that for every . Then Theorem 3.1(b) holds, hence , and the conclusion follows from (2.9). ∎
Remark 3.14**.**
It follows from (2.9) that, if , the ratio belongs to and Theorem 3.13 addresses the case where it is equal to . We provide next an example where it is equal to [math], proving that it is not possible to find a uniform positive lower bound for this ratio. Indeed, considering
[TABLE]
an immediate computation yields
[TABLE]
and . Let denote the matrix norm induced by the norm in . Define
[TABLE]
and, for , let be the set made of the three words of length obtained by taking the first entries of each element of . By an easy computation, we get that, for every and ,
[TABLE]
We then obtain that . On the other hand, for every stochastic matrix and every invariant probability vector for , we have . Hence
[TABLE]
proving that . Then .
4 Markov chains of higher order
In this section, we extend the previous results to probability measures on obtained from discrete-time shift-invariant Markov chains of order . Any such probability measure can be described by a pair of tensors of orders and , respectively, where the non-negative scalar represents the probability to switch from the state to the state when the previous states of the chain are , and represents the probability of the first states being . In particular, for every , we have that
[TABLE]
and satisfies
[TABLE]
We refer to such and as a probability tensor of order and a stochastic tensor of order , respectively. The shift-invariance property now reads
[TABLE]
and any probability tensor satisfying the above shift-invariant property is said to be invariant under . The probabilistic joint spectral radius associated with is still defined by (2.5), where the expectation corresponds to the probability measure on defined above.
Markov chains of order can be canonically transformed into Markov chains of order by considering as state space the set and defining a pair from by and
[TABLE]
for every and in . It is immediate from the definitions and the shift-invariance property that
[TABLE]
where and for every .
For every positive integer , we say that is a -cycle if
[TABLE]
is a -cycle, where is extended to by -periodicity.
Applying Theorem 3.1 to and , we deduce at once the following.
Theorem 4.1**.**
Let be a positive integer, be a stochastic tensor of order , be an invariant probability tensor for , and . Then the following statements are equivalent:
- (a)
. 2. (b)
* for every -cycle .*
Recall that (1.1) is said to be periodically stable if for all periodic signals . It has been shown in [12] that this property implies for every strongly connected stochastic matrix , where is the unique invariant probability vector for . A slightly improved version of this result can be obtained as a consequence of Theorem 4.1 as stated in the following corollary.
Corollary 4.2**.**
*Assume that (1.1) is periodically stable. Then, for every , every stochastic tensor of order , and every invariant probability tensor for , we have . *
Proof.
By the Joint Spectral Radius Theorem (see, e.g., [19, Theorem 2.3]), periodic stability implies that . In the case , the conclusion follows immediately. Otherwise, when , the periodic stability assumption implies that assertion (b) from Theorem 4.1 does not hold, which proves that , yielding the conclusion. ∎
Similarly as for Theorem 4.1, we deduce by applying Theorem 3.13 to and the following.
Theorem 4.3**.**
Let be a positive integer and . Then the following statements are equivalent:
- (a)
, where is the supremum of over all pairs with a stochastic tensor of order and an invariant probability tensor for . 2. (b)
There exist such that
[TABLE]
and whenever with , where is extended to by -periodicity.
As a consequence of Theorem 4.3, we have the following corollary. To state it, recall that is said to have the finiteness property if there exist such that .
Corollary 4.4**.**
Let . Then has the finiteness property if and only if there exists such that .
Proof.
If there exists such that , then the finiteness property of follows immediately from Theorem 4.3. Assume now that has the finiteness property and let be such that . Extend over by -periodicity and let be the minimal period of . Without loss of generality, we can assume that . We claim that property (b) of Theorem 4.3 holds with . Indeed, let be such that and assume, to obtain a contradiction, that . Without loss of generality, . Set and notice that and for every . Since is -periodic, the previous equality holds for every , proving that is -periodic, contradicting the minimality of as period of . Hence property (a) of Theorem 4.3 holds, as required. ∎
Remark 4.5**.**
Given , , and a word , set and let be the length of . Notice that, by proceeding similarly to the second part of the proof of Theorem 3.13, we can construct, for every word of finite length, a Markov chain of order with tensors , such that . We deduce that
[TABLE]
where the equality is a consequence of the Joint Spectral Radius Theorem (see, e.g., [19]). Since, moreover, for every , it follows that .
A further characterization of the equivalence in Corollary 4.4 can then be stated as follows: an -tuple of matrices satisfies the finiteness property if and only if
[TABLE]
is attained at some , where the supremum is taken over all with , a stochastic tensor of order , and an invariant probability tensor for .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Arnold. Random dynamical systems . Springer Monographs in Mathematics. Springer-Verlag, Berlin, 1998.
- 2[2] N. E. Barabanov. Lyapunov indicator of discrete inclusions. I–III. Autom. Remote Control , 49:152–157, 283–287, 558–565, 1988.
- 3[3] V. D. Blondel, J. Theys, and A. A. Vladimirov. An elementary counterexample to the finiteness conjecture. SIAM J. Matrix Anal. Appl. , 24(4):963–970, 2003.
- 4[4] T. Bousch and J. Mairesse. Asymptotic height optimization for topical IFS, Tetris heaps, and the finiteness conjecture. J. Amer. Math. Soc. , 15(1):77–111, 2002.
- 5[5] F. Colonius and W. Kliemann. Dynamical systems and linear algebra , volume 158 of Graduate Studies in Mathematics . American Mathematical Society, Providence, RI, 2014.
- 6[6] O. L. V. Costa, M. D. Fragoso, and R. P. Marques. Discrete-time Markov jump linear systems . Probability and its Applications (New York). Springer-Verlag London, Ltd., London, 2005.
- 7[7] O. L. V. Costa, M. D. Fragoso, and M. G. Todorov. Continuous-time Markov jump linear systems . Probability and its Applications (New York). Springer, Heidelberg, 2013.
- 8[8] A. Crisanti, G. Paladin, and A. Vulpiani. Products of random matrices in statistical physics , volume 104 of Springer Series in Solid-State Sciences . Springer-Verlag, Berlin, 1993. With a foreword by Giorgio Parisi.
