Joint spectrum and large deviation principle for random matrix products
Cagri Sert

TL;DR
This paper explores the asymptotic behavior of random matrix products and introduces a limit set for powers of subsets in semisimple Lie groups, extending classical large deviation principles to matrix norms.
Contribution
It establishes a large deviation principle for norms of random matrix products and defines a new limit set describing the asymptotic shape of powers in semisimple Lie groups.
Findings
Large deviation principle for random matrix products.
Introduction of a limit set for powers in semisimple Lie groups.
Applications to the study of large deviations in linear groups.
Abstract
The aim of this note is to announce some results about the probabilistic and deterministic asymptotic properties of linear groups. The first one is the analogue, for norms of random matrix products, of the classical theorem of Cramer on large deviation principles (LDP) for sums of iid real random variables. In the second result, we introduce a limit set describing the asymptotic shape of the powers of a subset S of a semisimple linear Lie group G (e.g. SL(d;R)). This limit set has applications, among others, in the study of large deviations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Joint spectrum and large deviation principle for random matrix products
Cagri SERT
Abstract
The aim of this note is to announce some results about the probabilistic and deterministic asymptotic properties of linear groups. The first one is the analogue, for norms of random matrix products, of the classical theorem of Cramér on large deviation principles (LDP) for sums of iid real random variables. In the second result, we introduce a limit set describing the asymptotic shape of the powers of a subset of a semisimple linear Lie group (e.g. SL). This limit set has applications, among others, in the study of large deviations.
1 Large deviation principle for random matrix products
1.1 Introduction
Let be a connected semisimple linear real algebraic group (e.g. SL). A random walk on is a random process where ’s are independent and identically distributed (iid) -valued random variables. Starting from Bellman [8], Furstenberg-Kesten [14] and Furstenberg [13], an important aim in the study of these non-commutative random walks was to establish the analogues of the classical limit theorems existing for the iid real random variables. More precisely, one is interested in studying the probabilistic limiting behavior of the (logarithms of the) norms of random matrix products. In fact, we will consider the slightly more general multi-norm given by the Cartan projection that we briefly describe before going on: let be the Lie algebra of the group , a Cartan subalgebra of and a chosen Weyl chamber. Let be a maximal compact subgroup of for which we have the Cartan decomposition . Then the map is well-defined by the following equality and is called the Cartan projection: for all , for some . As an example, in the case of , the Cartan projection of a matrix writes as , where ’s are considered with their canonical Euclidean structures and ’s denote the associated operator norms. The components of are the logarithms of the singular values of .
The first limit theorem that was proven for random matrix products was the analogue of the law of large numbers. Stating it in our setting, Furstenberg-Kesten’s result [14] reads: if is a probability measure on with a finite first moment (i.e. for some norm on ), then the -random walk (i.e. ’s are iid of law ) satisfies
[TABLE]
where can be defined by this and called the Lyapunov vector of . Nowadays, this result is a rather straightforward corollary of Kingman’s subadditive ergodic theorem. A second important limit theorem that was established in increasing generality by Tutubalin [26], Le Page [19], Goldsheid-Guivarc’h [16], and Benoist-Quint [6] [5] is the central limit theorem (CLT). Benoist-Quint’s CLT reads: if is a probability measure on of finite second order moment and such that the support of generates a Zariski-dense semigroup in , then converges in distribution to a non- degenerate Gaussian law on . A feature of this result is the Zariski density assumption which also appears in our result below. We note that the fact that the support of the probability measure generates a Zariski-dense semigroup can be read as: any polynomial that vanishes on also vanishes on (we recall that in , a subset is Zariski dense if and only if it is infinite). Some other limit theorems whose analogues have been obtained are the law of iterated logarithm and local limit theorems for which we refer the reader to the nice books of Bougerol-Lacroix [11] and more recently Benoist-Quint [5].
An essential and, up to our work, a rather incomplete aspect of these non-commutative limit theorems is concerned with large deviations. The main result in this direction is that of Le Page [19], (see also Bougerol [11]) and its extension by Benoist-Quint [5], stating the exponential decay of large deviation probabilities off the Lyapunov vector. Before stating this result, recall that a probability measure on is said to have a finite exponential moment, if there exists such that we have . We have
Theorem 1.1** (Le Page [19], Benoist-Quint [5]).**
Let be as before, be a probability measure of finite exponential moment on whose support generates a Zariski-dense semigroup in . Let denote the step of the -random walk on . Then, for all , we have .
1.2 Statement of main result
In our first main result, under the usual Zariski density assumption, we prove the matrix analogue of a classical theorem (see below) about large deviations for iid real random variables. Let be a topological space and be a - algebra on .
Definition 1.2**.**
A sequence of -valued random variables is said to satisfy a large deviation principle (LDP) with rate function , if for every measurable subset of , we have
[TABLE]
where, denotes the interior and the closure of .
With this definition, the classical Cramér-Chernoff theorem says that the sequence of averages of real iid random variables of finite exponential moment satisfies an LDP with a proper convex rate function , given by the convex conjugate (Fenchel-Legendre transform) of the logarithmic moment generating function of ’s. Our first main result reads
Theorem 1.3**.**
Let be a connected semisimple linear real algebraic group and be a probability measure of finite exponential moment on , whose support generates a Zariski dense sub- semigroup of . Then, the sequence random variables satisfies an LDP with a proper convex rate function assuming a unique zero on the Lyapunov vector of .
Remark 1.4**.**
*1. Without any moment assumptions on , we also obtain a weaker result which is an analogue of a result of Bahadur [7] for iid real random variables.
-
Under a stronger exponential moment condition (sometimes called finite super-exponential moment), by exploiting convexity of , we are able to identify the rate function with the convex conjugate of a limiting logarithmic moment generating function of the random variables .
-
It follows by convexity of that the effective support of the rate function is a convex subset of . We also show that this set depends only on the support of (i.e. not on the particular mass distribution on ) and we identify with a Hausdorff limiting set of a deterministic construction, namely the joint spectrum of (see Theorem 2.3 below).
-
We conjecture that a similar LDP holds for the Jordan projection in place of (see Def. in paragraph 2.1).*
1.3 The idea of the proof
The fact that follows from the definition of and . That this zero is unique is merely the translation of the deep result of Le Page (Theorem 1.1). That is proper is a rather straightforward consequence of Chernoff’s estimates and that one can identify as a Fenchel-Legendre transform (2. of Remark 1.4) follows from its convexity using standard techniques, namely the Varadhan’s integral lemma (see Theorem 4.3.1. in [12]) and elementary properties of convex conjugation of functions. Moreover, the convexity of the rate function is proven using ideas similar to those used to prove its existence. Therefore, in this note, we will focus on the proof of the existence of an LDP. For simplicity, we shall assume that the measure is compactly supported. We make use of the following general fact:
Theorem 1.5** (see Theorem 4.1.11 in [12]).**
Let be a topological space endowed with its Borel -algebra , and be a sequence of - valued random variables taking values in a compact subset of . Let be a base of open sets for the topology of . For each , define:
[TABLE]
Suppose that for all , we have . Then, the sequence satisfies an LDP with rate function given by .
In view of this theorem, to prove the existence of the LDP in Theorem 1.3, we have to show that the equality is satisfied. A first remark is if was an additive mapping (i.e. ), this would follow rather easily from the independence of random walk increments and uniform continuity of . A further important remark is that in fact a weaker form of additivity (i.e. is uniformly bounded for all ) is sufficient to insure this equality. A key result of Benoist [2], [3] shows that this weak form of additivity is satisfied in a given -Schottky semigroup. This already finishes the proof in the case when is supported on such a semigroup. For the general case, we need an argument showing that we can restrict the random walk on Schottky semigroups with no loss in the exponential rate of decay probabilities (as in Lemma 1.6). This is done by using a result of Abels-Margulis-Soifer [1] about proximal elements in Zariski dense semigroups (which in turn uses a result of Benoist-Labourie [9] and Prasad [20]) together with the uniform continuity of the Cartan projection and then a simple partitioning and pigeonhole argument. A key step in this proof is the following Lemma 1.6.
Recall that an element in is called -proximal if it has a unique eigenvalue of maximal modulus and the ratio of its first two singular values is at least . It is called -proximal if additionally the top eigenvector is at least away from projective hyperplane spanned by the other generalized eigenspaces. An element in is called -loxodromic if it is -proximal in each of the rk fundamental proximal representations of G.
Abels-Margulis-Soifer show that for a Zariski dense semigroup in , there exists such that for every , one can find a finite subset with the property that for all , there exists such that is -loxodromic. We denote by the semigroup generated by the support of .
Lemma 1.6**.**
Let . There exist a compact set , a natural number , and a constant such that for all and subset , there exists a natural number with such that we have
[TABLE]
The next step in the proof consists in observing that one can further restrict the random walk to an -Schottky semigroup, again losing only a uniformly bounded proportion of the probability (as in Lemma 1.6). By doing so, we reduce the situation to a random walk on a semigroup on which the Cartan projection is almost additive and hence we can conclude as we sketched in the beginning of the argument.
Remark 1.7**.**
In a further work in preparation [25], using essentially the same method as in the proof of Theorem 1.3, we show that the LDP holds for the average word length of random walks on Gromov hyperbolic groups.
2 Joint spectrum
2.1 Introduction
In this second part of this note, we define the notion of joint spectrum of a bounded subset of a semisimple Lie group. We then relate this notion to the effective support of the rate function from Theorem 1.3. Recall the definition of the Jordan projection : if is the Jordan decomposition of with elliptic, hyperbolic and is unipotent, then is defined as . Let be a bounded subset of and denote its power. We are interested in the following questions: do the sequences of bounded subsets of , and have a limit in the Hausdorff topology ? If yes, are the limits the same and can one describe these limit sets ?
2.2 Statement of the main results
Regarding the above questions, we show the following:
Theorem 2.1**.**
*Let be a connected semisimple linear real algebraic group and a bounded subset of generating a Zariski dense sub-semigroup. Then,
- The following Hausdorff limits exist, and we have the equality:*
[TABLE]
*This common limit will be denoted as and called the joint spectrum of .
- is a compact convex subset of with non-empty interior.*
Remark 2.2**.**
*1. It is not hard to see that if two subsets and of generate the same Zariski dense semigroup in , then the projective images of and are the same. Therefore the corresponding cone in only depends on . It turns out that this is precisely the Benoist limit cone of [3].
- For a Banach algebra and a bounded subset of , denote the joint spectral radius of by . When is finite dimensional, this does not depend on the particular norm on . Let now be an irreducible rational representation of and let denote the highest weight of . Then we have the equality . This allows us to derive a multi-dimensional generalization of Berger-Wang identity [10].*
We now relate the joint spectrum of the support of a probability measure with the effective support of the rate function for the LDP of , where denotes as usual the -random walk on .
Theorem 2.3**.**
*Let and be as in Theorem 1.3, and suppose moreover that the support of is bounded. Let be the rate function given by Theorem 1.3 and be the effective support of . We then have
-
and . If is moreover finite, then .
-
The Lyapunov vector of belongs to the interior of .*
For the point 2. of this theorem, we note that for a probability measure as in Theorem 1.3, the fact that belongs to the interior of was obtained by Guivarc’h-Raugi [18] and Goldsheid-Margulis [15]. Our result gives a more precise location for in case is, moreover, boundedly supported. Our method extends also to the case where only has a finite exponential moment to show that belongs to the Benoist cone of the semigroup generated by the support of .
The tools that go into the proof of Theorem 2.1 and Theorem 2.3 are mostly similar to those used in the proof of Theorem 1.3. We use an additional tool to prove the fact that the joint spectrum is of non-empty interior: while this result can be directly deduced from the properties of Benoist cone proved in [3], we adopt an indirect approach and use the central limit theorem of Goldsheid-Guivarc’h [16] and Guivarc’h [17] that we combine with Abels- Margulis-Soifer result [1] and Benoist’s estimates [2]. This in turn allows us to derive point 2. of the previous theorem and hence also part of the point 2. of Theorem 2.1.
Remark 2.4**.**
The notion of joint spectrum plays also an important role in another work in preparation [23], where we study a new exponential counting function for a finite subset in a group as before.
Acknowledgements
These results are part of author’s doctoral thesis realized under the supervision of Emmanuel Breuillard in Université Paris-Sud during 2013-2016. The author would like to take the opportunity to thank him for numerous discussions and for suggesting the notion of joint spectrum. The author also thanks to WWU Münster where part of this work was conducted.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Abels, H., Margulis, G.A. and Soifer, G.A., 1995. Semigroups containing proximal linear maps. Israel journal of mathematics, 91(1-3), pp.1-30.
- 2[2] Benoist, Y., 1996. Actions propres sur les espaces homogènes réductifs. Annals of mathematics, pp.315-347.
- 3[3] Benoist, Y., 1997. Propriétés asymptotiques des groupes linéaires. Geometric & Functional Analysis GAFA, 7(1), pp.1-47.
- 4[4] Benoist, Y., 2000, Propriétés asymptotiques des groupes linéaires II. Advanced Studies Pure Math. 26 (2000) pp.33-48.
- 5[5] Benoist, Y. and Quint, J.F., 2013. Random walks on reductive groups. to appear.
- 6[6] Benoist, Y. and Quint, J.F., 2016. Central limit theorem for linear groups. Ann. Probab, 44(2), pp.1308-1340.
- 7[7] Bahadur, R.R., 1971. Some limit theorems in statistics (No. 04; QA 276. A 1, B 3 1971.). Philadelphia: Society for industrial and applied mathematics.
- 8[8] Bellman, R., 1954. Limit theorems for non-commutative operations. I. Duke Mathematical Journal, 21(3), pp.491-500.
