Hilbert Space Lyapunov Exponent stability

Gary Froyland; Cecilia Gonz\'alez-Tokman; Anthony Quas

arXiv:1703.04841·math.DS·March 16, 2017

Hilbert Space Lyapunov Exponent stability

Gary Froyland, Cecilia Gonz\'alez-Tokman, Anthony Quas

PDF

Open Access

TL;DR

This paper investigates how small Gaussian noise affects the stability of Lyapunov exponents and Oseledets spaces in cocycles of compact operators on infinite-dimensional Hilbert spaces, establishing convergence results.

Contribution

It provides the first known results on the stability of Lyapunov exponents and Oseledets spaces under noise for infinite-dimensional operator cocycles.

Findings

01

Lyapunov exponents converge as noise diminishes

02

Oseledets spaces converge in probability

03

Addresses challenges unique to infinite-dimensional spaces

Abstract

We study cocycles of compact operators acting on a separable Hilbert space, and investigate the stability of the Lyapunov exponents and Oseledets spaces when the operators are subjected to additive Gaussian noise. We show that as the noise is shrunk to 0, the Lyapunov exponents of the perturbed cocycle converge to those of the unperturbed cocycle; and the Oseledets spaces converge in probability to those of the unperturbed cocycle. This is, to our knowledge, the first result of this type with cocycles taking values in operators on infinite-dimensional spaces. The infinite dimensionality gives rise to a number of substantial difficulties that are not present in the finite-dimensional case.

Equations263

A^{ϵ}_{\overset{ω}{ˉ}}^{(n)} = (A_{σ^{n - 1} ω} + ϵ Δ_{n - 1}) \dots (A_{ω} + ϵ Δ_{0}) .

A^{ϵ}_{\overset{ω}{ˉ}}^{(n)} = (A_{σ^{n - 1} ω} + ϵ Δ_{n - 1}) \dots (A_{ω} + ϵ Δ_{0}) .

∠ (U, V) = max (u \in U \cap S max v \in V \cap S min ∥ u - v ∥, v \in V \cap S max u \in U \cap S min ∥ u - v ∥),

∠ (U, V) = max (u \in U \cap S max v \in V \cap S min ∥ u - v ∥, v \in V \cap S max u \in U \cap S min ∥ u - v ∥),

⊥ (U, V) = \frac{1}{2} u \in U \cap S, v \in V \cap S min ∥ u - v ∥.

⊥ (U, V) = \frac{1}{2} u \in U \cap S, v \in V \cap S min ∥ u - v ∥.

\tilde{Ξ}_{k} (A) = E Ξ_{k} (Π_{k} Δ A Δ^{'} Π_{k}),

\tilde{Ξ}_{k} (A) = E Ξ_{k} (Π_{k} Δ A Δ^{'} Π_{k}),

u \in U \cap S max v \in V \cap S min ∥ u - v ∥ = v \in V \cap S max u \in U \cap S min ∥ u - v ∥.

u \in U \cap S max v \in V \cap S min ∥ u - v ∥ = v \in V \cap S max u \in U \cap S min ∥ u - v ∥.

E_{k} (Π) = - Ξ_{k} (Π \circ D_{3}) = - i = 1 \sum k lo g ∥ D_{3} v_{i} ∥,

E_{k} (Π) = - Ξ_{k} (Π \circ D_{3}) = - i = 1 \sum k lo g ∥ D_{3} v_{i} ∥,

\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}+(\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi^{\prime}))\big{|}\leq C.

\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}+(\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi^{\prime}))\big{|}\leq C.

Cov (M_{ij}, M_{i^{'} j^{'}})

Cov (M_{ij}, M_{i^{'} j^{'}})

= l, m \sum 3^{- 2 (l + m)} (u_{i})_{l} (u_{i^{'}})_{l} (v_{j})_{m} (v_{j^{'}})_{m}

= ⟨ D_{3} u_{i}, D_{3} u_{i^{'}} ⟩ ⟨ D_{3} v_{j}, D_{3} v_{j^{'}} ⟩,

\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}+\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi^{\prime})\big{|}\leq C,

\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}+\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi^{\prime})\big{|}\leq C,

\Big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})|\Delta\in Q\big{)}-\big{(}\operatorname{\mathbb{E}}\Xi_{k}(\Pi\Delta\Pi_{k})+\operatorname{\mathbb{E}}\Xi_{k}(\Pi_{k}\Delta\Pi^{\prime})\big{)}\Big{|}<C

\Big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})|\Delta\in Q\big{)}-\big{(}\operatorname{\mathbb{E}}\Xi_{k}(\Pi\Delta\Pi_{k})+\operatorname{\mathbb{E}}\Xi_{k}(\Pi_{k}\Delta\Pi^{\prime})\big{)}\Big{|}<C

\displaystyle\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}+\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi^{\prime})\big{|}\leq C;

\displaystyle\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}+\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi^{\prime})\big{|}\leq C;

\displaystyle\big{|}\operatorname{\mathbb{E}}\Xi_{k}(\Pi\Delta\Pi_{k})+\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi_{k})\big{|}\leq C;

\displaystyle\big{|}\operatorname{\mathbb{E}}\Xi_{k}(\Pi_{k}\Delta\Pi^{\prime})+\operatorname{\mathcal{E}}_{k}(\Pi)+\operatorname{\mathcal{E}}_{k}(\Pi_{k})\big{|}\leq C,

\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}-\big{(}\operatorname{\mathbb{E}}\Xi_{k}(\Pi\Delta\Pi_{k})+\operatorname{\mathbb{E}}\Xi_{k}(\Pi_{k}\Delta\Pi^{\prime})\big{)}\Big{|}\leq K,

\big{|}\operatorname{\mathbb{E}}\big{(}\Xi_{k}(\Pi\Delta\Pi^{\prime})\big{|}\Delta\in Q\big{)}-\big{(}\operatorname{\mathbb{E}}\Xi_{k}(\Pi\Delta\Pi_{k})+\operatorname{\mathbb{E}}\Xi_{k}(\Pi_{k}\Delta\Pi^{\prime})\big{)}\Big{|}\leq K,

∥ A^{ϵ}_{\overset{ω}{ˉ}}^{(N)} - A_{ω}^{(N)} ∥_{op} \leq 1,

∥ A^{ϵ}_{\overset{ω}{ˉ}}^{(N)} - A_{ω}^{(N)} ∥_{op} \leq 1,

∥ A^{ϵ}_{\overset{ω}{ˉ}}^{(N)} - A_{ω}^{(N)} ∥_{op}

∥ A^{ϵ}_{\overset{ω}{ˉ}}^{(N)} - A_{ω}^{(N)} ∥_{op}

\leq 2 N ϵ exp (g (ω) + \dots + g (σ^{N - 1} ω)) .

Ξ_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{((l - j) N)}) \geq Ξ_{k} (A_{ω}^{((l - j) N)}) + 2 (l - j) k lo g δ .

Ξ_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{((l - j) N)}) \geq Ξ_{k} (A_{ω}^{((l - j) N)}) + 2 (l - j) k lo g δ .

Ξ_{k} (\tilde{B}_{l - 1} \dots \tilde{B}_{j}) \geq 2 (l - j) k lo g δ + i = j \sum l - 1 Ξ_{k} (B_{i}) \geq 2 (l - j) k lo g δ + Ξ_{k} (B_{l - 1} \dots B_{j}),

Ξ_{k} (\tilde{B}_{l - 1} \dots \tilde{B}_{j}) \geq 2 (l - j) k lo g δ + i = j \sum l - 1 Ξ_{k} (B_{i}) \geq 2 (l - j) k lo g δ + Ξ_{k} (B_{l - 1} \dots B_{j}),

lo g ∣ det (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)} ∣_{V}) ∣

lo g ∣ det (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)} ∣_{V}) ∣

= Ξ_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)} Π_{F_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)})^{⊥}} Π_{V})

= Ξ_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)} Π_{F_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)})^{⊥}}) + Ξ_{k} (Π_{F_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)})^{⊥}} Π_{V})

\geq Ξ_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)}) + k lo g ⊥ (F_{k} (A^{ϵ}_{\overset{ω}{ˉ}}^{(n N)}), V),

\operatorname{\mathbb{E}}\Big{(}\big{(}\Xi_{k}(\Pi(A+T)\Pi^{\prime})-\Xi_{k}(\Pi A\Pi^{\prime})\big{)}^{-}\Big{|}T\in Q\Big{)}\geq-C,

\operatorname{\mathbb{E}}\Big{(}\big{(}\Xi_{k}(\Pi(A+T)\Pi^{\prime})-\Xi_{k}(\Pi A\Pi^{\prime})\big{)}^{-}\Big{|}T\in Q\Big{)}\geq-C,

\operatorname{\mathbb{E}}\Big{(}\big{(}\log|\det(\tilde{\Pi}(A+T)\tilde{\Pi}^{\prime})|-\log|\det(\tilde{\Pi}A\tilde{\Pi}^{\prime})|\big{)}^{-}\;\Big{|}T\in Q\Big{)}\geq-C.

\operatorname{\mathbb{E}}\Big{(}\big{(}\log|\det(\tilde{\Pi}(A+T)\tilde{\Pi}^{\prime})|-\log|\det(\tilde{\Pi}A\tilde{\Pi}^{\prime})|\big{)}^{-}\;\Big{|}T\in Q\Big{)}\geq-C.

2 C_{d} \int_{S} d μ (M) \int_{0}^{\infty} lo g^{-} ∣ det (I + r M) ∣ r^{d - 1} e^{- r^{2} /2} d r .

2 C_{d} \int_{S} d μ (M) \int_{0}^{\infty} lo g^{-} ∣ det (I + r M) ∣ r^{d - 1} e^{- r^{2} /2} d r .

G (d, M) = \int_{0}^{\infty} lo g^{-} ∣ det (I + r M) ∣ r^{d - 1} e^{- r^{2} /2} d r

G (d, M) = \int_{0}^{\infty} lo g^{-} ∣ det (I + r M) ∣ r^{d - 1} e^{- r^{2} /2} d r

F (d, b)

F (d, b)

F (d, b)

F (d, b)

= \frac{1}{b ^{d}} \int_{0}^{2} lo g ∣1 - r ∣ r^{d - 1} e^{- r^{2} / (2 b^{2})} d r .

F (d, b)

F (d, b)

\geq \frac{1}{b ^{d}} \int_{0}^{b^{d / (1 + d)}} lo g ∣1 - r ∣ r^{d - 1} d r + \frac{1}{b ^{d}} \int_{b^{d / (1 + d)}}^{2} lo g ∣1 - r ∣ r^{d - 1} e^{- r^{2} / (2 b^{2})} d r

\geq - 2 \frac{1}{b ^{d}} \int_{0}^{b^{d / (1 + d)}} r^{d} d r + (2/ b)^{d} exp (- 1/ (2 b^{α})) \int_{0}^{2} lo g ∣1 - r ∣ d r

\geq - 2/ (d + 1) - 2^{d + 1} exp (- 1/ (2 b^{α})) / b^{d}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum chaos and dynamical systems · Mathematical Dynamics and Fractals · Stability and Controllability of Differential Equations

Full text

Hilbert Space Lyapunov Exponent stability

Gary Froyland

[email protected]

,

Cecilia González-Tokman

[email protected]

and

Anthony Quas

[email protected]

Abstract.

We study cocycles of compact operators acting on a separable Hilbert space, and investigate the stability of the Lyapunov exponents and Oseledets spaces when the operators are subjected to additive Gaussian noise. We show that as the noise is shrunk to 0, the Lyapunov exponents of the perturbed cocycle converge to those of the unperturbed cocycle; and the Oseledets spaces converge in probability to those of the unperturbed cocycle. This is, to our knowledge, the first result of this type with cocycles taking values in operators on infinite-dimensional spaces. The infinite dimensionality gives rise to a number of substantial difficulties that are not present in the finite-dimensional case.

1. Introduction

A question of paramount importance in applied mathematics is: How to tell if the conclusions derived from a model indeed capture relevant features of an underlying system? Stability results address this question by giving conditions under which small changes in a model entail small changes in the outcomes of the analysis.

In the last decade, multiplicative ergodic theory has been developed in the so-called semi-invertible setting (that is the setting in which the underlying base dynamics are assumed to be invertible, but no invertibility assumptions are made on the matrices) [12, 13, 16, 17] with the aim of providing a useful mathematical tool to analyse transport features of complex real world systems, such as geophysical flows. This approach has been implemented to find coherent structures in fluid flow [14], and a finite-time version of this theory has been used to detect atmospheric vortices and oceanic eddies in geophysical flows [15, 11].

However, from the mathematical perspective, the following questions remain completely unsolved:

•

Model or data errors: Do these structures – obtained using either models of geophysical flows or observational data, both of which contain errors – correspond to real features of the underlying flows?

•

Numerical errors: Are these structures robust to numerical errors in the numerical schemes applied to the models or observational data in order to extract the ergodic-theoretic objects?

The aim of this work is to provide an initial step in establishing conditions for the stability of Lyapunov exponents and so-called Oseledets spaces, the essential components underlying multiplicative ergodic theory, in an infinite-dimensional context. The infinite dimensionality aspect is crucial to be able to eventually encompass the setting of transfer operators – a powerful mathematical tool used to model transport in dynamical systems. In the infinite-dimensional context, and aside from works focusing exclusively on the i.i.d. perturbation (noise) setting, stability results have only been established either (i) under uniform hyperbolicity assumptions on the underlying cocycle, which for example cover the case of random perturbations of a fixed map [1, 6]; or (ii) for the top (first) component of the splitting, in the context of transfer operators [9], where the leading Lyapunov exponent is always 0, corresponding to a random fixed point.

Early results concerning stability of Lyapunov exponents for finite-dimensional (matrix) cocycles include [25, 20, 21, 18]. In the setting of invertible matrix cocycles, the closest results to this work are due to Ledrappier, Young and Ochs [26, 23, 24]. The difficulty of the stability problem at hand, even in the finite-dimensional setting, is highlighted by the existence of negative stability results for Lyapunov exponents of matrix cocycles [3, 4], which show that for non-uniformly hyperbolic cocycles, carefully chosen arbitrarily small perturbations may collapse the entire spectrum of Lyapunov exponents to a single exponent. In this finite-dimensional setting, the stability problem remains an active topic of research, and related recent results include [5, 2]. In the setting of semi-invertible matrix cocycles, the authors established stability results under stochastic perturbations in [10, 8].

In this paper, we study cocycles taking values in compact operators on a separable Hilbert space. The unperturbed cocycle is assumed to be strongly coercive, with exponentially-decaying transmission between higher order modes, so that the leading Oseledets spaces tend to be concentrated on low order modes. This issue of the cocycle sending an arbitrarily high-order mode to a low-order mode does not arise in the finite-dimensional setting. Additionally, unlike the finite-dimensional case, there is no natural Lebesgue-like measure on the infinite-dimensional space of perturbations. Hence as our model of noise we use additive Gaussian perturbations. The Gaussian nature of the perturbations allows for unbounded changes, and is also convenient for calculations. In order to maintain the noise as a small perturbation, the Gaussian perturbations are required to have stronger exponential decay than the unperturbed cocycle. We regard the model as a natural generalisation of the finite-dimensional Ledrappier-Young setting to infinite dimensions.

The main results of the paper, Theorems A and B, yield, respectively, convergence of Lyapunov exponents and Oseledets spaces of the randomly perturbed cocycles. The method of proof of stability of Lyapunov exponents builds on the work of Ledrappier and Young [23], which dealt with Lyapunov exponents in invertible matrix cocycles, as well as on our recent work [10], which had to handle the complications arising from non-invertibility of the matrices. The motivation for studying the case of non-invertible matrices is that transfer operators are generally not invertible. The strategies in all three papers, [23, 10] and this one, are similar in spirit: The idea is to split long sequences of matrices observed along the cocycle into good and bad blocks, depending on whether or not the long term behaviour of the cocycle corresponds to the observed behaviour within the block, and then handle carefully the concatenations. However, at the technical level, there are substantial complications arising from the need to handle wild perturbations occurring in possibly higher and higher modes.

As in the previous works [24, 10], the stability of Oseledets spaces is deduced from the stability of the Lyapunov exponents, but the strategy of the proof here is different. The approach of Ochs [24] applies only to invertible matrices, and the proof is essentially finite-dimensional. The core of the argument is: if the perturbed slowest Oseledets spaces were often far from its unperturbed counterpart, the contribution to the bottom exponent of the perturbed system on this part of the base space would be at least $\lambda_{d-1}$ . Hence, convergence of the exponents implies the perturbed and unperturbed slow spaces are mostly nearby. This is basically an expectation argument. Subsequent Oseledets spaces are similarly controlled using exterior powers. The approach of [10] in the context of not necessarily invertible matrices relies on the use of Möbius transformations or graph transforms. The essence of the argument is one fixes all of the perturbations to the matrices other than the perturbation at time $-1$ . Since there is exponential contraction in a cone around the unperturbed fast space (that is the span of the $k$ -dimensional Oseledets spaces with largest Lyapunov exponents), all but a very small set of perturbations at time $-1$ cause the fast space to fall into the basin of attraction, and to end up near the unperturbed fast space. While this approach would still apply in the infinite-dimensional case, the new argument of this paper has the advantages that it is simpler and more general; in particular, it does not rely on any special structure for the perturbations, such as absolute continuity, which played a role in [10]. All that is required is that the perturbations are small with high probability. The approach in the current paper goes as follows: if the perturbed $k$ -dimensional fast space is not close to the unperturbed fast space at time $2N$ (where $N$ is the block size), then the minimum angle between the perturbed fast space at time $N$ and the unperturbed slow space at time $N$ must be small. For this to happen, the minimum angle between the perturbed fast space at time 0 and the unperturbed slow space at time 0 has to be exponentially small. Whenever this happens, there is a growth drop of the $k$ -dimensional volume of order $\exp(-(\lambda_{k}-\lambda_{k+1})N)$ over this block. An expectation argument ensures that this must happen rarely because otherwise the perturbed $\lambda_{k}$ would be much less than the unperturbed $\lambda_{k}$ .

2. The model and principal results

Throughout the paper $\sigma\colon\Omega\to\Omega$ is an invertible measurable transformation, $\mathbb{P}$ is an ergodic invariant measure, and $H$ is a separable Hilbert space with basis $e_{1},e_{2},\ldots$ .

The Hilbert-Schmidt norm is $\|A\|_{\mathsf{HS}}^{2}=\sum_{i,j}\langle Ae_{i},e_{j}\rangle^{2}$ . Define a stronger norm: $\|A\|_{\mathsf{SHS}}^{2}=\sum_{i,j}2^{2(i+j)}\langle Ae_{i},e_{j}\rangle^{2}$ . We frequently think of operators with bounded HS norm as infinite matrices where the entries are square-summable. We write $\mathsf{HS}$ for the collection of Hilbert-Schmidt operators on $H$ (those operators, $A$ , satisfying $\|A\|_{\mathsf{HS}}<\infty$ ), and $\mathsf{SHS}$ for the collection of strong Hilbert-Schmidt operators (those operators satisfying $\|A\|_{\mathsf{SHS}}<\infty$ ).

We write $A^{(n)}_{\omega}$ for the unperturbed cocycle: $A^{(n)}_{\omega}=A_{\sigma^{n-1}\omega}\cdots A_{\omega}$ , and call $A\colon\Omega\to\mathsf{SHS}$ the generator of the operator cocycle. Throughout the article, $\Delta$ will denote the random Hilbert-Schmidt operator with independent normal entries with mean 0 and where the $(i,j)$ entry has standard deviation $3^{-(i+j)}$ . Write $\gamma$ for the measure on $\mathsf{SHS}$ corresponding to this distribution. We apply a sequence of independent perturbations $\mathbf{\Delta}=(\Delta_{n})_{n\in\mathbb{Z}}$ , where each $\Delta_{n}$ has the distribution above. For $\omega$ lying in the base space, we denote by ${\bar{\omega}}$ the pair $(\omega,\mathbf{\Delta})$ specifying the point of the base space and the sequence of perturbations. The space of such pairs is denoted by $\bar{\Omega}$ , and is equipped with the transformation $\bar{\sigma}=\sigma\times s$ , where $s$ is the left shift on the sequence of perturbations and the ergodic invariant measure $\bar{\mathbb{P}}=\mathbb{P}\times\gamma^{\mathbb{Z}}$ . The perturbed cocycle is parameterized by $\epsilon$ (a measure of the size of the perturbation) and defined by

[TABLE]

Theorem A.

Let $\sigma\colon\Omega\to\Omega$ be an invertible measurable transformation and let $\mathbb{P}$ be an ergodic invariant measure for $\sigma$ . Let $H$ be a separable Hilbert space and let $A\colon\Omega\to\mathsf{SHS}$ be the generator of an operator cocycle satisfying $\int\log\|A_{\omega}\|_{\mathsf{SHS}}\,d\mathbb{P}(\omega)<\infty$ .

Let $\bar{\Omega}$ , $\bar{\sigma}$ and $\bar{\mathbb{P}}$ be as defined above. For each parameter $\epsilon>0$ , define a new cocycle $A^{\epsilon}\colon\bar{\Omega}\to\mathsf{SHS}$ over $\bar{\sigma}$ with generator $A^{\epsilon}({\bar{\omega}})=A(\omega)+\epsilon\Delta_{0}$ . Then the Lyapunov exponents of $A^{\epsilon}$ (listed with multiplicity) converge to those of $A$ as $\epsilon\to 0$ .

Theorem B.

Assume the hypotheses and notation of Theorem A. Let the (at most countably many) distinct Lyapunov exponents of the cocycle $A$ be $\lambda_{1}>\lambda_{2}>\ldots>-\infty$ , with corresponding multiplicities $d_{1},d_{2},\ldots$ . Let the corresponding Oseledets decomposition be $\mathsf{SHS}=Y_{1}(\omega)\oplus Y_{2}(\omega)\oplus\ldots$ . Let $D_{0}=0$ , $D_{i}=d_{1}+\ldots+d_{i}$ and let the Lyapunov exponents (with multiplicity) be $\infty>\mu_{1}\geq\mu_{2}\geq\ldots>-\infty$ , so that $\mu_{j}=\lambda_{i}$ if $D_{i-1}<j\leq D_{i}$ .

Let $\mathcal{U}_{i}=(\lambda_{i}-\alpha,\lambda_{i}+\alpha)$ be a neighbourhood of $\lambda_{i}$ not containing any other exponent of the unperturbed cocycle. Let $\epsilon_{0}$ be such that for each $\epsilon\leq\epsilon_{0}$ and each $D_{i-1}<j\leq D_{i}$ , the $j^{\text{th}}$ Lyapunov exponent $\mu_{j}^{\epsilon}$ of the perturbed cocycle satisfies $\mu_{j}^{\epsilon}\in\mathcal{U}_{i}$ . For $\epsilon<\epsilon_{0}$ , let $Y^{\epsilon}_{i}(\bar{\omega})$ denote the sum of the Oseledets subspaces of $A^{\epsilon}$ having exponents in $\mathcal{U}_{i}$ . Then $Y^{\epsilon}_{i}(\bar{\omega})$ converges in probability to $Y_{i}(\omega)$ as $\epsilon\to 0$ .

For $\lambda>1$ , we let $\mathcal{D}_{\lambda}$ be the diagonal matrix whose $(i,i)$ entry is $\lambda^{-i}$ . Formally we can write the random operator $\Delta$ from Theorem A as $\mathcal{D}_{3}N\mathcal{D}_{3}$ , where $N$ is a countably infinite square matrix of independent standard normal random variables.

Throughout the remainder of the paper there will be numerous constants. We will mostly just use the symbol $C$ to indicate a constant, where $C$ may refer to different constants at different places, even in the same proof. That is, whenever we write $C$ , we refer to a quantity that may depend on $k$ (the number of exponents that we aim to control), and on the underlying dynamical system, but not on $\epsilon$ , the size of the perturbations. The exception to this will be some of the principal propositions where estimates are collected for assembly in Section 9. In these propositions, constants will be numbered according to the proposition in which they are found, so that $C_{\ref{lem:trivial}}$ , for example, is defined in Lemma 34.

We briefly describe the structure of the proof of Theorem A since there is considerable preparation before we start the proof. The bulk of the proof is concerned with giving a lower bound for the sum of the $k$ leading perturbed exponents, that is the maximal logarithmic growth rate of $k$ -volumes. Given $\epsilon$ , one defines a block length, $N\sim|\log\epsilon|$ . For a large $n$ , we estimate the top exponents of the product ${A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}$ , a perturbed block of length $nN$ . First, we replace the (sub-additive) logarithmic $k$ -volume growth, $\Xi_{k}(\cdot)$ by a related approximately super-additive quantity, $\tilde{\Xi}_{k}(\cdot)$ (Sections 7 and 8). We use this super-additivity to split ${A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}$ into good super-blocks (of length a multiple of $N$ ) and bad blocks (of length $N-2$ ), that is $\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})\gtrsim\tilde{\Xi}_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})\gtrsim\sum\tilde{\Xi}_{k}(\text{blocks})$ . In section 4, ingredients for the estimate $\Xi_{k}(G^{\epsilon})\gtrsim\Xi_{k}(G)$ are established, where $G$ represents a good super-block and $G^{\epsilon}$ its perturbed version. In sections 5 and 6, ingredients for $\tilde{\Xi}_{k}(B^{\epsilon})\gtrsim\tilde{\Xi}_{k}(B)$ are established (where $B$ is a bad block and $B^{\epsilon}$ is its perturbed version). The estimates $\tilde{\Xi}_{k}(B)\gtrsim\Xi_{k}(B)$ and $\tilde{\Xi}_{k}(G^{\epsilon})\gtrsim\Xi_{k}(G^{\epsilon})$ are based on ingredients in Section 8. Re-assembling the pieces using sub-additivity of $\Xi_{k}$ and accounting for the errors gives the result.

3. Notation and the quantity $\tilde{\Xi}_{k}$

Recall that the Grassmannian of a Banach space is the space of closed complemented subspaces. In a Hilbert space, every closed subspace is complemented (by its orthogonal complement). We define $\mathcal{G}_{k}(H)$ to be the space of (necessarily closed) $k$ -dimensional subspaces of $H$ and $\mathcal{G}^{k}(H)$ to be the space of closed $k$ -codimensional subspaces of $H$ . The collection of all closed subspaces of $H$ will be written $\mathcal{G}(H)$ . We will reserve the symbol $S$ for the unit sphere of $H$ throughout the article.

We define a metric on $\mathcal{G}(H)$ by

[TABLE]

that is the Hausdorff distance between the intersections of the two subspaces with the unit sphere. We remark that this differs by at most a bounded factor from another metric, the ‘gap’ between closed subspaces defined in Kato [19]. This is a complete metric on $\mathcal{G}(H)$ .

We also make use of a measure of transversality between two subspaces of complementary dimensions: if $U\in\mathcal{G}^{k}(H)$ and $V\in\mathcal{G}_{k}(H)$ , then

[TABLE]

The normalization is chosen so that if $U$ and $V$ have a common vector, then ${\perp}(U,V)=0$ , while if they are orthogonal complements, then ${\perp}(U,V)=1$ . We have the reverse triangle inequality: if $U^{\prime},U\in\mathcal{G}^{k}(H)$ , $V,V^{\prime}\in\mathcal{G}_{k}(H)$ , then ${\perp}(U^{\prime},V^{\prime})\geq{\perp}(U,V)-\angle(V,V^{\prime})-\angle(U,U^{\prime})$ .

We already introduced the classes of linear operators $\mathsf{HS}$ and $\mathsf{SHS}$ on $H$ with their associated norms, so that we have $\mathsf{SHS}\subset\mathsf{HS}\subset K(H)$ , where $K(H)$ stands for the compact linear operators on $H$ . We write $\|\cdot\|_{\mathsf{op}}$ for the operator norm, so that $\|\cdot\|_{\mathsf{SHS}}\geq\|\cdot\|_{\mathsf{HS}}$ for elements of $\mathsf{SHS}$ and $\|\cdot\|_{\mathsf{HS}}\geq\|\cdot\|_{\mathsf{op}}$ for elements of $\mathsf{HS}$ .

For compact operators on $H$ , the notions of singular vectors and singular values pass directly from the finite-dimensional case. If $A\in K(H)$ , we write $s_{1}(A)\geq s_{2}(A)\geq\ldots$ for the singular values (with multiplicity in decreasing order). The maximal logarithmic rate of $k$ -dimensional volume growth is given by $\Xi_{k}(A):=\log(s_{1}(A)\cdots s_{k}(A))$ .

Define

[TABLE]

where $\Pi_{k}$ denotes orthogonal projection onto the subspace of $H$ spanned by $e_{1},\ldots,e_{k}$ and $\Delta$ and $\Delta^{\prime}$ are independent copies of the random Hilbert-Schmidt operator. The key reason for the introduction of $\tilde{\Xi}_{k}$ is that it satisfies an approximate super-additivity property (see Proposition 24) that complements the sub-additivity of $\Xi_{k}$ .

We denote by $\bar{\Omega}$ , the space $\Omega\times\mathsf{SHS}^{\mathbb{Z}}$ and act on $\bar{\Omega}$ with the transformation $\sigma\times s$ , where $s$ is the left-shift map on $\mathsf{SHS}^{\mathbb{Z}}$ . The space $\bar{\Omega}$ is equipped with the measure $\mathbb{P}\times\gamma^{\mathbb{Z}}$ , where $\gamma$ is the multi-variate normal distribution on $\mathsf{SHS}$ described above in which distinct elements of $\Delta$ are independent and the $(i,j)$ element is normal with mean 0 and variance $3^{-2(i+j)}$ . We write $\bar{\omega}$ for a typical element of $\bar{\Omega}$ , that is a pair $(\omega,\mathbf{\Delta})$ , where $\mathbf{\Delta}=(\Delta_{n})_{n\in\mathbb{Z}}$ .

Informally, we expect an inequality like $\tilde{\Xi}_{k}(A)\geq\Xi_{k}(A)-\operatorname{\mathcal{E}^{\text{L}}}_{k}(A)-\operatorname{\mathcal{E}^{\text{R}}}_{k}(A)$ . By $\operatorname{\mathcal{E}^{\text{L}}}_{k}(A)$ (which stands for ‘left energy’), we mean a measure of the modes on which the top $k$ left singular vectors are distributed, while $\operatorname{\mathcal{E}^{\text{R}}}_{k}(A)$ measures the modes where the right singular vectors are supported. For example, if the top left singular vectors are $e_{7}$ , $e_{8}$ , $e_{11}$ and $e_{13}$ , we expect $\operatorname{\mathcal{E}^{\text{L}}}_{4}(A)$ to be approximately 39 $\log 3$ .

Lemma 1.

Let $V$ be a $k$ -dimensional subspace of $H$ . Let $D$ be a bounded operator on $H$ . There exists an orthonormal basis $v_{1},\ldots,v_{k}$ for $V$ with the property that $Dv_{1},\ldots,Dv_{k}$ are mutually orthogonal.

This follows from the singular value decomposition of finite-dimensional operators.

Lemma 2.

Let $U$ and $V$ be $k$ -dimensional subspaces of $H$ . Then the two quantities appearing in the definition of $\angle(U,V)$ are equal:

[TABLE]

Proof.

Let $\Pi_{U}$ be the orthogonal projection onto $U$ and $\Pi_{V}$ be the orthogonal projection onto $V$ . Then the singular vectors of $\Pi_{V}\circ\Pi_{U}$ give an orthogonal basis of $U$ , $u_{1},\ldots,u_{n}$ with images $s_{1}v_{1},\ldots,s_{n}v_{n}$ , where $v_{1},\ldots,v_{n}$ form an orthogonal basis of $V$ (if $\Pi_{V}\Pi_{U}u_{i}=0$ , then $v_{i}$ can be chosen to be an arbitrary unit vector of $V$ satisfying the orthogonality condition). Write $u_{i}=s_{i}v_{i}+w_{i}$ with $w_{i}\in V^{\perp}$ . One can then check that $\langle u_{i},v_{j}\rangle=0$ if $i\neq j$ . Notice that $u_{i}$ and $s_{i}v_{i}$ are either equal or non-collinear. It follows from the above that $U+V$ may be expressed as the orthogonal direct sum $\operatorname{lin}\{u_{1},v_{1}\}\oplus\ldots\oplus\operatorname{lin}\{u_{n},v_{n}\}$ . One can now check that the linear map $R$ from $U+V$ to itself mapping $u_{i}$ to $v_{i}$ and vice versa is an isometry interchanging $U$ and $V$ . Applying this map yields the desired equality. ∎

Let $V$ be a $k$ -dimensional subspace of $H$ , and $\Pi$ be the orthogonal projection onto $V$ . We define the energy of $\Pi$ (also the ‘energy of $V$ ’) to be

[TABLE]

where the $(v_{i})$ are as guaranteed by the Lemma 1 with the operator $D$ taken to be $\mathcal{D}_{3}$ .

Lemma 3.

For any $k\in\mathbb{N}$ , there exists a $C>0$ such that if $\Pi$ and $\Pi^{\prime}$ are orthogonal projections onto $k$ -dimensional subspaces and $Q\subset\mathsf{HS}$ satisfies $\gamma(Q)\geq\frac{1}{2}$ , then

[TABLE]

Proof.

Let $u_{1},\ldots,u_{k}$ be the basis guaranteed by Lemma 1 (applied with $D=\mathcal{D}_{3}$ ) for the range of $\Pi$ and $v_{1},\ldots,v_{k}$ be the corresponding basis for $\Pi^{\prime}$ .

Now $\Xi_{k}(\Pi\Delta\Pi^{\prime})=\log\det|M|$ , where $M$ is a random matrix whose $(i,j)$ entry is $\langle u_{i},\Delta v_{j}\rangle$ . The entries of $M$ therefore have a multi-variate normal distribution. Each has mean 0, so the unconditioned distribution of $M$ is determined by the covariance of the pairs of entries of the matrix.

Using the fact that the coordinates of the $u$ ’s and $v$ ’s are bounded and the entries of $\Delta$ decay exponentially, we calculate

[TABLE]

where for the second line, we used the fact that distinct entries of $\Delta$ are independent, and so have 0 covariance. We see then, by the choice of $u$ ’s and $v$ ’s, that distinct entries of $M$ have 0 covariance, and so are independent. The variance of the $(i,j)$ entry of the matrix is $\|\mathcal{D}_{3}u_{i}\|^{2}\|\mathcal{D}_{3}v_{j}\|^{2}$ , hence the unconditioned distribution of the $(i,j)$ entry of the matrix is $\|\mathcal{D}_{3}u_{i}\|\|\mathcal{D}_{3}v_{j}\|$ times a standard normal.

Notice that the entire $i$ row has a multiplicative factor of $\|\mathcal{D}_{3}u_{i}\|$ and the entire $j$ column has a multiplicative factor of $\|\mathcal{D}_{3}v_{j}\|$ , so that the determinant is $\prod_{i}\|\mathcal{D}_{3}u_{i}\|\prod_{j}\|\mathcal{D}_{3}v_{j}\|\det(N_{k})$ , where $N_{k}$ is a $k\times k$ random matrix with independent standard normal entries, so that taking logarithms, we see $\Xi_{k}(\Pi\Delta\Pi^{\prime})=-\operatorname{\mathcal{E}}_{k}(\Pi)-\operatorname{\mathcal{E}}_{k}(\Pi^{\prime})+\log|\det N_{k}|$ .

Replacing $\Delta$ with a conditioned version has the effect of multiplying the density of $N_{k}$ by a factor in the range $[0,2]$ . Since $\log|\det(N_{k})|$ is an integrable function, there are uniform upper and lower bounds for $\int\log|\det(N_{k})|\rho(N_{k})$ over all functions $\rho$ taking values in $[0,2]$ , so that

[TABLE]

as required. ∎

Corollary 4.

There exists $K>0$ such that if $\Pi^{\prime}$ and $\Pi^{\prime\prime}$ are two orthogonal projections and $Q\subset\mathsf{HS}$ satisfies $\gamma(Q)>\frac{1}{2}$ , then

[TABLE]

Proof.

By Lemma 3, we have the following

[TABLE]

where $C$ is the constant from Lemma 3.

We calculate that $\operatorname{\mathcal{E}}_{k}(\Pi_{k})=\frac{1}{2}k(k-1)\log 3$ , so that combining the inequalities, we obtain

[TABLE]

where $K=3C+k(k-1)\log 3$ . ∎

4. Good Blocks

This section deals with good blocks. The strategy we follow goes back to Ledrappier and Young in the context of invertible matrices [23, Lemmas 3.3, 3.6 & 4.3], and it was later used in [10]. Lemma 7 is the main tool to control the effect of perturbations on good blocks. Lemma 8 collects standard facts about Lyapunov exponents, Oseledets splittings and their approximations via singular vectors, which are used to define good blocks. Lemma 9 establishes the conditions defining tame perturbations. Proposition 10 provides a lower bound on $\Xi_{k}$ over a sequence of tame perturbations, comparable with $\Xi_{k}$ for the unperturbed cocycle.

For each $k\in\mathbb{N}$ , we define $E_{k}(A)$ to be the space spanned by the images of the singular vectors with $k$ largest singular values under $A$ , and $F_{k}(A)$ to be the space spanned by the orthogonal complement of the pre-image of $E_{k}(A)$ under $A$ . Thus, $F_{k}(A)$ is exactly the space spanned by those singular vectors of $A$ whose singular value is not amongst the $k$ largest. We note that the spaces $F_{k}(A),E_{k}(A)$ are uniquely defined when the singular values $s_{k}(A)$ and $s_{k+1}(A)$ are distinct. We will always use our results in this setting, and therefore do not worry about the possibility of non-uniqueness.

We collect some properties of singular values and singular vectors for compact operators on Hilbert spaces and matrices.

Lemma 5.

Let $A$ be a compact operator on a Hilbert space, $H$ . Let the singular values be $s_{1}(A),s_{2}(A),\ldots$ .

(a)

$s_{j}(A)=\min_{V\in\mathcal{G}^{j-1}(H)}\max_{x\in V\cap S}\|Ax\|$ ; 2. (b)

$s_{j}(A)=\max_{V\in\mathcal{G}_{j}(H)}\min_{x\in V\cap S}\|Ax\|$ ; 3. (c)

$|s_{j}(A)-s_{j}(B)|\leq\|A-B\|_{\mathsf{op}}$ ;

Proof.

The characterizations (a) and (b) are well known.

To show (c), using (b), let $V$ be a $j$ -dimensional space such that $\|Ax\|\geq s_{j}(A)$ for all $x\in V\cap S$ . Then $\|Bx\|\geq s_{j}(A)-\|A-B\|_{\mathsf{op}}$ for all $x\in V\cap S$ , so that using (b) again, we see $s_{j}(B)\geq s_{j}(A)-\|A-B\|_{\mathsf{op}}$ . By symmetry, $s_{j}(A)\geq s_{j}(B)-\|A-B\|_{\mathsf{op}}$ , giving the result. ∎

Lemma 6.

Let $U\in\mathcal{G}^{k}(H)$ and $V\in\mathcal{G}_{k}(H)$ . Then $s_{k}(\Pi_{U^{\perp}}\Pi_{V})\geq{\perp}(U,V)$ .

Proof.

Choose $v\in V$ with $\|v\|=1$ . Let $v=u+w$ with $u\in U$ and $w\in U^{\perp}$ . Let $\hat{u}\in U\cap S$ be such that $u=\|u\|\hat{u}$ ( $\hat{u}$ may be chosen arbitrarily if $u=0$ ) and let $\theta$ be the angle between $\hat{u}$ and $v$ , so that $0<\theta\leq\frac{\pi}{2}$ . By assumption $\|\hat{u}-v\|\geq\sqrt{2}\,{\perp}(U,V)$ . We have $\|\hat{u}-v\|=2\sin\frac{\theta}{2}$ . Notice that $\|w\|=\|\Pi_{U^{\perp}}v\|=\sin\theta=2\sin\frac{\theta}{2}\cos\frac{\theta}{2}\geq\sqrt{2}\,{\perp}(U,V)\cos\frac{\theta}{2}$ . Since $\theta\leq\frac{\pi}{2}$ , we see $\|\Pi_{U^{\perp}}v\|\geq{\perp}(U,V)$ for all $v\in V\cap S$ . ∎

Lemma 7.

For any $\delta<\frac{1}{2}$ , there exists a $K>\delta^{-(4k+3)}$ such that if (i) the $k$ th singular value of a compact linear operator $A:X\to X$ exceeds $K$ ; (ii) the $(k+1)$ st singular value of $A$ is at most 1; and (iii) $\|B-A\|\leq 1$ , then the following hold:

(a)

$e^{-\delta}\leq s_{j}(A)/s_{j}(B)\leq e^{\delta}$ * for each $j\leq k$ and $s_{j}(B)\leq 2$ for each $j>k$ ;* 2. (b)

$\angle(E_{k}(A),E_{k}(B))$ * and $\angle(F_{k}(A),F_{k}(B))$ are less than $\delta$ ;* 3. (c)

If $V$ is any subspace of dimension $k$ such that ${\perp}(V,F_{k}(A))>\delta$ , then $\angle(BV,E_{k}(A))<\delta$ ; 4. (d)

If $V$ is a subspace of dimension $k$ and ${\perp}(V,F_{k}(A))>2\delta$ , then $|\det(B|_{V})|\geq\delta^{k}\exp\Xi_{k}(B)$ .

Proof.

For each closed subspace $W$ of $X$ , let $\Pi_{W}:X\to W$ be the orthogonal projection onto $W$ .

(a)

For the first part, notice that by assumption, for $j\leq k$ , we have $s_{j}(A)\geq K$ . Also by Lemma 5(c), we have $|s_{j}(A)-s_{j}(B)|\leq 1$ , so that $\frac{K}{K+1}\leq s_{j}(A)/s_{j}(B)\leq\frac{K}{K-1}$ . The second part of the claim follows from Lemma 5(c) also. 2. (b)

Let $K>1+\frac{6}{\delta}$ . For symmetry, in this part, we assume only $s_{k}(A),s_{k}(B)\geq K-1$ , $s_{k+1}(A),s_{k+1}(B)\leq 2$ and $\|A-B\|_{\mathsf{op}}\leq 1$ .

Let $v\in S$ satisfy $d(v,F_{k}(A)\cap S)\geq\delta$ . We will show that $v\not\in F_{k}(B)$ . Let $v=u+w$ with $u\in F_{k}(A)$ and $w\in F_{k}(A)^{\perp}$ . By assumption, $\|w\|\geq\frac{\delta}{2}$ , so that $\|Bv\|\geq\|Av\|-1\geq\|Aw\|-1>(K-1)\|w\|-1>2$ . On the other hand, if $v\in F_{k}(B)$ , then $\|Bv\|\leq s_{k+1}(B)\leq 2$ . The identical argument shows that if $v\in F_{k}(A)\cap S$ , then $d(v,F_{k}(B)\cap S)<\delta$

To show the closeness of the fast spaces, first let $v\in F_{k}(B)^{\perp}\cap S$ , and write $v$ as $au+w$ , where $u\in F_{k}(A)\cap S$ and $w\in F_{k}(A)^{\perp}$ . Let $u^{\prime}\in F_{k}(B)\cap S$ satisfy $\|u-u^{\prime}\|<\delta$ (such a $u^{\prime}$ exists by the paragraph above). Now $\langle v,u\rangle=\langle v,u^{\prime}\rangle+\langle v,u-u^{\prime}\rangle$ . The first term is 0 and the second term is less than $\delta$ in absolute value. Hence $|a|<\delta$ and $\|w\|\geq\frac{1}{2}$ . Now $Bv=aAu+Aw+(B-A)v$ . In particular, $\|Bv-Aw\|\leq 2\delta+1\leq 2$ while $\|Bv\|\geq K-1$ . Hence if $z\in E_{k}(B)\cap S$ , we have $d(z,E_{k}(A))\leq 2/(K-1)$ , so $d(z,E_{k}(A)\cap S)\leq 4/(K-1)$ . The identical argument holds if the roles of $A$ and $B$ are reversed, so $\angle(E_{k}(A),E_{k}(B))<4/(K-1)<\delta$ . 3. (c)

Let $K>4/\delta^{2}+2/\delta$ . Let $v\in V\cap S$ and write $v=u+w$ with $u\in F_{k}(A)$ and $w\in F_{k}(A)^{\perp}$ . By assumption, $\|w\|\geq\delta$ . Hence $\|Aw\|\geq K\delta$ , while $\|Au\|\leq 1$ . Since $\|B-A\|_{\mathsf{op}}\leq 1$ , we have $\|Bv-Aw\|\leq\|Bv-Av\|+\|Av-Aw\|\leq 2$ , so that $\|Bv-Aw\|/\|Bv\|\leq 2/(K\delta-2)$ . Hence for an arbitrary element, $y$ of $BV\cap S$ , we have $d(y,E_{k}(A))\leq 2/(K\delta-2)<\frac{\delta}{2}$ and $d(y,E_{k}(A)\cap S)\leq 4/(K\delta-2)<\delta$ . By Lemma 2, we deduce that $\angle(BV,E_{k}(A))<\delta$ as required. 4. (d)

We have that $\log|\det(B|_{V})|\geq\Xi_{k}(\Pi_{E_{k}(B)}B|_{V})=\Xi_{k}(B\Pi_{F_{k}(B)^{\perp}}|_{V})=\Xi_{k}(B\Pi_{F_{k}(B)^{\perp}}\Pi_{V})=\Xi_{k}(B\Pi_{F_{k}(B)^{\perp}})+\Xi_{k}(\Pi_{F_{k}(B)^{\perp}}\Pi_{V})\geq\Xi_{k}(B)+k\log\delta.$ The last inequality follows from the facts that $\Xi_{k}(B\Pi_{F_{k}(B)^{\perp}})=\Xi_{k}(B)$ ; and ${\perp}(F_{k}(B)^{\perp},V)\geq{\perp}(F_{k}(A)^{\perp},V)-\angle(F_{k}(A)^{\perp},F_{k}(B)^{\perp})>\delta$ so that $\|\Pi_{F_{k}(B)^{\perp}}\Pi_{V}v\|\geq\delta\|v\|$ for every $v\in V$ by Lemma 6, hence $\Xi_{k}(\Pi_{F_{k}(B)^{\perp}}\Pi_{V})\geq k\log\delta$ . The claim follows.

∎

The following lemma underlies the definition of good blocks: Using the notation of the lemma, if $n\geq n_{0}$ and $\omega\in G$ , and we say the block $A^{(n)}_{\omega}$ is good. See [10, Lemma 2.4] for a proof in the context of matrix cocycles, which applies without changes in our setting.

Lemma 8 (Good blocks).

Let $\sigma$ be an invertible ergodic measure-preserving transformation of $(\Omega,\mathbb{P})$ and let $A\colon\Omega\to\mathsf{SHS}$ be a measurable map, taking values in the strong Hilbert-Schmidt operators on $H$ , and such that $\int\log^{+}\|A(\omega)\|_{\mathsf{SHS}}\,d\mathbb{P}(\omega)<\infty$ . Let the Lyapunov exponents of the cocycle $A$ be $\infty>\mu_{1}\geq\mu_{2}\geq\ldots\geq-\infty$ , counted with multiplicities. Suppose $k\geq 1$ is such that $\mu_{k}>0>\mu_{k+1}$ . Let $E_{k}(\omega)$ and $F_{k}(\omega)$ denote the $k$ -dimensional and $k$ -codimensional Oseledets spaces of $A$ at $\omega$ corresponding to Lyapunov exponents $\mu_{1}\geq\dots\geq\mu_{k}$ and $\mu_{k+1}\geq\dots$ , respectively.

Let $\xi>0$ and $\delta_{1}>0$ be given. Then there exist $n_{0}>0$ , $\tau\leq\min(\delta_{1},\frac{1}{4}\mu_{k})$ and $0<\delta\leq\delta_{1}$ such that: for all $n\geq n_{0}$ , there exists a set $G\subseteq\Omega$ with $\mathbb{P}(G)>1-\xi$ such that for $\omega\in G$ , we have

(a)

$\perp(F_{k}(\omega),E_{k}(\omega))>10\delta$ ; 2. (b)

$\angle(E_{k}(A^{(n)}_{\omega}),E_{k}(\sigma^{n}\omega))<\delta$ ; 3. (c)

$\angle(F_{k}(A^{(n)}_{\omega}),F_{k}(\omega))<\delta$ ; 4. (d)

$e^{(\mu_{k}+\tau)n}>s_{k}(A^{(n)}_{\omega})>\max(K(\delta),e^{(\mu_{k}-\tau)n})$ * and $s_{k+1}(A^{(n)}_{\omega})<1$ , where $K(\delta)$ is as given in Lemma 7.* 5. (e)

$\frac{1}{n}\sum_{i=0}^{n-1}\log(1+\|A_{\sigma^{i}\omega}\|_{\mathsf{SHS}})<2\int\log(1+\|A_{\omega}\|_{\mathsf{SHS}})\,d\mathbb{P}(\omega)$ .

Assume that $\epsilon>0$ is fixed. A perturbation $\Delta$ is said to be tame if $|\Delta_{s,t}|\leq\epsilon^{-1/2}(\frac{2}{3})^{s+t}$ for all $s,t$ (otherwise $\Delta$ is wild). A quick calculation shows that if $\Delta$ is tame, then $\|\epsilon\Delta\|_{\mathsf{HS}}<2\sqrt{\epsilon}$ .

Lemma 9 (Good block length).

Let $\sigma:(\Omega,\mathbb{P})\circlearrowleft$ be an ergodic measure-preserving transformation. Let $A:\Omega\to\mathcal{B}(H)$ be a measurable map, taking values in the bounded linear operators on $H$ , such that $\log^{+}\|A(\omega)\|_{\mathsf{op}}$ is integrable. There exists $C_{\ref{lem:goodPert}}>0$ such that for all $\eta_{0}>0$ , there exists $\epsilon_{0}$ such that for all $\epsilon<\epsilon_{0}$ , there exists $G\subseteq\Omega$ of measure at least $1-\eta_{0}$ such that for all $\omega\in G$ , if $(\Delta_{n})\in\mathsf{HS}^{\mathbb{Z}}$ satisfies $\Delta_{n}$ is tame for each $0\leq n<N$ , then

[TABLE]

where $\bar{\omega}=(\omega,(\Delta_{n}))$ , $N=\lfloor C_{\ref{lem:goodPert}}|\log\epsilon|\rfloor$ and ${A^{\epsilon}}_{\bar{\omega}}^{(N)}=A^{\epsilon}_{N-1}(\bar{\omega})\dots A^{\epsilon}_{1}(\bar{\omega})A^{\epsilon}_{0}(\bar{\omega})$ .

The probability that one of $\Delta_{0},\ldots,\Delta_{N-1}$ is wild is $O(e^{-1/(2\epsilon)})$ .

Proof.

Let $g(\omega)=\log^{+}(\|A_{\omega}\|_{\mathsf{op}}+1)$ and let $C>0$ satisfy $\int g(\omega)\,d\mathbb{P}(\omega)<1/(2C)$ . Notice that provided $\epsilon<\frac{1}{4}$ (and assuming that the perturbations $(\Delta_{n})_{0\leq n<N}$ are tame, so that $\|\epsilon\Delta_{n}\|_{\mathsf{op}}\leq\|\epsilon\Delta_{n}\|_{\mathsf{HS}}\leq 2\sqrt{\epsilon}$ for $0\leq n<N$ ), $\log^{+}\|A_{\bar{\sigma}^{n}\bar{\omega}}^{\epsilon}\|\leq g(\sigma^{n}\omega)$ for each $0\leq n<N$ , and

[TABLE]

There exists $n_{0}$ such that for $N\geq n_{0}$ , $g(\omega)+\ldots+g(\sigma^{N-1}\omega)\leq N/(2C)-\log(4N)$ on a set of measure at least $1-\eta_{0}$ , hence $2N\sqrt{\epsilon}\exp(g(\omega)+\ldots+g(\sigma^{N-1}\omega))\leq\frac{1}{2}\sqrt{\epsilon}\exp(N/(2C))$ on a set of measure at least $1-\eta_{0}$ . In particular, provided $\lfloor C|\log\epsilon_{0}|\rfloor>n_{0}$ , taking $N=\lfloor C|\log\epsilon|\rfloor$ , we have $\|{A^{\epsilon}}_{\bar{\omega}}^{(N)}-A^{(N)}_{\omega}\|_{\mathsf{op}}\leq 1$ provided that the perturbations $\Delta_{0},\ldots\Delta_{N-1}$ are all tame.

Recall that $(i,j)$ th entry of $\Delta$ is distributed as $3^{-(i+j)}$ times a standard normal random variable. Hence the probability that $|\Delta_{i,j}|>\epsilon^{-1/2}(\frac{2}{3})^{i+j}$ is $\mathbb{P}(|N|>\epsilon^{-1/2}2^{i+j})$ . Using a standard estimate on the tail of a normal random variable [7, Theorem 1.2.3], this is at most $\frac{2\sqrt{\epsilon}}{\sqrt{2\pi}}2^{-(i+j)}\exp(-2^{2i+2j-1}/\epsilon)$ .

In particular, using the union bound, the probability that one of $\Delta_{0},\ldots,\Delta_{N-1}$ is wild is $O(e^{-1/(2\epsilon)})$ . ∎

We comment that once $\xi>0$ and $\delta_{1}>0$ are fixed, Lemma 8 guarantees the existence of an $n_{0}$ such that for all sufficiently large $n$ , the good set defined in the lemma has measure at least $1-\xi$ . Now for $\epsilon$ sufficiently small, the length $N=\lfloor C_{\ref{lem:goodPert}}|\log\epsilon|\rfloor$ exceeds $n_{0}$ . For the remainder of the proof, we let $G$ be the good set from Lemma 8 with $n$ taken to be $N$ (so that the good set, $G$ , depends on $\xi$ , $\delta_{1}$ and $\epsilon$ , but this dependence will not be made explicit). We further introduce the notation $\bar{G}=G\cap\bigcap_{i=0}^{N-1}\{\Delta_{i}\text{ is tame}\}$ , which we shall also use for the remainder of the proof.

Proposition 10 (Glueing good blocks).

Under the assumptions of Lemma 8, suppose $j<l$ and $\bar{\sigma}^{jN}\bar{\omega},\bar{\sigma}^{(j+1)N}\bar{\omega},\ldots,\bar{\sigma}^{(l-1)N}\bar{\omega}\in\bar{G}$ . Then,

[TABLE]

Proof.

Let $B_{n}=A^{(N)}_{\sigma^{nN}\omega}$ and $\tilde{B}_{n}={A^{\epsilon}}_{\bar{\sigma}^{nN}{\bar{\omega}}}^{(N)}$ . This is proved by induction using Lemma 7. Recall that since $B_{n}$ is a good block, $\|B_{n}-\tilde{B}_{n}\|\leq 1$ by Lemma 9. We let $\tilde{V}_{j}=V_{j}=F_{k}(B_{j})^{\perp}$ and define $V_{n+1}=B_{n}V_{n}$ and $\tilde{V}_{n+1}=\tilde{B}_{n}\tilde{V}_{n}$ .

We claim that the following hold, for each $n=j,j+1,\dots,l-1$ :

(i)

$\angle(V_{n},\tilde{V}_{n})<2\delta$ ; 2. (ii)

$\perp(V_{n},F_{k}(B_{n}))>\delta$ and $\perp(\tilde{V}_{n},F_{k}(B_{n}))>\delta$ .

Item (i) and the first part of (ii) hold immediately for the case $n=j$ . The second part of (ii) holds because $\tilde{V}_{j}=V_{j}=F_{k}(B_{j})^{\perp}$ and $\angle(F_{k}(B_{j}),F_{k}(\tilde{B}_{j}))<\delta$ by Lemma 7(b).

Given that (i) and (ii) hold for $n=m$ , $B_{m}$ is a good block and $\tilde{B}_{m}$ is a good perturbation, Lemma 7(c) implies that $\angle(V_{m+1},E_{k}(B_{m}))<\delta$ , $\angle(\tilde{V}_{m+1},E_{k}(B_{m}))<\delta$ so that $\angle(\tilde{V}_{m+1},V_{m+1})<2\delta$ , yielding (i) for $n=m+1$ .

Making use of the induction hypothesis and Lemma 8, we have that $\angle(E_{k}(\sigma^{(m+1)N}\omega),E_{k}(B_{m}))<\delta,\angle(F_{k}(\sigma^{(m+1)N}\omega),F_{k}(B_{m}))<\delta$ and $\perp(E_{k}(\sigma^{(m+1)N}\omega),F_{k}(\sigma^{(m+1)N}\omega))>10\delta$ . Thus, we obtain (ii) for $n=m+1$ .

Hence using Lemma 7(d), we see that $\log|\text{det}(\tilde{B}_{n}|_{\tilde{V}_{n}})|\geq k\log\delta+\Xi_{k}(\tilde{B}_{n})\geq k\log\delta-k\delta+\Xi_{k}(B_{n})\geq\Xi_{k}(B_{n})+2k\log\delta$ , where we made use of Lemma 7(a) for the second inequality.

Since $\Xi_{k}(\tilde{B}_{l-1}\cdots\tilde{B}_{j})\geq\sum_{i=j}^{l-1}\log|\det(\tilde{B}_{i}|\tilde{V}_{i})|$ , summing yields

[TABLE]

as required. ∎

Lemma 11.

Let the Hilbert-Schmidt cocycle, $A\colon\Omega\to\mathsf{HS}$ and all parameters and perturbations be as above. If $\bar{\sigma}^{iN}\omega\in\bar{G}$ for each $0\leq i<n$ , then $\angle\left(F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}),F_{k}(A^{(N)}_{\omega})\right)<\delta$ .

Proof.

By the first part of (2), $\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})>\sum_{i=0}^{n-1}\Xi_{k}(A^{(N)}_{\sigma^{iN}\omega})+2nk\log\delta$ . Also, $\Xi_{k+1}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})\leq\sum_{i=0}^{n-1}\Xi_{k+1}({A^{\epsilon}}_{\bar{\sigma}^{iN}{\bar{\omega}}}^{(N)})\leq\sum_{i=0}^{n-1}\Xi_{k}({A^{\epsilon}}_{\bar{\sigma}^{iN}{\bar{\omega}}}^{(N)})+n\log 2\leq\sum_{i=0}^{n-1}\Xi_{k}(A^{(N)}_{\sigma^{iN}\omega})+n\log 2+nk\delta\leq\sum_{i=0}^{n-1}\Xi_{k}(A^{(N)}_{\sigma^{iN}\omega})-2nk\log\delta$ by Lemma 7(a). Since we have $\log s_{k+1}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})=\Xi_{k+1}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})-\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})$ , we deduce $s_{k+1}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})\leq\delta^{-4nk}$ .

On the other hand, if $v\in S$ is such that ${\perp}(v,F_{k}(A^{(N)}_{\omega}))>\delta$ , an inductive argument exactly like the proof of Proposition 10 shows that $\|{A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}v\|\geq(\frac{\delta}{3})^{n}e^{-n\delta}\prod_{i=0}^{n-1}s_{k}(A^{(nN)}_{\omega})\geq(\delta^{3}K(\delta))^{n}$ . The choice of $K(\delta)$ in Lemma 7 ensures $\|{A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}v\|>s_{k+1}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})$ , so that $v\not\in F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})$ . ∎

Proposition 12.

Let $\omega$ be such that $\bar{\sigma}^{iN}{\bar{\omega}}\in\bar{G}$ for $0\leq i<n$ . Then for any $V$ such that ${\perp}(V,F_{k}(A^{(N)}_{\omega}))>2\delta$ , one has $\log|\det({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}|_{V})|\geq\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(nN)})+k\log\delta$ .

Proof.

We argue as in Lemma 7(d).

[TABLE]

where we used Lemma 6 for the last line. Lemma 11 and the triangle inequality allow us to conclude. ∎

5. Comparing perturbed and unperturbed bad blocks (Type I)

We distinguish two ways in which a block can be bad: types I and II. A type I bad block is one where the unperturbed cocycle has bad properties. On the other hand, a type II bad block is one where the unperturbed cocycle is well-behaved, but the perturbations are wild.

Conditional on being in a type I bad block, the perturbations are unconstrained, whereas conditional on being in a type II bad block at least one perturbation is constrained to be large. For later use with the type II bad blocks, we state some of the lemmas when one is conditioned to be in a high probability event (but the high probability event will be taken to be the whole space when dealing with type I blocks.)

Lemma 13.

Let $k>0$ . There exists a $C>0$ with the following property. Let $T$ be a multi-variate normal Hilbert-Schmidt-valued random operator whose entries have mean 0, let $A\in\mathsf{HS}$ and let $\Pi$ and $\Pi^{\prime}$ be orthogonal projections onto $k$ -dimensional subspaces of $H$ . Then for any subset $Q$ of $\mathsf{HS}$ such that $\mathbb{P}(T\in Q)\geq\frac{1}{2}$ , one has

[TABLE]

where $x^{-}$ denotes $\min(x,0)$ .

Proof.

We assume $\Xi_{k}(\Pi A\Pi^{\prime})>-\infty$ as otherwise the result is trivial.

Let $\tilde{\Pi}$ be $\Pi$ composed with an isometry from the range of $\Pi$ to $\mathbb{R}^{k}$ and similarly let $\tilde{\Pi}^{\prime}$ be an isometry from $\mathbb{R}^{k}$ to the range of $\Pi^{\prime}$ . Then we have $\Xi_{k}(\Pi B\Pi^{\prime})=\log|\det(\tilde{\Pi}B\tilde{\Pi}^{\prime})|$ for any bounded operator $B$ on $H$ so that we need to show

[TABLE]

Let $Y=\tilde{\Pi}A\tilde{\Pi}^{\prime}$ and $Z=\tilde{\Pi}T\tilde{\Pi}^{\prime}$ , so that $Y$ is a fixed $k\times k$ matrix and $Z$ is a $k\times k$ matrix-valued random variable with multivariate normal entries. By our earlier assumption, $Y$ is invertible, so let $X=ZY^{-1}$ (this also has multi-variate normal entries for unconditioned $T$ ). We then need a lower bound for $\operatorname{\mathbb{E}}\big{(}\log\det(I+X)\big{|}T\in Q\big{)}$ .

The unconstrained matrix-valued random variable $X$ can be written as $\sum_{l=1}^{d}N_{l}B^{l}$ , where the $B^{l}$ are fixed $k\times k$ matrices, $d$ is the dimension of the support of $X$ (at most $k^{2}$ depending on the pattern of entries in the unperturbed $A$ ’s) and the $N_{l}$ are independent standard normal random variables (see for example [7, Example 3.9.2]).

Let $\Psi$ denote the map from $\mathbb{R}^{d}$ to $M_{k\times k}$ defined by $x\mapsto\sum x_{l}B^{l}$ . Let $\mathcal{S}$ be the image under $\Psi$ of the unit sphere and $\mu$ be the measure on $\mathcal{S}$ that is the push-forward of the normalized volume measure on the unit sphere. The unconditioned measure on $X$ is then the push forward of $\mu\times C_{d}r^{d-1}e^{-r^{2}/2}\,dr$ , where $C_{d}$ is chosen so that $C_{d}\int r^{d-1}e^{-r^{2}/2}\,dr=1$ . The conditioned measure on $X$ (since the event being conditioned upon is of measure at least $\frac{1}{2}$ ) is of the same form, but the density is multiplied by a varying factor in the range [0,2].

It then suffices to lower bound

[TABLE]

In particular, it is enough to give a uniform lower bound for

[TABLE]

as $d$ ranges over the range 1 to $k^{2}$ and $M$ ranges over $M_{k\times k}$ .

For each fixed $M$ , write $p_{M}(r)=\det(I+rM)$ , so that $p_{M}$ is a polynomial of degree $k$ satisfying $p_{M}(0)=1$ . Hence $p_{M}(r)$ can be written as a product $\prod_{i=1}^{k}(1-b_{i}r)$ . Define

[TABLE]

so that $G(d,M)\geq\sum_{i=1}^{k}F(d,b_{i})$ . Hence it suffices to show that $F(d,b)$ is uniformly bounded below as $b$ runs over the complex plane and as $d$ runs over the range $1$ to $k^{2}$ .

Next, notice that $\log|1-br|\geq\log|1-\operatorname{Re}(b)r|$ , so $F(d,b)\geq F(d,|b|)$ and it suffices to give a lower bound for positive real values of $b$ . Also

[TABLE]

For $b\geq\frac{1}{2}$ , $F(b,d)\geq\frac{1}{b^{d}}\int_{0}^{2}\log|1-r|r^{d-1}\,dr\geq-2^{d}/b^{d}\geq-4^{d}$ . For $0<b<\frac{1}{2}$ , one has

[TABLE]

where $\alpha=2/(1+d)$ . This converges to $-2/(d+1)$ as $b$ approaches 0 from the right. By continuity and compactness, for each of the finitely many values of $d$ , $F(d,b)$ is bounded below as $b$ ranges over $(0,\frac{1}{2}]$ . ∎

Proposition 14.

Let $k>0$ . Then there exists a $C_{\ref{prop:step2tilde}}$ with the following property. For every finite sequence $A_{0},\ldots,A_{n-1}$ of Hilbert-Schmidt operators, let $\Delta_{0},\ldots,\Delta_{n-1}$ be independent copies of the perturbation $\Delta$ as described above. Let $A^{\epsilon}_{i}$ denote $A_{i}+\epsilon\Delta_{i}$ .

Then one has

[TABLE]

Proof.

We have

[TABLE]

We focus on giving a lower bound for one of the terms in the summation. We write such a term as

[TABLE]

This expectation should be interpreted as being conditioned on the values of $\Delta_{j+1},\ldots,\Delta_{n}$ , so that $L=(A_{n}+\epsilon\Delta_{n})\cdots(A_{j+1}+\epsilon\Delta_{j+1})$ .

The above expectation can be rewritten as:

[TABLE]

Once $\Delta$ and $\Delta^{\prime}$ are fixed, the inner expectation is

[TABLE]

Now let $\Pi$ be the orthogonal projection onto the orthogonal complement of the kernel of $\Pi_{k}\Delta L$ and $\Pi^{\prime}$ be the orthogonal projection onto the range of $R\Delta^{\prime}\Pi_{k}$ . Then we have

[TABLE]

Now the quantity in (4) is

[TABLE]

Applying Lemma 13 with $Q=\mathsf{HS}$ , this is bounded below by $-C$ , independently of $\Delta$ and $\Delta^{\prime}$ , so that the quantity in (3) is also bounded below by $-C$ . Since there are $n$ such terms, the statement in the lemma follows. ∎

6. Type II bad block perturbations

Here we give an argument for good blocks in the base that have large perturbations. We will obtain a drop in $\tilde{\Xi}_{k}$ over a bad block of size $O(\log\epsilon)$ at worst, that is a drop of size $O(1)$ per symbol since blocks are of length proportional to $|\log\epsilon|$ . However since the frequency of these blocks is $O(e^{-C/\epsilon})$ , the contribution of this drop to the singular values of a large string of blocks is minuscule.

Lemma 15.

There exists a constant $C>0$ such that if $N$ is a standard normal random variable and $\Lambda>2$ , then for each $a\in\mathbb{C}$ ,

[TABLE]

Before giving the proof, let us give a heuristic explanation for why this should be true. Conditional on $N\geq\Lambda$ , the distribution of $N$ is approximately $\Lambda+\text{Exp}(\Lambda)$ , that is it typically takes values that are $\Lambda+O(1/\Lambda)$ . The worst case for the inequality is approximately when $a=1/\Lambda$ and then the quantity inside the logarithm is roughly $O(1/\Lambda^{2})$ .

Proof.

We first recall that $\int_{0}^{a}\log x\,dx=a(\log a-1)$ , so that the average value of the logarithm function over $[0,a]$ is $\log a-1$ . We claim that for any interval $J$ , one has

[TABLE]

Indeed, this follows already for intervals $[0,a]$ with $0<a<1$ , and hence for sub-intervals of $[0,1]$ and $[-1,0]$ . For intervals $[-a,b]$ with $a<0<|a|\leq b\leq 1$ , we have $1/(a+b)\int_{-a}^{b}\log^{-}|x|\,dx\geq 1/b\int_{-b}^{b}\log^{-}|x|\,dx=2(\log b-1)$ . If the interval $J$ is entirely outside $[-1,1]$ , the inequality is trivial; and if $J$ intersects $[-1,1]$ , we have already established the inequality for $J\cap[-1,1]$ , from which the inequality for $J$ follows.

For $a\in\mathbb{C}$ , the integrand in the statement reduced if $a$ is replaced by $|a|$ so we may assume $a>0$ . If $a>2/\Lambda$ , the integral is 0.

If $1/(3\Lambda)\leq a\leq 2/\Lambda$ , let $I=[\Lambda,\frac{2}{a})$ , the sub-interval of $[\Lambda,\infty)$ where $\log|1-ax|<0$ ; and $J=[\frac{1}{a}-\frac{1}{a\Lambda^{2}},\frac{1}{a}+\frac{1}{a\Lambda^{2}}]$ , the interval where $\log|1-ax|<-2\log\Lambda$ .

The quantity to be bounded is

[TABLE]

The ratio of the two integrals over $I\setminus J$ is bounded below by $-2\log\Lambda$ . Using (5), the ratio of the two integrals over $I\cap J$ is bounded below by $2(-2\log\Lambda-1)\max_{I\cap J}e^{-x^{2}/2}/\min_{I\cap J}e^{-x^{2}/2}\geq 2e^{2/(a^{2}\Lambda^{2})}(-2\log\Lambda-1)\geq-2e^{18}(2\log\Lambda+1)$ . Since both ratios are bounded below by a constant multiple of $\log\Lambda$ , so is the ratio of the sums.

If $a<1/(3\Lambda)$ , we argue similarly. In this case, we let $J=[\frac{1}{2a},\frac{3}{2a}]$ . On $I\setminus J$ , $\log|1-ax|$ is bounded below by $-\log 2$ , so that

[TABLE]

On $I\cap J$ , we have $e^{-x^{2}/2}\leq e^{-1/(8a^{2})}$ . Also $\int_{\Lambda}^{\infty}e^{-x^{2}/2}\,dx\geq e^{-\Lambda^{2}/2}/(2\Lambda)$ , using [7, Theorem 1.2.3]. Hence

[TABLE]

using (5). When $a=1/(3\Lambda)$ , this is $4(-\log 2-1)3\Lambda^{2}e^{-5\Lambda^{2}/8}$ and the lower bound increases as $a$ is further reduced. Minimizing this expression over $\Lambda$ , we see that there is a $C$ , independent of $\Lambda$ , such that $\operatorname{\mathbb{E}}\big{(}\log^{-}|1-aN|\big{|}N\geq\Lambda\big{)}\geq-C$ for all $|a|<1/(3\Lambda)$ . ∎

Lemma 16.

Let $k>0$ and $\Delta$ be as throughout the article. There exists $C>0$ such that for all sufficiently small $\epsilon>0$ , for each $a,b$ and each pair of $k$ -dimensional orthogonal projections $\Pi$ and $\Pi^{\prime}$ ,

[TABLE]

where $\textsf{Wild}_{a,b}$ is the event that $\Delta$ satisfies $|\Delta_{l,m}|<(\frac{2}{3})^{l+m}\epsilon^{-1/2}$ for each $(l,m)$ that is lexicographically smaller than $(a,b)$ and $|\Delta_{a,b}|\geq\epsilon^{-1/2}(\frac{2}{3})^{a+b}$ (where $(l,m)$ is lexicographically smaller than $(a,b)$ if $l<a$ or $l=a$ and $m<b$ ).

Proof.

We deal with the case $\Delta_{a,b}$ positive. The case where it is negative is exactly analogous. Let $B_{a,b}$ be the collection of those $\Delta$ satisfying $\Delta_{a,b}\geq\epsilon^{-1/2}(\frac{2}{3})^{a+b}$ (and no other condition). The argument of Lemma 9 shows that $\mathbb{P}(\textsf{Wild}_{a,b}|B_{a,b})>\frac{1}{2}$ . This allows us to deduce as in the proof of Lemma 13 that

[TABLE]

Hence it suffices to show that

[TABLE]

Using the same reduction as in Lemma 13, the calculation reduces to showing that there is a $C$ such that for sufficiently small $\epsilon>0$ , one has for an arbitrary $k\times k$ multi-variate normal matrix-valued random variable, $R$ , whose entries have zero mean and for an arbitrary rank 1 $k\times k$ matrix $Y$ ,

[TABLE]

where $N$ is an independent standard normal random variable. First fixing $N$ and taking the expectation over $R$ using Lemma 13 (taking $Q$ to be the full range of $\Delta$ ), we obtain

[TABLE]

Hence it suffices to show

[TABLE]

Since $Y$ has rank 1, the polynomial $\det(I+tY)$ is of the form $1+at$ . To see this, notice the determinant is unchanged if $I+tY$ is conjugated by an orthogonal matrix, $O$ . Then choose $O$ so that the first column spans the range of $Y$ so that $O^{-1}(I+tY)O=I+t\tilde{Y}$ , where $\tilde{Y}$ has only one non-zero row. $\det(I+tY)$ is then $1+t\tilde{Y}_{1,1}$ . Hence we are seeking a lower bound for

[TABLE]

which is of the desired form by Lemma 15. ∎

Proposition 17.

There exists a $C_{\ref{prop:badtriangleineq}}>0$ with the following property. For any $m>0$ , let $B$ be the event that at least one of the perturbations $\Delta_{0},\ldots,\Delta_{m-1}$ is wild. Then

[TABLE]

Proof.

We write $B$ as $B_{0}\cup\ldots\cup B_{m-1}$ , where $B_{i}$ is the event that the $i$ th perturbation matrix is wild, and all previous ones are tame. Since the $B_{i}$ are disjoint, it suffices to establish that there is a $C>0$ such that for each $i$ ,

[TABLE]

We argue as in Proposition 14:

[TABLE]

As in Proposition 14, finding lower bounds for this reduces to finding lower bounds for $\operatorname{\mathbb{E}}\Big{(}\tilde{\Xi}_{k}(\Pi A^{\epsilon}_{\bar{\sigma}^{j}{\bar{\omega}}}\Pi^{\prime})-\tilde{\Xi}_{k}(\Pi A_{\sigma^{j}\omega}\Pi^{\prime})\Big{|}B_{i}\Big{)}$ .

In this case, for $j>i$ , the conditional distribution of $\Delta_{j}$ is the same as the distribution used in Lemma 13 with $Q=\mathsf{HS}$ , so that lemma gives a bound

[TABLE]

In the case $j<i$ , $\Delta_{j}$ is conditioned to be tame. By Lemma 9, this is a set of probability (much) greater than $\frac{1}{2}$ , so that Lemma 13 gives a similar bound to (7).

Finally, we address the term with $j=i$ . Given that $\Delta_{i}$ is wild, the probability that the first oversized entry occurs in the $(a,b)$ coordinate is $O(\exp(-\frac{1}{2}\epsilon^{-1}(2^{2a+2b}-1)))$ (as seen from the estimate $\mathbb{P}(N>t)\approx(2\pi)^{-1/2}e^{-t^{2}/2}/t$ for large $t$ [7, Theorem 1.2.3]).

Hence by conditioning and using Lemma 16, we obtain

[TABLE]

Combining equations (7) and the equation (8), we obtain the statement of the proposition. ∎

7. Joining good and bad blocks

Lemma 18.

For all $k\in\mathbb{N}$ , there is a constant $C>0$ such that for any $A\in\mathsf{HS}$ , any orthogonal projections $\Pi_{1}$ and $\Pi_{2}$ onto $k$ -dimensional subspaces, and any $Q\subset\mathsf{HS}$ such that $\mathbb{P}(\Delta\in Q)\geq\frac{1}{2}$ , one has

[TABLE]

Proof.

Let $\tilde{\Pi}_{1}$ be an isometry from the range of $\Pi_{1}$ to $\mathbb{R}^{k}$ . Similarly let $\tilde{\Pi}_{2}$ be the post-composition of $\Pi_{2}$ with an isometry from $\mathbb{R}^{k}$ to the span of the range of $\Pi_{2}$ . Let $\tilde{A}=\tilde{\Pi}_{1}A\tilde{\Pi}_{2}$ and let $\tilde{\Delta}=\tilde{\Pi}_{1}\Delta\tilde{\Pi}_{2}$ be the $k\times k$ multi-variate normal induced from the unconditioned distribution of $\Delta$ .

As in Lemma 13, we radially disintegrate the random variables $\tilde{\Delta}$ , writing $\tilde{\Delta}$ as $t\tilde{M}$ , where $\tilde{M}$ belongs to a ‘unit sphere’ equipped with a normalized probability measure and $t$ having an absolutely continuous distribution on $[0,\infty)$ with density $r^{k^{2}-1}e^{-r^{2}/2}/\Gamma(k^{2}/2)$ . On conditioning on $\Delta\in Q$ , the density is bounded above by $2r^{k^{2}-1}e^{-r^{2}/2}/\Gamma(k^{2}/2)$ We prove that there is a $C>0$ such that for all $\tilde{M}$ of rank $k$ ,

[TABLE]

Notice that since the matrices are $k\times k$ , $\Xi_{k}$ is just the logarithm of the absolute value of the determinant. Let $p(r)=\det(\tilde{A}+r\tilde{M})/\det(r\tilde{M})$ , a polynomial in powers of $1/r$ of degree at most $k$ with constant coefficient 1. It can therefore be expressed as $p(r)=\prod_{i=1}^{d}(1-b_{i}/r)$ , with $d\leq k$ .

We are trying to bound

[TABLE]

As in the proof of Lemma 15, it suffices to give a bound in the case where $b>0$ . We have

[TABLE]

The logarithm is bounded below by $-\log 2$ on $(2b,\infty)$ , so that the contribution from this range is at least $-\Gamma(k^{2}/2)\log 2$ . For the contribution from the range $[\frac{b}{2},2b]$ , we have a lower bound of $-16(2b)^{k^{2}-2}e^{-b^{2}/8}$ (obtained by bounding $e^{-r^{2}/2}$ above by $e^{-b^{2}/8}$ ). Hence we obtain the required uniform lower bound. ∎

The following lemma plays a key role, as it provides an approximate super-additivity property for $\tilde{\Xi}_{k}$ (making strong use of the nature of the perturbations), complementing the well-known sub-additivity property of $\Xi_{k}$ .

Lemma 19.

There exists $C>0$ such that if $\Delta$ is distributed as above and $Q$ is any subset of $\mathsf{HS}$ such that $\mathbb{P}(Q\in\Delta)\geq\frac{1}{2})$ , then

[TABLE]

Proof.

We may assume that $L$ and $R$ have rank at least $k$ as otherwise there is nothing to prove. Recalling the definition of $\tilde{\Xi}$ , we have

[TABLE]

We first show that for fixed $\Delta_{1}$ and $\Delta_{2}$ ,

[TABLE]

We have $\Xi_{k}(\Pi_{k}\Delta_{1}L(A+\epsilon\Delta)R\Delta_{2}\Pi_{k})=\Xi_{k}(\Pi_{k}\Delta_{1}L)+\Xi_{k}(\overline{\Pi}(A+\epsilon\Delta)\overline{\overline{\Pi}})+\Xi_{k}(R\Delta_{2}\Pi_{k})$ and $\Xi_{k}(\Pi_{k}\Delta_{1}L(\epsilon\Delta)R\Delta_{2}\Pi_{k})=\Xi_{k}(\Pi_{k}\Delta_{1}L)+\Xi_{k}(\overline{\Pi}(\epsilon\Delta)\overline{\overline{\Pi}})+\Xi_{k}(R\Delta_{2}\Pi_{k})$ , where $\overline{\Pi}$ is the orthogonal projection onto the $k$ -dimensional orthogonal complement of the kernel of $\Pi_{k}\Delta_{1}L$ and $\overline{\overline{\Pi}}$ is the orthogonal projection onto the range of $R\Delta_{2}\Pi_{k}$ . Hence

[TABLE]

Taking an expectation as $\Delta$ runs over $Q$ and using Lemma 18, we obtain (9). Hence, taking the expectation over $\Delta_{1}$ and $\Delta_{2}$ , we have

[TABLE]

For the last part of the argument, we have

[TABLE]

where $\overline{\Pi}$ and $\overline{\overline{\Pi}}$ are as above. By Corollary 4, the middle term is $\operatorname{\mathbb{E}}_{\Delta_{3}}\Xi_{k}(\overline{\Pi}\Delta_{3}\Pi_{k})+\operatorname{\mathbb{E}}_{\Delta_{4}}\Xi_{k}(\Pi_{k}\Delta_{4}\overline{\overline{\Pi}})\pm C$ . Substituting and recombining the expressions, we get

[TABLE]

as required. ∎

Since the statement includes the case where $\Delta$ is conditioned to lie in a large set, this is sufficient to cover the case where $\Delta$ is conditioned to be tame. We need a version of this inequality to deal with the case where $\Delta$ is constrained to be wild.

Lemma 20.

There exists $C>0$ such that for all polynomials, $p(x)$ , one has

[TABLE]

where $M(p)$ is the Mahler measure of $p$ : If $p(x)=a(x-z_{1})(x-z_{2})\cdots(x-z_{k})$ , then $M(p)=a\prod_{|z_{i}|>1}|z_{i}|$ .

Proof.

Write $p(x)$ as $a(x-z_{1})\cdots(x-z_{k})$ . The inequality then follows from

[TABLE]

While we will not give all the details, the idea is to notice that the integral can be expressed as $\operatorname{\mathbb{E}}\log|N-z|$ where $N$ is a standard normal random variable. If $z$ is small, then this is the integral of a function with a logarithmic singularity. If $z$ is large, then since $N$ is concentrated near 0, the integrand is close to $\log|z|$ with very high probability. ∎

Lemma 21.

For each $k>0$ , there exists a constant $C$ such that for each polynomial $p(x)=\sum_{i=0}^{k}a_{i}x^{i}$ , one has

[TABLE]

The proof can be found in Lang’s book [22, Theorem 2.8].

Lemma 22.

Let $\Lambda>2$ and let $N$ be a standard normal random variable. There exists a $C>0$ such that for all $a,b\in\mathbb{C}$ ,

[TABLE]

Proof.

The case where $|a|>|b|$ follows from Lemma 15 (writing $\log|a+bN|=\log|a|+\log|1+\frac{b}{a}N|$ ). If $|b|\geq|a|$ , then $|a+bN|\geq|b|\Lambda/2$ whenever $N>\Lambda$ . The result follows. ∎

Lemma 23.

There exists a constant $C>0$ such that for all $i,j$ ,

[TABLE]

where $\mathsf{Wild}_{i,j}$ is the event that $|\Delta_{i,j}|\geq(\frac{2}{3})^{i+j}\epsilon^{-1/2}$ and $|\Delta_{a,b}|<(\frac{2}{3})^{a+b}\epsilon^{-1/2}$ for all pairs $(a,b)$ that are lexicographically smaller than $(i,j)$ .

Proof.

As in the proof of Lemma 19, the proof reduces to showing a version of Lemma 18:

[TABLE]

We first compare $\operatorname{\mathbb{E}}\Xi_{k}\big{(}\Pi_{1}(A+\epsilon\Delta)\Pi_{2}\big{|}\mathsf{Wild}_{i,j}\big{)}$ to $\operatorname{\mathbb{E}}\Xi_{k}\big{(}\Pi_{1}(A+\epsilon\Delta)\Pi_{2}\big{|}\mathsf{Tame}_{i,j}\big{)}$ , where $\mathsf{Tame}_{i,j}$ is the event that $|\Delta_{a,b}|<(\frac{2}{3})^{a+b}\epsilon^{-1/2}$ for all pairs $(a,b)$ that are lexicographically smaller than $(i,j)$ . Fixing all entries of $\Delta$ other than $\Delta_{i,j}$ , this amounts to comparing $\operatorname{\mathbb{E}}\big{(}\log|\det(B+NZ)|\big{|}N>2^{i+j}\epsilon^{-1/2}\big{)}$ to $\operatorname{\mathbb{E}}\big{(}\log|\det(B+NZ)|\big{)}$ , where $B$ is an invertible $k\times k$ matrix and $Z$ is rank 1. As pointed out in Lemma 16, $\det(B+NZ)=a+bN$ for constants $a$ and $b$ , so that it suffices to compare $\operatorname{\mathbb{E}}\big{(}\log|a+bN|\big{|}N>2^{i+j}\epsilon^{-1/2}\big{)}$ to $\operatorname{\mathbb{E}}\log|a+bN|$ . By Lemma 22, the first of these is at least $\max(\log|a|,\log|b|)-C(i+j+\log\epsilon)$ and by Lemmas 20 and 21, the second of these is within $C$ of $\max(\log|a|,\log|b|)$ . We deduce that

[TABLE]

Hence, using the same cancellation argument that occurs in Lemma 19, we have

[TABLE]

Finally using Lemma 19 to bound $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(L(A+\epsilon\Delta)R|\textsf{Tame}_{i,j})$ , the result follows. ∎

Proposition 24.

There exists $C_{\ref{prop:splitting}}>0$ with the following property: Let $L$ , $R$ , and $A$ be Hilbert-Schmidt operators and let $\Delta$ be the multivariate normal perturbation described earlier. Then each of $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(L(A+\epsilon\Delta)R)$ , $\operatorname{\mathbb{E}}\big{(}\tilde{\Xi}_{k}(L(A+\epsilon\Delta)R)\big{|}\Delta\text{ is wild}\big{)}$ and $\operatorname{\mathbb{E}}\big{(}\tilde{\Xi}_{k}(L(A+\epsilon\Delta)R)\big{|}\Delta\text{ is tame}\big{)}$ is bounded below by $\tilde{\Xi}_{k}(L)+\tilde{\Xi}_{k}(R)+C_{\ref{prop:splitting}}\log\epsilon$ .

Proof.

The cases of $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(L(A+\epsilon\Delta)R)$ , $\operatorname{\mathbb{E}}\big{(}\tilde{\Xi}_{k}(L(A+\epsilon\Delta)R)\big{|}\Delta\text{ is tame}\big{)}$ are handled by Lemma 19. The case of $\operatorname{\mathbb{E}}\big{(}\tilde{\Xi}_{k}(L(A+\epsilon\Delta)R)\big{|}\Delta\text{ is wild}\big{)}$ is handled using Lemma 23 by conditioning on the first entry of $\Delta$ that is large analogously to the end of the proof of Proposition 17. ∎

8. Comparison of $\Xi_{k}$ and $\tilde{\Xi}_{k}$

Lemma 25.

Let $C_{k}$ be the expected value of $\log|\det N_{k}|$ where $N_{k}$ is a $k\times k$ matrix-valued random variable with independent standard normal entries. Let $n\geq k$ , let $A$ be an $n\times n$ matrix and let $N$ be a $k\times n$ matrix-valued random variable with independent standard normal entries. Then $\operatorname{\mathbb{E}}\Xi_{k}(NA)\geq\Xi_{k}(A)+C_{k}$ .

Proof.

Write $A=UDV$ where $U$ and $V$ are orthogonal and $D$ is diagonal with decreasing entries. Then by an argument like that in Lemma 3 (computing covariances between elements) $NU$ has the same distribution as $N$ , so that we have $\operatorname{\mathbb{E}}\Xi_{k}(NA)=\operatorname{\mathbb{E}}\Xi_{k}(NUDV)=\operatorname{\mathbb{E}}\Xi_{k}(ND)\geq\operatorname{\mathbb{E}}\Xi_{k}(ND\Pi_{k})$ . Notice that since $D$ is diagonal, $ND\Pi_{k}$ has the form $\begin{pmatrix}N_{k}D_{k}|0\end{pmatrix}$ , where $N_{k}$ is the left $k\times k$ submatrix of $N$ and $D_{k}$ is the top left $k\times k$ submatrix of $D$ . Hence $\operatorname{\mathbb{E}}\Xi_{k}(ND\Pi_{k})=\operatorname{\mathbb{E}}\Xi_{k}(N_{k}D_{k})=C_{k}+\Xi_{k}(D_{k})=C_{k}+\Xi_{k}(A)$ as required. ∎

Lemma 26.

Let $A$ , $B$ and $C$ be Hilbert-Schmidt matrices, and let $A_{n}=\Pi_{n}A\Pi_{n}$ . Then $\Xi_{k}(BA_{n}C)\to\Xi_{k}(BAC)$ as $n\to\infty$ .

Proof.

Let $R_{n}=A-A_{n}$ , so that $\|R_{n}\|\to 0$ . We have $|s_{i}(BA_{n}C)-s_{i}(BAC)|\leq\|B\|\cdot\|R_{n}\|\cdot\|C\|$ for each $i$ so that $s_{i}(BA_{n}C)\to s_{i}(BAC)$ for each $i$ . The conclusion follows. ∎

Proposition 27.

Let $k>0$ . Then there exists a constant $C_{\ref{prop:XikvsXiktilde}}$ such that for an arbitrary Hilbert-Schmidt operator $A$ on $H$ ,

[TABLE]

Proof.

We have $\tilde{\Xi}_{k}(A)=\operatorname{\mathbb{E}}_{\Delta,\Delta^{\prime}}\Xi_{k}(\Pi_{k}\Delta A\Delta^{\prime}\Pi_{k})$ where $\Delta$ and $\Delta^{\prime}$ are independent copies of the perturbation operator. Since $\Xi_{k}(\Pi_{k}\Delta A_{n}\Delta^{\prime}\Pi_{k})\leq k\log\|\Pi_{k}\Delta A_{n}\Delta^{\prime}\Pi_{k}\|_{\mathsf{op}}\leq k\log(\|\Delta\|_{\mathsf{op}}\cdot\|A_{n}\|_{\mathsf{op}}\cdot\|\Delta^{\prime}\|_{\mathsf{op}})$ ; $\|A_{n}\|_{\mathsf{op}}\leq\|A\|_{\mathsf{HS}}$ and $\operatorname{\mathbb{E}}\log\|\Delta\|_{\mathsf{op}}<\operatorname{\mathbb{E}}\|\Delta\|_{\mathsf{op}}\leq\operatorname{\mathbb{E}}\|\Delta\|_{\mathsf{HS}}<\infty$ , we see that the family of functions, $(\Delta,\Delta^{\prime})\mapsto\Xi_{k}(\Pi_{k}\Delta A_{n}\Delta^{\prime}\Pi_{k})$ is dominated by an integrable function. Hence, by the Reverse Fatou Lemma and Lemma 26, we have

[TABLE]

However, we have

[TABLE]

where $\Delta_{k\times n}$ denotes the random Hilbert Schmidt operator $\Delta$ with all entries outside the top left $k\times n$ corner replaced by 0’s (and $\Delta^{\prime}_{n\times k}$ similarly). Hence

[TABLE]

Applying Lemma 25 twice, we deduce $\tilde{\Xi}_{k}(A_{n})\geq\Xi_{k}(\mathcal{D}_{3}A_{n}\mathcal{D}_{3})+C$ , so that on taking the limit, we deduce $\tilde{\Xi}_{k}(A)\geq\Xi_{k}(\mathcal{D}_{3}A\mathcal{D}_{3})+C$ as required. ∎

Corollary 28.

There is a $C_{\ref{cor:twoglue}}$ with the following property. Let $L$ , $R$ , $A$ and $A^{\prime}$ be Hilbert-Schmidt operators and $\Delta$ and $\Delta^{\prime}$ be independent copies of the standard perturbation. Then we have

[TABLE]

The same inequality holds if either or both of $\Delta$ and $\Delta^{\prime}$ are constrained to be either tame or wild (or one of each).

Proof.

Let $L^{\prime}=L(A^{\prime}+\epsilon\Delta^{\prime})=L(A^{\prime}+\epsilon\Delta^{\prime})I$ . By Proposition 24, $\operatorname{\mathbb{E}}_{\Delta^{\prime}}\tilde{\Xi}_{k}(L^{\prime})\geq\tilde{\Xi}_{k}(L)+\tilde{\Xi}_{k}(I)+C_{\ref{prop:splitting}}\log\epsilon$ , with this inequality still satisfied if $\Delta^{\prime}$ is constrained to be tame or wild. By Proposition 27, $\tilde{\Xi}_{k}(I)$ is a finite constant. Finally, $\operatorname{\mathbb{E}}_{\Delta}\tilde{\Xi}_{k}(L^{\prime}(A+\epsilon\Delta)R)\geq\tilde{\Xi}_{k}(L^{\prime})+\tilde{\Xi}_{k}(R)+C_{\ref{prop:splitting}}\log\epsilon$ . Combining the inequalities, the result is proved. ∎

Lemma 29.

Let $f(t)=\sum_{i=1}^{n}a_{i}e^{b_{i}t}$ where $a_{i}>0$ for each $i$ . Then $f(t)$ is log-convex.

Proof.

We have $(\log f)^{\prime}=f^{\prime}/f$ , so that $(\log f)^{\prime\prime}=(ff^{\prime\prime}-(f^{\prime})^{2})/f^{2}$ . Now

[TABLE]

∎

Lemma 30.

Let $V$ be a $k$ -dimensional subspace of $H$ and let $\Pi_{V}$ be the orthogonal projection onto $V$ . Then $f(s):=\Xi_{k}(\mathcal{D}_{e^{s}}\circ\Pi_{V})$ is a convex function.

Proof.

We first prove that for $0<s<t$ , $f(s)\leq\frac{s}{t}f(t)$ . To see this, let $v_{1},\ldots,v_{k}$ be an orthogonal basis of $V$ such that $\mathcal{D}_{e^{t}}v_{1},\ldots,\mathcal{D}_{e^{t}}v_{k}$ are orthogonal. Then $f(s)\leq\sum_{i=1}^{k}\log\|\mathcal{D}_{e^{s}}v_{i}\|$ . By Lemma 29, $s\mapsto\log\|\mathcal{D}_{e^{s}}v_{i}\|=\frac{1}{2}\log(\sum_{j}e^{-2sj}{(v_{i})_{j}}^{2})$ is convex, so that $\log\|\mathcal{D}_{e^{s}}v_{i}\|\leq\frac{s}{t}\log\|\mathcal{D}_{e^{t}}v_{i}\|$ . Hence $f(s)\leq\frac{s}{t}f(t)$ as claimed.

Now if $0<a<b<c$ , let $W=\mathcal{D}_{e^{a}}V$ , let $s=b-a$ and $t=c-a$ . Let $\alpha=\Xi_{k}(\mathcal{D}_{e^{a}}\Pi_{V})$ . Now we have $f(a)=\alpha$ , $f(b)=\Xi_{k}(\mathcal{D}_{e^{b}}\Pi_{V})=\Xi_{k}(\mathcal{D}_{e^{b-a}}\mathcal{D}_{e^{a}}\Pi_{V})=\Xi_{k}(\mathcal{D}_{e^{b-a}}\Pi_{W})+\Xi_{k}(\mathcal{D}_{e^{a}}\Pi_{V})=\alpha+\Xi_{k}(\mathcal{D}_{e^{s}}\Pi_{W})$ . Similarly $f(c)=\alpha+\Xi_{k}(\mathcal{D}_{e^{t}}\Pi_{W})$ and the result follows from the above. ∎

Lemma 31.

Let $A$ be a Hilbert-Schmidt operator on $H$ . Then $g(s)\colon=\Xi_{k}(\mathcal{D}_{e^{s}}A)$ is a convex function. Similarly $h(s)\colon=\Xi_{k}(A\mathcal{D}_{e^{s}})$ is convex.

Proof.

Let $0<a<b<c$ . Let $V$ be the $k$ -dimensional space spanned by the top $k$ right singular vectors of $\mathcal{D}_{e^{b}}A$ and $\Pi_{V}$ be the orthogonal projection onto $V$ . Let $W=A(V)$ and $\Pi_{W}$ be the orthogonal projection onto $W$ . Then we have $\Xi_{k}(\mathcal{D}_{e^{t}}A\Pi_{V})=\Xi_{k}(\mathcal{D}_{e^{t}}\Pi_{W})+\Xi_{k}(A\Pi_{V})$ , the sum of a convex function and a constant by Lemma 30. Now $g(b)=\Xi_{k}(\mathcal{D}_{e^{b}}A)=\Xi_{k}(\mathcal{D}_{e^{b}}A\Pi_{V})\leq\frac{c-b}{c-a}\Xi_{k}(\mathcal{D}_{e^{a}}A\Pi_{V})+\frac{b-a}{c-a}\Xi_{k}(\mathcal{D}_{e^{c}}A\Pi_{V})\leq\frac{c-b}{c-a}g(a)+\frac{b-a}{c-a}g(c)$ as required.

We have $h(s)=\Xi_{k}(A\mathcal{D}_{e^{s}})=\Xi_{k}(\mathcal{D}_{e^{s}}A^{*})$ , which is convex by the above. ∎

Proposition 32.

Let $A$ be a Hilbert-Schmidt operator on $H$ . Then

[TABLE]

Proof.

Let $f(s,t)=\Xi_{k}(\mathcal{D}_{e^{s}}A\mathcal{D}_{e^{t}})-\Xi_{k}(A)$ . Since $\mathcal{D}_{a}$ is contractive for $a>1$ , we have $f(\log 3,0)\leq 0$ and $f(0,\log 2)\leq 0$ . Now Lemma 31 applied to $\Xi_{k}(\mathcal{D}_{3}A\mathcal{D}_{e^{t}})-\Xi_{k}(A)$ implies that $f(\log 3,\log 2)\leq\frac{\log 2}{\log 3}f(\log 3,\log 3)$ . Applying the lemma to $\Xi_{k}(\mathcal{D}_{e^{s}}A\mathcal{D}_{2})-\Xi_{k}(A)$ implies

[TABLE]

as required. ∎

Lemma 33.

Let $\sigma$ be an ergodic measure-preserving transformation of $(\Sigma,\mathbb{P})$ . Let $(f_{n})$ be a sub-additive sequence of functions (that is $f_{n+m}(\omega)\leq f_{n}(\sigma^{m}\omega)+f_{m}(\omega)$ for each $\omega\in\Omega$ and $n,m>0$ ) such that $\inf_{n>0}\int\frac{1}{n}f_{n}\,d\mathbb{P}>-\infty$ . For any $\epsilon>0$ , there exist $\chi>0$ and $n_{0}$ such that if $M\geq n_{0}$ and $A$ is any set with $\mathbb{P}(A)<\chi$ then $\int_{A}f_{M}\,d\mathbb{P}>-\epsilon M$ .

Proof.

Let $\alpha=\lim\int(f_{n}/n)\,d\mathbb{P}$ . Let $\epsilon>0$ be given. Let $\chi$ be small enough that $\int_{B}f_{1}\,d\mathbb{P}<\frac{\epsilon}{3}$ for any set $B$ with $\mathbb{P}(B)\leq\chi$ and so that $2\chi(\alpha+\frac{\epsilon}{3})>-\frac{\epsilon}{3}$ . By the Kingman sub-additive ergodic theorem, there exists $m_{0}$ such that for $M\geq m_{0}$ , $\mathbb{P}(\{\omega\colon f_{M}(\omega)>(\alpha+\frac{\epsilon}{3})M\})<\chi$ .

Now let $A$ be an arbitrary set with $\mathbb{P}(A)<\chi$ . We split $\Omega$ into three sets: $A$ , $G=\{\omega\in A^{c}\colon f_{M}(\omega)\leq(\alpha+\frac{\epsilon}{3})M\}$ and $B=A^{c}\setminus G$ (and note that $\mathbb{P}(G^{c})\leq 2\chi$ ). Now we have

[TABLE]

Hence we see

[TABLE]

as required. ∎

Lemma 34.

For all $k$ , there exists a $C_{\ref{lem:trivial}}$ such that for any bounded operator $A$ one has

[TABLE]

Proof.

We have $\tilde{\Xi}_{k}(A)=\operatorname{\mathbb{E}}_{\Delta_{1},\Delta_{2}}\Xi_{k}(\Pi_{k}\Delta_{1}A\Delta_{2}\Pi_{k})\leq 2\operatorname{\mathbb{E}}\Xi_{k}(\Delta)+\Xi_{k}(A)\leq 2k\operatorname{\mathbb{E}}\log\|\Delta\|_{\mathsf{op}}+\Xi_{k}(A)$ , where we used sub-additivity of $\Xi_{k}$ for the first inequality and the fact that $s_{i}(B)\leq\|B\|_{\mathsf{op}}$ for the second. Hence it suffices to show that $\operatorname{\mathbb{E}}\log\|\Delta\|_{\mathsf{op}}<\infty$ . But $\operatorname{\mathbb{E}}\log\|\Delta\|_{\mathsf{op}}\leq\operatorname{\mathbb{E}}\|\Delta\|_{\mathsf{op}}\leq\operatorname{\mathbb{E}}\|\Delta\|_{\mathsf{HS}}\leq\sum_{i,j}\operatorname{\mathbb{E}}|\Delta_{ij}|=\sum_{i,j}3^{-(i+j)}\operatorname{\mathbb{E}}|N|<\infty$ . ∎

9. Convergence of the Lyapunov exponents

Proof of Theorem A.

Rather than control the exponents directly, it is more straightforward, and clearly equivalent, to control the partial sums of the exponents. Let $\mu_{1}(A)\geq\mu_{2}(A)\geq\ldots$ denote the Lyapunov exponents of the cocycle $A$ listed with multiplicity in decreasing order. We then let $\Lambda_{k}(A)=\mu_{1}(A)+\ldots+\mu_{k}(A)$ . We are aiming to show that $\Lambda_{k}(A^{\epsilon})\to\Lambda_{k}(A)$ for each $k$ . By an argument of Ledrappier and Young [23], explained slightly differently in our earlier paper [10], it suffices to show that $\epsilon\mapsto\Lambda_{k}(A^{\epsilon})$ is upper semi-continuous for each $k$ ; and lower semi-continuous for those $k$ such that $\mu_{k+1}(A)<\mu_{k}(A)$ .

9.1. Upper semi-continuity

We shall show $\limsup_{\epsilon\to 0}\Lambda_{k}(A^{\epsilon})\leq\Lambda_{k}(A)$ . To see this, let $\eta>0$ . By the sub-additive ergodic theorem, there exists an $n$ such that $\frac{1}{n}\int\Xi_{k}(A^{(n)}_{\omega})\,d\mathbb{P}(\omega)<\Lambda_{k}(A)+\eta$ . As $\epsilon\to 0$ , we have $\|{A^{\epsilon}}_{{\bar{\omega}}}^{(n)}-A^{(n)}_{\omega}\|\to 0$ and hence $\Xi_{j}({A^{\epsilon}}_{{\bar{\omega}}}^{(n)})\to\Xi_{j}(A^{(n)}_{\omega})$ for all ${\bar{\omega}}\in\bar{\Omega}$ . Set $g({\bar{\omega}})=1+\|A(\omega_{0})\|$ and $h({\bar{\omega}})=\|\Delta_{0}\|$ . Then for $\epsilon<1$ , $\log\|{A^{\epsilon}}_{{\bar{\omega}}}^{(n)}\|\leq\sum_{i=0}^{n-1}\log(g+h)(\bar{\sigma}^{i}{\bar{\omega}})$ . Since this is integrable, the Reverse Fatou Lemma implies that $\limsup_{\epsilon\to 0}\frac{1}{n}\int\Xi_{j}({A^{\epsilon}}_{{\bar{\omega}}}^{(n)})\,d\bar{\mathbb{P}}({\bar{\omega}})<\Lambda_{k}(A)+\eta$ . Hence $\Lambda_{k}(A^{\epsilon})<\Lambda_{k}(A)+\eta$ for sufficiently small $\epsilon$ .

9.2. Choice of Parameters

Now we move to showing the lower semi-continuity of $\Lambda_{k}(A^{\epsilon})$ in the case where $\mu_{k+1}(A)<\mu_{k}(A)$ . We assume without loss of generality (by scaling the entire cocycle by a constant if necessary) that $\mu_{k+1}(A)<0<\mu_{k}(A)$ .

Let $\eta>0$ . We are seeking an $\epsilon_{0}$ such that for $\epsilon<\epsilon_{0}$ , $\Lambda_{k}(A^{\epsilon})>\Lambda_{k}(A)-\eta$ . First, choose an $n_{0}$ and $\chi$ such that the following inequalities are satisfied:

[TABLE]

That $n_{0}$ and $\chi$ can be chosen to satisfy the third inequality is a consequence of Lemma 33. Let $\delta$ be chosen so that $\mathbb{P}(G^{c})<\chi/2$ , where $G$ is the event that the block $A^{(N)}_{\omega}$ is good as in Lemma 8. Let $\epsilon_{1}$ be chosen so that $N_{\epsilon}:=\lfloor C_{\ref{lem:goodPert}}|\log\epsilon|\rfloor>n_{0}$ for all $\epsilon<\epsilon_{1}$ . Let $\epsilon_{2}$ be such that the probability that an $N_{\epsilon}$ -block of $\Delta$ ’s contains a wild perturbation is less than $\chi/2$ for all $\epsilon<\epsilon_{2}$ (such an $\epsilon_{2}$ exists by Lemma 9). Let $\bar{G}=\{{\bar{\omega}}\in\bar{\Omega}:\omega\in G;\Delta_{0},\dots,\Delta_{N-1}\text{ are tame}\}$ . We will only consider $\epsilon$ ’s that are smaller than $\epsilon_{1}$ and $\epsilon_{2}$ for the remainder of the argument. In particular $\bar{\mathbb{P}}(\bar{G}^{c})<\chi$ .

We need to control $\operatorname{\mathbb{E}}\Xi_{k}(A^{(nN)}_{{\bar{\omega}}})$ , where $N$ is the length of a block (as given by Lemma 9), and we let $n\to\infty$ . Here and below, the superscript $\epsilon$ indicates that we are studying the perturbed cocycle.

9.3. Replacing $\Xi_{k}$ with $\tilde{\Xi}_{k}$

We have

[TABLE]

by Lemma 34. The advantage of $\tilde{\Xi}_{k}$ over $\Xi_{k}$ is that it admits a lower bound in terms of sub-blocks.

9.4. Splitting ${A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}$ into good and bad blocks

Recall a block ${A^{\epsilon}}_{\bar{\sigma}^{jN}{\bar{\omega}}}^{(N)}$ is said to be good if $\sigma^{jN}{\bar{\omega}}\in\bar{G}$ , that is the unperturbed cocycle is well-behaved, and the perturbations are tame. Given ${\bar{\omega}}$ , we split up ${A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}$ into blocks of length $N$ . Whenever three or more consecutive blocks are good, we form a super-block, $G^{\epsilon}$ , consisting of the concatenation of the good blocks other than the first and last good blocks. All of the remaining blocks are called filler blocks. The $B^{\epsilon}$ are the filler blocks stripped of their first and last matrices.

We have

[TABLE]

where the splitting in the last line is into super-blocks (of variable length, all a multiple of $N$ ), here designated by $G^{\epsilon}$ , and filler blocks, $B^{\epsilon}$ , all of length $N-2$ and $E_{1}$ denotes an expected error term that we now estimate.

To obtain (11), we split the concatenation of $n$ blocks of length $N$ into the super-blocks and filler blocks as described above by repeatedly applying Proposition 24, which sacrifices a single matrix as ‘glue’ at each splitting site (or Corollary 28 in the case of two consecutive filler blocks when two matrices are sacrificed). Since the expected number of non-good $N$ -blocks is less than $\chi n$ and each such block gives rise to at most 4 transitions between adjacent blocks in the concatenation (the worst case happens when two super-blocks are joined by three fillers), we deduce $E_{1}\leq 4\chi n\max(C_{\ref{prop:splitting}},C_{\ref{cor:twoglue}})|\log\epsilon|$ . From Lemma 9, $|\log\epsilon|\leq 2N/C_{\ref{lem:goodPert}}$ , so that

[TABLE]

9.5. Comparison of $\tilde{\Xi}_{k}(G^{\epsilon})$ and $\Xi_{k}(G^{\epsilon})$

To bound one of the $\tilde{\Xi}_{k}(G^{\epsilon})$ , the contribution from one of the super-blocks, we first compare to $\Xi_{k}(G^{\epsilon})$ , the corresponding contribution to the genuine singular values; and then compare to $\Xi_{k}(G^{0})$ , the singular values of the unperturbed block. Recall that each $G^{\epsilon}$ is preceded by an $N$ -block $L^{\epsilon}$ and followed by an $N$ -block $R^{\epsilon}$ such that the enlarged block $L^{\epsilon}G^{\epsilon}R^{\epsilon}$ consists entirely of good blocks.

For the first comparison, we have

[TABLE]

using Propositions 27 and 32 respectively. Now

[TABLE]

by sub-additivity, and

[TABLE]

where we made use of Proposition 12 for the second inequality (Lemmas 7(c) and 8(a), (b) and (c) were used to ensure the hypotheses of that Proposition are satisfied). Combining inequalities (13), (14) and (15), we obtain

[TABLE]

By Lemmas 5(c), 8(d) and 9, we have $\Xi_{k}(L^{\epsilon})$ and $\Xi_{k}(R^{\epsilon})$ are non-negative. By Lemma 8(e), using sub-additivity, we have $\Xi_{k}(L^{\epsilon}\mathcal{D}_{2}^{-1}),\Xi_{k}(\mathcal{D}_{2}^{-1}R^{\epsilon})\leq 2kN\int\log(1+\|A_{\omega}\|_{\mathsf{SHS}})\,d\mathbb{P}(\omega)$ . Hence for each good block, we have

[TABLE]

9.6. Comparison of $\Xi_{k}(G^{\epsilon})$ and $\Xi_{k}(G^{0})$

Next, by Proposition 10, we have $\Xi_{k}(G^{\epsilon})\geq\Xi_{k}(G^{0})+2k\ell\log\delta$ , where $\ell$ is the number of blocks forming the $G^{\epsilon}$ super-block, so that overall, for each good block, we have

[TABLE]

where $G^{0}$ is the corresponding unperturbed block.

In summary,

[TABLE]

where $E_{2}$ is the combined contribution of the errors coming from good blocks via (16).

9.7. Comparison of $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(B^{\epsilon})$ and $\tilde{\Xi}_{k}(B^{0})$

We next work on giving a lower bound for the terms of the form $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(B^{\epsilon})$ . It turns out to be convenient to bound this in the opposite order than the way we obtained bounds for $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(G^{\epsilon})$ . Namely, we show $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(B^{\epsilon})\gtrsim\tilde{\Xi}_{k}(B^{0})\gtrsim\Xi_{k}(B^{0})$ .

If the filler block $B^{\epsilon}={A^{\epsilon}}_{\bar{\sigma}^{jN+1}{\bar{\omega}}}^{(N-2)}$ is not type II bad, we have $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(B^{\epsilon})\geq\tilde{\Xi}_{k}(B^{0})-C_{\ref{prop:step2tilde}}N$ by Proposition 14, where $B^{0}=A^{(N-2)}_{\sigma^{jN+1}\omega}$ , the unperturbed block. When $B^{\epsilon}$ is type II bad, we have $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(B^{\epsilon})\geq\tilde{\Xi}_{k}(B^{0})+C_{\ref{prop:badtriangleineq}}(\log\epsilon-N)$ by Proposition 17. Since by Lemma 9, we have $\log\epsilon>-2N/C_{\ref{lem:goodPert}}$ , we get $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(B^{\epsilon})\geq\tilde{\Xi}_{k}(B^{0})-C_{\ref{prop:badtriangleineq}}N(1+2/C_{\ref{lem:goodPert}})$ in this case. We therefore have in either case that

[TABLE]

9.8. Comparison of $\tilde{\Xi}_{k}(B^{0})$ and $\Xi_{k}(B^{0})$

For the estimate $\tilde{\Xi}_{k}(B^{0})\gtrsim\Xi_{k}(B^{0})$ , we use an argument similar to that in (13) and (14) above. Namely, let the matrices preceding and following $B^{0}$ in the unperturbed cocycle be $L^{0}$ and $R^{0}$ . We also write $\bar{B}^{0}=A^{(N)}_{\sigma^{jN}\omega}$ for the $N$ -block, $L^{0}B^{0}R^{0}$ . Then as before, we have

[TABLE]

We have the estimate for the subtracted terms in (19):

[TABLE]

where $F(\omega)=\sum_{i=0}^{N-1}\log^{+}\|A(\sigma^{i}\omega)\|_{\mathsf{SHS}}$ . This is a consequence of sub-additivity of $\Xi_{k}$ , the fact that $\|A\mathcal{D}_{2}^{-1}\|_{\mathsf{op}},\|A\|_{\mathsf{op}}\leq\|A\|_{\mathsf{SHS}}$ for every $A\in\mathsf{SHS}$ and $\Xi_{k}(A)\leq k\log\|A\|_{\mathsf{op}}$ . By the choice of $\chi$ , we have $\int_{\bar{G}^{c}}F(\omega)\,d\bar{\mathbb{P}}({\bar{\omega}})<\eta N/(108k)$ . The combined contribution from the subtracted terms in (19) to all of the $\tilde{\Xi}_{k}(B^{\epsilon})$ terms in (11) is bounded above by

[TABLE]

where $\mathsf{Filler}$ is $\bar{G}^{c}\cup\bar{\sigma}^{-N}\bar{G}^{c}\cup\bar{\sigma}^{N}\bar{G}^{c}$ , the set of points which are the first index of a filler block. Hence the expectation of the contribution of the subtracted terms in (19) is at most $\eta nN/12$ .

We use a similar argument to give a lower bound for the sum of the added $2\Xi_{k}(\bar{B}_{0})$ terms in (19). These terms are

[TABLE]

By the choice of $\chi$ , $\int_{B}\Xi_{k}(A^{(N)}_{\omega})\geq-\eta N/72$ for any set, $B$ , of measure at most $\chi$ . Hence, the expected value of the expression in (20) is bounded below by $-\eta nN/12$ .

Combining these estimates along all filler blocks occuring in (11), we see

[TABLE]

9.9. Combining the inequalities

At this point, we have (combining inequalities (11), (16), (18) and (21)),

[TABLE]

where $E_{3}$ comes from the contributions of (18) and (21). Then using (10),

[TABLE]

where $n_{\mathsf{Filler}}$ and $n_{\textsf{Super}}$ are the number of filler and super-blocks respectively in ${A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}$ . By sub-additivity, the first term in parentheses is at least $\operatorname{\mathbb{E}}\Xi_{k}(A^{(nN)}_{\omega})$ . We have $\operatorname{\mathbb{E}}n_{\mathsf{Filler}}<3\chi n$ and $\operatorname{\mathbb{E}}n_{\textsf{Super}}<\chi n$ ,

[TABLE]

As $\epsilon$ is reduced to 0, $\delta$ does not grow, but $N\to\infty$ so that for sufficiently small $\epsilon$ , we have

[TABLE]

Hence we deduce $\Lambda_{k}(A^{\epsilon})\geq\Lambda_{k}(A)-\eta$ , as required. ∎

10. Convergence of the Oseledets spaces

Proof of Theorem B.

Let $k=D_{i}$ be as in the statement of the theorem. Let us assume, by possibly rescaling the cocycle by a constant, that $\mu_{k}>0>\mu_{k+1}$ . Let $\delta_{0}<1$ and

[TABLE]

We will show that for every $0<\eta<1$ and every sufficiently small $\epsilon>0$ , $\bar{\mathbb{P}}(U_{\epsilon})<\eta$ .

Once this is established, convergence in probability of the Oseledets spaces $Y_{k}^{\epsilon}({\bar{\omega}})$ to $Y_{k}^{0}(\omega)$ follows via the identity $Y_{k}^{\epsilon}({\bar{\omega}})=E_{k}^{\epsilon}({\bar{\omega}})\cap F_{k-1}^{\epsilon}({\bar{\omega}})$ , and the fact that $F_{k-1}^{\epsilon}({\bar{\omega}})$ coincides with the orthogonal complement of the top $k$ -dimensional Oseledets space of the adjoint cocycle $(A^{\epsilon})^{*}$ , which converges in probability by the same argument. See [10, §4] for details.

In what follows, we will repeatedly apply Lemma 8, assuming $\xi<\frac{\eta}{3},\delta_{1}<\min\{\delta_{0},\frac{\mu_{k}\eta}{15k}\}$ , and so the value of $\tau$ provided by Lemma 8 satisfies $\tau\leq\delta_{1}\leq\frac{\mu_{k}\eta}{15k}$ . The corresponding value of $\delta<\delta_{0}$ provided by Lemma 8 will also be used in the application of Lemma 7.

Let

[TABLE]

where $N$ depends on $\epsilon$ as in Lemma 9. For sufficiently small $\epsilon$ , we have $\bar{\mathbb{P}}(\bar{G}\cap\bar{\sigma}^{-N}\bar{G})\geq 1-\frac{2\eta}{3}$ , so that once we show $\bar{\mathbb{P}}(W_{\epsilon})<\frac{\eta}{3}$ , we will be able to conclude that $\bar{\mathbb{P}}(U_{\epsilon})=\bar{\mathbb{P}}(\sigma^{-2N}U_{\epsilon})\leq\bar{\mathbb{P}}(W_{\epsilon})+\bar{\mathbb{P}}(\bar{G}^{c}\cup\bar{\sigma}^{-N}\bar{G}^{c})<\eta$ .

Lemma 35.

Suppose ${\bar{\omega}}\in W_{\epsilon}$ . Then ${\perp}(E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}}),F_{k}(A^{(N)}_{\sigma^{N}\omega}))\leq\delta$ .

Proof.

We prove the contrapositive: Suppose ${\perp}(E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}}),F_{k}(A^{(N)}_{\sigma^{N}\omega}))>\delta$ . Applying Lemma 7(c) (using $\sigma^{N}{\bar{\omega}}\in\bar{G}$ and setting $V=E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}})$ ), we see $\angle(E_{k}^{\epsilon}(\bar{\sigma}^{2N}{\bar{\omega}}),E_{k}(A^{(N)}_{\sigma^{N}\omega}))<\delta$ . As $\angle(E_{k}(A^{(N)}_{\sigma^{N}\omega}),E_{k}(\sigma^{2N}\omega))<\delta$ by Lemma 8(b), we deduce the bound $\angle(E_{k}^{\epsilon}(\bar{\sigma}^{2N}{\bar{\omega}}),E_{k}(A^{(N)}_{\sigma^{2N}\omega}))<2\delta$ , which contradicts ${\bar{\omega}}\in\bar{\sigma}^{-2N}U_{\epsilon}$ . ∎

Lemma 36.

If ${\bar{\omega}},\bar{\sigma}^{N}{\bar{\omega}}\in\bar{G}$ and ${\perp}(E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}}),F_{k}(A^{(N)}_{\sigma^{N}\omega}))<\delta$ , then ${\perp}(E_{k}^{\epsilon}({\bar{\omega}}),F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}))<\delta^{-1}e^{-(\mu_{k}-\tau)N}$ .

Proof.

We will show the contrapositive. Assume ${\perp}(E_{k}^{\epsilon}({\bar{\omega}}),F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}))\geq\delta^{-1}e^{-(\mu_{k}-\tau)N}$ . Let $v\in E_{k}^{\epsilon}({\bar{\omega}})$ be of length 1. Let $v=u+w$ , with $u\in F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)})$ and $w\in F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)})^{\perp}$ . Then, $\|w\|\geq\delta^{-1}e^{-(\mu_{k}-\tau)N}$ , and so $\|{A^{\epsilon}}_{{\bar{\omega}}}^{(N)}w\|\geq\delta^{-1}$ . Also, $\|{A^{\epsilon}}_{{\bar{\omega}}}^{(N)}u\|\leq 2$ by Lemma 7(a). Normalizing ${A^{\epsilon}}_{{\bar{\omega}}}^{(N)}v$ and recalling that ${A^{\epsilon}}_{{\bar{\omega}}}^{(N)}E_{k}^{\epsilon}({\bar{\omega}})=E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}})$ and ${A^{\epsilon}}_{{\bar{\omega}}}^{(N)}F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)})^{\perp}=E_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)})$ , we obtain that a point of $E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}})\cap S$ is within $4\delta$ of $E_{k}({A^{\epsilon}}_{\omega}^{(N)})\cap S$ . Hence, by Lemma 2, $\angle(E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}}),E_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}))<4\delta$ .

By Lemmas 7(b) and 8(b), $\angle(E_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}),E_{k}({\sigma^{N}\omega}))<2\delta$ . Hence, $\angle(E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}}),E_{k}({\sigma^{N}\omega}))<6\delta$ . By Lemma 8(c), $\angle(F_{k}(A^{(N)}_{\omega}),F_{k}({\sigma^{N}\omega}))<\delta$ . Lemma 8(a) ensures that ${\perp}(E_{k}(\sigma^{N}\omega),F_{k}({\sigma^{N}\omega}))>10\delta$ , and combining with the above, we conclude that ${\perp}(E_{k}^{\epsilon}(\bar{\sigma}^{N}{\bar{\omega}}),F_{k}(A^{(N)}_{\sigma^{N}\omega}))>3\delta$ . ∎

Lemma 37.

If $\epsilon$ is sufficiently small that $\delta^{-1}+2<e^{k\tau N}$ , ${\bar{\omega}}\in\bar{G}$ and ${\perp}(E_{k}^{\epsilon}({\bar{\omega}}),F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}))<\delta^{-1}e^{-(\mu_{k}-\tau)N}$ , we have

[TABLE]

Proof.

By hypothesis, there exists a unit length $v\in E_{k}^{\epsilon}({\bar{\omega}})$ such that $v=f+f^{\perp}$ , with $f\in F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}),f^{\perp}\in F_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)})^{\perp}$ and $\|f^{\perp}\|<\delta^{-1}e^{-(\mu_{k}-\tau)N}$ .

Now, since $E_{k}^{\epsilon}({\bar{\omega}})$ is $k$ -dimensional, $\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}|_{E_{k}^{\epsilon}({\bar{\omega}})})$ is the logarithm of the volume growth of any $k$ -dimensional parallelepiped in $E_{k}^{\epsilon}({\bar{\omega}})$ under ${A^{\epsilon}}_{{\bar{\omega}}}^{(N)}$ . Let $v,v_{2},\dots,v_{k}$ be an orthonormal basis for $E_{k}^{\epsilon}({\bar{\omega}})$ . Then,

[TABLE]

By the choice of $f$ , and using Lemma 7(a),

[TABLE]

Since $\|f^{\perp}\|<\delta^{-1}e^{-(\mu_{k}-\tau)N}$ , then $\operatorname{Vol}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}f^{\perp},{A^{\epsilon}}_{{\bar{\omega}}}^{(N)}v_{2},\dots,{A^{\epsilon}}_{{\bar{\omega}}}^{(N)}v_{k})\leq\|f^{\perp}\|e^{\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)})}<\delta^{-1}e^{(\mu_{1}+\ldots+\mu_{k-1}+k\tau)N}$ . ∎

Lemma 38.

There exists $\epsilon_{0}>0$ and $M\in\mathbb{N}$ such that for every $\epsilon<\epsilon_{0},N\geq M$ and $B\subset\bar{\Omega}$ , we have that

[TABLE]

In particular, for all sufficiently small $\epsilon$ , the above holds for $N$ chosen as in Lemma 9.

Proof.

By the $L^{1}$ convergence in the sub-additive ergodic theorem, there exists $M>0$ be such that $\|\Xi_{k}(A^{(n)}_{\omega})-n(\mu_{1}+\dots+\mu_{k})\|_{1}\leq n{\tau}$ for every $n\geq M$ . In particular, for every $n\geq M$ and every $B\subset\bar{\Omega}$ ,

[TABLE]

Notice that $\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(n)})\leq k\log^{+}\|{A^{\epsilon}}_{{\bar{\omega}}}^{(n)}\|_{\mathsf{op}}\leq k\sum_{j=0}^{n-1}(\log^{+}\|A_{\sigma^{j}\omega}\|_{\mathsf{op}}+\epsilon\|\Delta_{j}\|_{\mathsf{op}})$ , where we have used the fact that $\log^{+}(x+y)\leq\log^{+}(x)+|y|$ . For a fixed $n$ , this shows that the family of functions $g_{\epsilon}({\bar{\omega}})=\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(n)})$ for $0\leq\epsilon<1$ is dominated, and converges as $\epsilon\to 0$ to $\Xi_{k}(A^{(n)}_{\omega})$ . Hence, by the reverse Fatou lemma, for sufficiently small $\epsilon>0$ , $n\in\{M,\dots,2M-1\}$ and every $B\subset\bar{\Omega}$ ,

[TABLE]

Using sub-additivity of $\Xi_{k}$ , we conclude that for every $N\geq M$ , and every $B\subset\bar{\Omega}$ ,

[TABLE]

∎

Notice that if ${\bar{\omega}}\in W_{\epsilon}$ , then by Lemmas 35, 36 and 37 (each lemma establishing the hypothesis of the next one), then $\Xi_{k}({A^{\epsilon}}_{{\bar{\omega}}}^{(N)}|_{E_{k}^{\epsilon}({\bar{\omega}})})\leq(\mu_{1}+\ldots+\mu_{k-1}+2k\tau)N$ . Combining this with Lemma 38, we see

[TABLE]

Hence,

[TABLE]

In particular, in view of the convergence of the exponents, for all sufficiently small $\epsilon$ , we have $\bar{\mathbb{P}}(W_{\epsilon})\leq 5k\tau/\mu_{k}<\frac{\eta}{3}$ . ∎

Acknowledgements

GF and AQ acknowledge partial support from the Australian Research Council (DP150100017). The research of CGT has been supported by an ARC DECRA (DE160100147). AQ acknowledges the support of NSERC. The authors are grateful to the the School of Mathematics and Statistics at the University of New South Wales, the School of Mathematics and Physics at the University of Queensland and the Department of Mathematics and Statistics at the University of Victoria for their hospitality, allowing for research collaborations which led to this project.

Bibliography26

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V. Baladi, A. Kondah, and B. Schmitt. Random correlations for small perturbations of expanding maps. Random Comput. Dynam. , 4(2-3):179–204, 1996.
2[2] A. Blumenthal, J. Xue, and L.-S. Young. Lyapunov exponents for random perturbations of some area-preserving maps including the standard map. Ann. of Math. (2) , 185(1):285–310, 2017.
3[3] J. Bochi. Genericity of zero Lyapunov exponents. Ergodic Theory and Dynamical Systems , 22(6):1667–1696, 12 2002.
4[4] J. Bochi and M. Viana. The Lyapunov exponents of generic volume-preserving and symplectic maps. Ann. Math. , 161:1423–1485, 2005.
5[5] C. Bocker-Neto and M. Viana. Continuity of Lyapunov Exponents for Random 2D Matrices. Ar Xiv e-prints , Dec. 2010.
6[6] T. Bogenschütz. Stochastic stability of invariant subspaces. Ergodic Theory Dynam. Systems , 20(3):663–680, 2000.
7[7] R. Durrett. Probability: Theory and Examples (4th Edition) . Cambridge Univ. Press, 2010.
8[8] G. Froyland and C. González-Tokman. Stability and approximation of invariant measures of Markov chains in random environments. Stoch. Dyn. , 16(1):1650003, 23, 2016.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Hilbert Space Lyapunov Exponent stability

Abstract.

1. Introduction

2. The model and principal results

Theorem A**.**

Theorem B**.**

3. Notation and the quantity Ξ~k\tilde{\Xi}_{k}Ξ~k​

Lemma 1**.**

Lemma 2**.**

Proof.

Lemma 3**.**

Proof.

Corollary 4**.**

Proof.

4. Good Blocks

Lemma 5**.**

Proof.

Lemma 6**.**

Proof.

Lemma 7**.**

Proof.

Lemma 8** (Good blocks).**

Lemma 9** (Good block length).**

Proof.

Proposition 10** (Glueing good blocks).**

Proof.

Lemma 11**.**

Proof.

Proposition 12**.**

Proof.

5. Comparing perturbed and unperturbed bad blocks (Type I)

Lemma 13**.**

Proof.

Proposition 14**.**

Proof.

6. Type II bad block perturbations

Lemma 15**.**

Proof.

Lemma 16**.**

Proof.

Proposition 17**.**

Proof.

7. Joining good and bad blocks

Lemma 18**.**

Proof.

Lemma 19**.**

Proof.

Lemma 20**.**

Proof.

Lemma 21**.**

Lemma 22**.**

Proof.

Lemma 23**.**

Proof.

Proposition 24**.**

Proof.

8. Comparison of Ξk\Xi_{k}Ξk​ and Ξ~k\tilde{\Xi}_{k}Ξ~k​

Lemma 25**.**

Proof.

Lemma 26**.**

Proof.

Proposition 27**.**

Proof.

Corollary 28**.**

Proof.

Lemma 29**.**

Proof.

Lemma 30**.**

Proof.

Lemma 31**.**

Proof.

Proposition 32**.**

Proof.

Theorem A.

Theorem B.

3. Notation and the quantity $\tilde{\Xi}_{k}$

Lemma 1.

Lemma 2.

Lemma 3.

Corollary 4.

Lemma 5.

Lemma 6.

Lemma 7.

Lemma 8 (Good blocks).

Lemma 9 (Good block length).

Proposition 10 (Glueing good blocks).

Lemma 11.

Proposition 12.

Lemma 13.

Proposition 14.

Lemma 15.

Lemma 16.

Proposition 17.

Lemma 18.

Lemma 19.

Lemma 20.

Lemma 21.

Lemma 22.

Lemma 23.

Proposition 24.

8. Comparison of $\Xi_{k}$ and $\tilde{\Xi}_{k}$

Lemma 25.

Lemma 26.

Proposition 27.

Corollary 28.

Lemma 29.

Lemma 30.

Lemma 31.

Proposition 32.

Lemma 33.

Lemma 34.

9.3. Replacing $\Xi_{k}$ with $\tilde{\Xi}_{k}$

9.4. Splitting ${A^{\epsilon}}_{{\bar{\omega}}}^{(nN)}$ into good and bad blocks

9.5. Comparison of $\tilde{\Xi}_{k}(G^{\epsilon})$ and $\Xi_{k}(G^{\epsilon})$

9.6. Comparison of $\Xi_{k}(G^{\epsilon})$ and $\Xi_{k}(G^{0})$

9.7. Comparison of $\operatorname{\mathbb{E}}\tilde{\Xi}_{k}(B^{\epsilon})$ and $\tilde{\Xi}_{k}(B^{0})$

9.8. Comparison of $\tilde{\Xi}_{k}(B^{0})$ and $\Xi_{k}(B^{0})$

Lemma 35.

Lemma 36.

Lemma 37.

Lemma 38.