Fast approximation of the $p$-radius, matrix pressure or generalised Lyapunov exponent for positive and dominated matrices
Ian D. Morris

TL;DR
This paper introduces a new algorithm for efficiently approximating the p-radius, matrix pressure, or generalized Lyapunov exponent for positive or dominated matrices, with significant improvements for low-dimensional cases.
Contribution
The authors develop a novel eigenvalue-based algorithm using Fredholm determinants to compute the p-radius for positive and dominated matrices, enhancing accuracy and efficiency.
Findings
Significant improvements over existing methods for low-dimensional positive matrix pairs.
The algorithm interprets the p-radius as a leading eigenvalue of a trace-class operator.
Applicable to matrices with positivity or domination properties, relevant in wavelet and fractal analysis.
Abstract
If A_1,...,A_N are real square matrices then the p-radius, generalised Lyapunov exponent or matrix pressure is defined to be the asymptotic exponential growth rate of the sum , where p is a real parameter. Under its various names this quantity has been investigated for its applications to topics including wavelet regularity and refinement equations, fractal geometry and the large deviations theory of random matrix products. In this article we present a new algorithm for computing the p-radius under the hypothesis that the matrices are all positive, or more generally under the hypothesis that they satisfy a weaker condition called domination. This algorithm is based on interpreting the p-radius as the leading eigenvalue of a trace-class operator on a Hilbert space and estimating that eigenvalue via approximations to the Fredholmβ¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Dynamics and Fractals Β· Mathematical Analysis and Transform Methods Β· Matrix Theory and Algorithms
Fast approximation of the -radius, matrix pressure or generalised Lyapunov exponent for positive and dominated matrices
Ian D. Morris
Abstract.
If are real matrices then the -radius, generalised Lyapunov exponent or matrix pressure is defined to be the asymptotic exponential growth rate of the sum , where is a real parameter. Under its various names this quantity has been investigated for its applications to topics including wavelet regularity and refinement equations, fractal geometry and the large deviations theory of random matrix products. In this article we present a new algorithm for computing the -radius under the hypothesis that the matrices are all positive, or more generally under the hypothesis that they satisfy a weaker condition called domination. This algorithm is based on interpreting the -radius as the leading eigenvalue of a trace-class operator on a Hilbert space and estimating that eigenvalue via approximations to the Fredholm determinant of the operator. In this respect our method is closely related to the work of Z.-Q. Bai and M. Pollicott on computing the top Lyapunov exponent of a random matrix product. For pairs of positive matrices of low dimension our method yields substantial improvements over existing methods.
1. Introduction
If is a tuple of real matrices and a real parameter, the limit
[TABLE]
exists by applying Feketeβs subadditivity lemma to the sequence
[TABLE]
if , or to the sequence if . The quantity (1), modulo some trivial variations in its definition, has been studied independently in at least three different contexts and literatures: under the name of generalised Lyapunov exponent the quantity has been studied for in [9, 41] where its investigation is motivated by the large deviations theory of random matrix products in statistical mechanics; under the name of matrix pressure, the quantity has been investigated for in the fractal geometry literature in view of its applications to the dimension of self-similar and self-affine limit sets ([11, 12, 13, 27, 34]); and in the joint spectral radius literature, the quantity has been investigated for in connection with its applications to wavelet regularity [8, 26, 42] and the control theory of discrete linear inclusions [22, 30]. Across all three literatures there has arisen the problem of computing or estimating the quantity β as may be seen for example in [23, 27, 31, 34, 36, 39, 41] β and it is with this that the present article is concerned. The principal result of this article is a new algorithm for the computation of in the case where the matrices are positive and is an arbitrary real number. More generally, our method extends to the case where the matrices strictly preserve a cone or multicone.
2. Statement of main result
In order to state our result let us present the definition of a multicone. Let us say that a cone in is a set which is closed, convex, has nonempty interior, satisfies for all real and satisfies . A multicone will be a tuple of cones in such that for some nonzero vector we have for all nonzero , and such that for distinct . The vector is called the transverse-defining vector of the multicone. We say that a matrix strictly preserves a cone if , and we say that strictly preserves a multicone if for every we have for some depending on . If strictly preserves a multicone then a simple pigeonhole argument demonstrates that some power of strictly preserves a cone, which implies that has a simple leading eigenvalue (which might be either positive or negative). We say that strictly preserves a multicone if every strictly preserves that multicone. We say that is multipositive if there exists a multicone which is strictly preserved by . The property of multipositivity admits characterisations which do not overtly refer to cones or multicones: for example, if is a tuple of invertible matrices then the multipositivity of is equivalent to the condition
[TABLE]
where denotes the singular value of the matrix , see for example [4, 5, 29]. The condition (2) above is sometimes called -domination or simply domination and has been explored in some detail in the dynamical systems literature [1, 5]; its applications to certain numerical invariants of sets of matrices have been investigated in such works as [6, 7].
For each we let denote the set of all finite sequences such that are integers between and . If a tuple of matrices is understood, given we define If then we write and call this the length of . Finally we let denote the spectral radius of the matrix , and we let denote the eigenvalues of listed in decreasing order of absolute value. Since our matrices will always strictly preserve a multicone the largest eigenvalue of will always be unique and the definition of unambiguous.
We may now state the principal result of this article, which is the following:
Theorem 1**.**
Let be multipositive, where , and let . For every define
[TABLE]
where denotes the characteristic polynomial of the matrix and its first derivative evaluated at . Define and
[TABLE]
for every . Then for all sufficiently large there exists a smallest positive real root of the polynomial , and there exist constants such that for all large enough integers
[TABLE]
Theorem 1 applies in particular if the matrices are all positive matrices, or if the matrices all strictly preserve a single cone . However, multipositive matrix tuples with neither of these two properties also exist: see [1]. We remark that since for every invertible matrix , a sufficient condition for the application of Theorem 1 is that the matrices be simultaneously conjugate to positive matrices.
The reader will notice that the order of convergence in Theorem 1 is strongest when the dimension of the matrix is and becomes weaker as the dimension is increased, although it is in all cases super-exponential in . The problem of estimating the implied constants and in (3) is not attempted in this article; we believe that in the case of tuples of positive matrices this should be feasible in principle, but would rely on difficult functional-analytic estimates such as an a priori bound for the cardinality of the relative covers arising in the application of [3, Theorem 4.7] to certain complex cones. In any event, convergence in Theorem 1 is fast enough to yield significant results in low dimensions. In the previous work [22], R. Jungers and V. Yu. Protasov investigated the problem of computing what in our notation corresponds to the quantity
[TABLE]
for the pair of matrices
[TABLE]
with , obtaining an estimate of . It happens that the pair is simultaneously conjugate to a pair of positive matrices; taking in Theorem 1 yields the estimate
[TABLE]
for the same quantity, which is empirically accurate to all decimal places shown.
We remark that in the literature on the generalised Lyapunov exponent, it is common to consider the quantity
[TABLE]
in place of the quantity as defined in (1), where is a probability vector. The quantity (5) can easily be included within the scope of (1) and Theorem 1 by replacing each instance of a matrix with the corresponding matrix . Concretely, this implies that the quantity (5) can be calculated using Theorem 1 by taking instead
[TABLE]
where , and leaving the rest of the theorem unchanged. For the remainder of the article we therefore ignore the issue of giving a probability weighting to each and concentrate on the calculation of the -radius as defined in (1).
It is possible to show that the quantities defined in Theorem 1 satisfy and therefore increase (or decrease) exponentially with . The efficiency of the estimate in Theorem 1 on the other hand relies on the quantities decreasing as . The small size of the quantities thus arises from additive cancellation among the relatively large terms in the sum defining each . In practical applications it is therefore important to compute the quantities to a precision exceeding that desired for the approximation to .
The remainder of this article is structured as follows. In Β§3 below we review the fundamental properties of and describe some existing techniques for its estimation. In Β§4 we describe in outline the techniques underlying the proof of Theorem 1 and in Β§5 the proof itself is presented. In Β§6 we present some examples of the computation of using the algorithms described herein.
3. Methods for estimating the -radius
3.1. Fundamental estimates
If and then by elementary estimates it follows that if and only if the joint spectral radius
[TABLE]
is zero. It is well known that the joint spectral radius is zero if and only if all of the products of length are zero, if and only if there exists a basis in which all of the matrices are simultaneously upper triangular with all diagonal entries equal to zero (for details see [21, Β§2.3.1]). Since the theory of the -radius is trivial in this situation we will for the remainder of this paper deal only with matrices for which the -radius is assumed to be nonzero. We remark that in the multipositive case considered in Theorem 1 every product has a simple leading eigenvalue and in particular is not the zero matrix, so in this case is guaranteed to be nonzero.
When the -radius admits an elementary description as the limit of a convergent sequence of upper bounds,
[TABLE]
as a consequence of the submultiplicativity relation
[TABLE]
Less trivially, when it may also be expressed as the limit of a convergent sequence of lower bounds:
[TABLE]
where , see [27, Theorem 1.2]. In particular the -radius can in principle be approximated to within any prescribed error by systematically computing the upper and lower bounds until they eventually agree to within the prescribed amount. However, since the computational effort involved increases exponentially with and the relative error may reasonably be presumed to be at least of the order of , and since the constant is relatively large even in the case , this procedure seems unlikely to have any value for practical computations. An illustration of this is presented in Β§6 below. We remark that an additional theoretical consequence of the above expressions is that the -radius varies continuously both in and in the matrix entries when is positive, since it is then equal to both an upper and a lower pointwise limit of sequences of continuous functions, hence continuous. When the computability and continuity of the -radius do not seem to have been as thoroughly investigated, but based on the related works [6, 28, 40] it seems likely that continuity should not hold and that systematic upper and lower estimation might be infeasible, at least when the matrices are not assumed to be positive or invertible.
When is a positive even integer, or when is a positive integer and the matrices preserve a cone, the identity
[TABLE]
has been discovered independently on several occasions [10, 36, 43]. (Here denotes the Kronecker power of the matrix , see for example [17, Β§4.2].) When is a positive integer and are not necessarily positive, the inequality
[TABLE]
may be obtained by the same means. Whilst in principle (8) represents an easy method for computing the -radius of positive matrices, the size of the auxiliary matrix increases exponentially with which prevents the use of the formula when is sufficiently large. For non-integer these results may nonetheless be exploited so as to yield upper bounds as follows. We observe that if and are real numbers such that , and , then for each
[TABLE]
using HΓΆlderβs inequality with and . It follows easily that
[TABLE]
and hence the function is convex. This yields the upper bound
[TABLE]
valid for all and , which does not seem to have been previously noted in the literature. We will see in Β§6 below that despite its crudity this estimate does not automatically provide a bad approximation and should not be discounted out of hand.
3.2. Resampled Monte Carlo methods
In [41], J. Vanneste introduced a method based on the interpretation of the -radius as an asymptotic moment of a random matrix product: given , and we may view the sum as the expectation of the random variable where each word of length is chosen with probability . This suggests the possibility of approximating for large by Monte Carlo estimation: if we choose words independently then by the law of large numbers, the average should for large enough give a reasonable approximation to the value which is that random variableβs expectation and hence a good approximation to as long as is reasonably large. However, except which is small, the variance of this random variable will be prohibitively large β indeed exponentially large in β which makes convergence in the strong law of large numbers unreasonably slow. To compensate for this Vanneste introduced a βgo-with-the-winnersβ resampling scheme along the lines of [15], which successively modifies the distribution of the random variable so as to retain the same mean while reducing the variance; see discussion in [41, Β§III] for details. The particular strength of this method is that it has very limited dependence on the number of matrices and their dimension; on the other hand, the accuracy of the results is relatively low in practice. See Β§6 below for further discussion.
3.3. The convex optimisation bounds of Jungers and Protasov
The article [23] introduced new systematic upper and lower bounds for the -radius in the case . If are non-negative matrices, Jungers and Protasov showed that the quantities
[TABLE]
[TABLE]
where denotes the entry of the matrix , satisfy
[TABLE]
for every . (Here we have modified the statement of their results in concordance with our definition of .) The quantities and are solutions to convex optimisation problems and as such may be efficiently approximated. In the case where preserves a more general cone (in the weak sense that for each ) analogous upper and lower bounds are given, but these are not in general the solutions to convex optimisation problems and as such are more difficult to efficiently or rigorously estimate. Since the matrices always preserve a cone irrespective of the structure of the original matrices , and since for all , this more general version of their method permits the estimation of for arbitrary and .
As with the upper and lower bounds (6) and (7) this system of estimation requires the computation of matrix products in order to obtain the approximation and as such is best suited to cases in which is small.
3.4. Eigenvalue methods
As has been previously observed by J. Vanneste [41, Β§II.B], the quantity can be represented as the leading eigenvalue of a linear operator on an infinite-dimensional function space in the following manner. Suppose that are invertible matrices and let . Let denote the space of lines through the origin in , with the distance between two lines defined to be the angle at which they intersect. For each nonzero let denote the line spanned by . Define an operator on the space of -HΓΆlder continuous functions by
[TABLE]
and observe that by a simple calculation
[TABLE]
for every , and . With only a little more work one may show that in fact
[TABLE]
and under mild algebraic non-degeneracy conditions on the matrices , a rather longer argument shows that is the largest eigenvalue of acting on if is chosen sufficiently small (see for example [16, Théorème 8.8]). This suggests the idea of calculating by approximating the operator with a large matrix representing the action of the matrices on a discretised version of . This approach was previously described in [41, §IV.A] but does not seem to have been investigated in detail. A version of this method was also suggested in [29, §8] for the purpose of estimating the Hausdorff dimensions of some self-affine limit sets.
To give a concrete example, in the case write for each and for let denote the shorter of the two arcs in from to , including the former endpoint but not the latter. Fix an integer . For each define an matrix by if and otherwise. Define now the matrix . Since corresponds to a version of acting on functions defined on a discretisation of into evenly-spaced points, we expect that for large the spectral radius of should give a reasonable approximation to . In principle it may be possible to demonstrate this rigorously using the methods of [25], but this does not seem to have so far been attempted in the literature and is certainly a problem beyond the scope of this article.
For two-dimensional matrices this method appears to yield approximations accurate to several decimal places in a tolerable amount of time (see Β§6 below) and it is apparent from the definition that the effect of increasing the number of matrices has at worst a polynomial effect on the running time of the algorithm. However the size of the matrix required in order to discretise into a mesh of prescribed size rises exponentially with the dimension , suggesting that this method is unlikely to be very useful for matrices which are not of low dimension. The question also arises of whether better estimates may be obtained by adapting the mesh locally so as to include more mesh points in regions where the derivative of one of the maps is large and fewer mesh points where it is small. Since the principal purpose of this article is to introduce the new algorithm given by Theorem 1, we leave these questions to other investigators.
4. Overview of the proof of Theorem 1
In the previous subsection we observed that admits an interpretation as the leading eigenvalue of a linear operator on an infinite-dimensional function space and considered the possibility of approximating such an operator directly by operators on finite-dimensional spaces. This is however not the only mechanism by which the leading eigenvalue of an operator may be calculated. In order to describe our chosen alternative we will briefly and informally review some concepts from the theory of trace-class linear operators; thorough formal treatments of this topic may be found in e.g. [14, 38].
If an operator on an infinite-dimensional Hilbert space has the property that the sequence of approximation numbers
[TABLE]
is summable then it is called trace-class. If this is the case then is a compact operator (since it is a limit in the norm topology of a sequence of finite-rank operators) and therefore its spectrum consists of [math] together with a finite or infinite set of eigenvalues, each of finite algebraic multiplicity, which has no nonzero accumulation points. It is not difficult to see that for every by direct manipulation of the definition and consequently every power of a trace-class operator is also trace-class. If is a trace-class operator on with finite or infinite sequence of nonzero eigenvalues , it is classical that the series converges absolutely to a quantity which is called the trace of and denoted . Moreover the quantity
[TABLE]
called the Fredholm determinant of , defines an entire holomorphic function in the variable with power series , say. It is also classical that in this case the zeros of are precisely the reciprocals of the nonzero eigenvalues of and that additionally
[TABLE]
for every , where and where is interpreted as zero if . It follows that if the traces can be easily calculated for , say, then an approximation to the Fredholm determinant can be constructed using (10) and it might be hoped that the smallest positive real root of the polynomial would provide a good estimate for the reciprocal of the leading eigenvalue of as long as the remainder is extremely small. In view of the equation (10) it follows that if the sequence can be shown to decay stretched-exponentially then this remainder will in fact be super-exponentially small, and this is indeed the approach which we will take in estimating . This general approach to estimating dynamical quantities via operator eigenvalues has been previously exploited in a number of prior articles of which we note [2, 18, 19, 29, 32, 33, 35].
The proof of Theorem 1 therefore proceeds via the introduction of a trace-class operator on a Hilbert space with the properties required by the argument sketched above: a stretched-exponential estimate on the singular values (which implies a stretched-exponential estimate on the eigenvalues via Weylβs inequality), the property , and a simple, computationally-feasible formula for the sequence of traces . The following result from [29] saves us the necessity of constructing such an operator from first principles:
Theorem 2** ([29, Corollary 5.1]).**
Let , let be real matrices and suppose that is a multicone for with transverse-defining vector . Then there exists a nonempty bounded open subset of the complex hyperplane such that the following properties hold. Let denote the separable complex Hilbert space of holomorphic functions for which the integral is finite, where denotes -dimensional Lebesgue measure on . For each define an operator by
[TABLE]
Then the operators are well-defined bounded linear operators on and:
- (i)
There exist such that for all and we have
[TABLE]
In particular each is trace-class. 2. (ii)
For every and we have
[TABLE] 3. (iii)
For every the spectral radius of is equal to
[TABLE] 4. (iv)
For all the spectral radius of is a simple eigenvalue of and there are no other eigenvalues of the same modulus.
Theorem 1 can thus be seen as a version of the eigenvalue-problem approach discussed in the previous section, but one which takes advantage of the special additional structure of trace-class operators. Note that since trace-class operators are compact operators they are very far from being invertible, and indeed an important feature of the hypotheses of Theorem 2 is that the transformations map a (not necessarily connected) patch of strictly inside itself β which results in a non-invertible action on the associated function space β as opposed to acting transitively on . This feature is precisely the content of the multicone hypothesis, and indeed the non-invertibility of the action on is critical in constructing a space on which the operators can act in a trace-class manner. As such any extension of the method of Theorem 1 to families of matrices with non-real eigenvalues is therefore likely to be impossible since such matrices would tend to act transitively on the phase space , preventing the construction of a suitable domain for a trace-class operator to act upon.
5. Proof of Theorem 1
The following result summarises the classical results on traces and determinants of trace-class operators on Hilbert spaces which will be required in our proof. It is a combination of several results from [38, Β§3], with the exception of the determinant formula for which may be found instead in, for example, [37, Theorem 6.8] or [14, Theorem IV.5.2].
Theorem 3**.**
Let be a complex separable Hilbert space, let be a trace-class operator acting on , and define and
[TABLE]
for every . Then the power series converges for all . The function is holomorphic, the zeros of are precisely the reciprocals of the nonzero eigenvalues of , and the degree of each zero of is equal to the algebraic multiplicity of the corresponding eigenvalue of . Moreover the coefficients satisfy the estimate
[TABLE]
for every .
We also require the following elementary lemma:
Lemma 5.1**.**
For each there exists a constant such that
[TABLE]
for all .
Proof.
Fix and . By adjusting the constant if necessary we may without loss of generality assume . Define
[TABLE]
Since clearly for every integer we have
[TABLE]
for every and the result follows. β
We may now begin the proof of Theorem 1. Fix and as in Theorem 1. By Theorem 2 there exist a complex separable Hilbert space and a trace-class linear operator such that is a simple isolated eigenvalue of , such that all other eigenvalues have absolute value strictly smaller than , such that
[TABLE]
for every and such that there exist constants such that for every . Define the sequence in accordance with Theorem 1 and note that we have for every . For each let be as defined in Theorem 3 and note that this coincides with the definition of the sequence in Theorem 1. We claim that there exist such that
[TABLE]
for every . To see this let and observe that by Theorem 3
[TABLE]
where we have used Lemma 5.1 with and have also used the elementary inequality
[TABLE]
which is valid since the series is an upper Riemann sum of the integral. The claim follows easily.
Now define a function by . It is clear from the estimate (11) that this power series has infinite radius of convergence and therefore is a well-defined holomorphic function on . By Theorem 3 we have for all and the zeros of are precisely the reciprocals of the nonzero eigenvalues of with the degree of each zero being equal to the algebraic multiplicity of the corresponding eigenvalue. By Theorem 2, is the largest eigenvalue of in absolute value and is a simple eigenvalue. It follows that we may choose a circular contour in which is centred somewhere on the real line, passes through [math], encloses and does not enclose or intersect the reciprocal of any eigenvalue of other than . Let and denote the centre point and radius of respectively. Since does not intersect the reciprocal of any eigenvalue of the function does not have any zeros on , so by compactness
[TABLE]
For each define a function by . Obviously each is a polynomial and is therefore holomorphic on . Via Lemma 5.1 the estimate (11) implies
[TABLE]
for all and some suitable constants . In particular
[TABLE]
and therefore there exists such that for all
[TABLE]
Applying RouchΓ©βs theorem on the circular contour we deduce that for all the functions and have the same number of zeros inside the contour , and the total degree of the zeros inside is the same for the function as it is for the function . Since has a unique zero inside and that zero is simple this means that has a unique zero inside for all large enough , and this zero is simple. Call this zero . Since is a polynomial with real coefficients its zeros are symmetrically located with respect to reflection in the real axis. Since the contour is circular with real centre, a zero of is enclosed by if and only if the complex conjugate of that zero is also so enclosed. It follows that the complex conjugate of is also enclosed by the contour and is therefore also a zero of . But has a unique zero inside . These statements can only be compatible if is equal to its own complex conjugate, and we conclude that is real. Since is enclosed by and is real it necessarily lies on the interval and is the unique zero of on that interval. In particular it is the smallest positive zero of the polynomial .
Define . To complete the proof of the theorem we will show that
[TABLE]
We first require a lower bound for the derivative for close to . Since is a simple zero of we have , and since it is also necessarily an isolated zero we may choose such that for all with , such that for all with , and such that the closed disc of radius and centre is enclosed by the contour . Since by compactness
[TABLE]
it follows via (13) in the same manner as before that there exists such that for all
[TABLE]
Applying RouchΓ©βs theorem again, this time to the circular contour with centre and radius , we see that for each there is a unique zero of within distance of . Since the disc of radius and centre is enclosed by , and encloses a unique zero of , we conclude that this zero must be and therefore for all .
Now define
[TABLE]
Since is a polynomial with real coefficients it takes only real values when restricted to and therefore the same is true of since it is the pointwise limit of as . Let and suppose that . By the Mean Value Theorem it follows that there exists a real number in the interval from to such that
[TABLE]
Since clearly we have and therefore
[TABLE]
This inequality is obviously also true for integers such that . In particular for all we have
[TABLE]
using the fact that . Thus
[TABLE]
for all using (12). We in particular have . If is taken large enough that for all we have , then for all we have
[TABLE]
and this completes the proof of the theorem.
6. Example: a pair of matrices considered by Jungers and Protasov
In the article [23] the -radius of the pair defined by
[TABLE]
was investigated motivated by its connection with Chaikinβs subdivision schemes and the regularity of refinable functions. The reader may easily check that if we define
[TABLE]
then the matrices and are both positive, so the pair strictly preserves a cone and Theorem 1 may be applied thereto. The results of applying the various methods of estimation to are tabulated in Figures 1β5 below. The reader will notice that by far the best results are those obtained by Theorem 1: the estimate obtained by evaluating all products of length up to 12 yields the estimate which is empirically accurate to all decimal places shown. Estimates of comparable complexity using the method of Β§3.3 give only the first two decimal places, albeit rigorously; the naΓ―ve upper and lower estimates described in Β§3.1 are not even sufficient to establish the first significant digit of . The methods of Β§3.2 and Β§3.4 perform somewhat better, being able to give non-rigorous estimates accurate to several decimal places. We also observe that the upper estimate arising from logarithmic convexity,
[TABLE]
gives a rigorous upper bound of
[TABLE]
which, remarkably, is more accurate than several of the other methods employed. Applying Theorem 1 with gives the estimate
[TABLE]
which is empirically accurate to all decimal places shown and provides the value of the estimate (4) mentioned in the introduction.
7. Conclusions
We have introduced a new method for estimating the -radius of low-cardinality sets of positive or dominated matrices and investigated its effectiveness in the case of a particular pair of matrices considered by Jungers and Protasov in connection with applications to Chaikinβs subdivision scheme. We have compared its results to those of a number of other estimation methods in the case of that example and obtained results apparently accurate to within an absolute error of approximately , versus approximately to for rival methods.
The new method has the disadvantage that the number of matrix products which must be computed in order to obtain the approximation to grows approximately as . In particular if the number of matrices being considered is greater than around 4, the computational burden of producing accurate results may be prohibitively large. This disadvantage is however shared by the methods of Β§3.1 and Β§3.3. In view of this consideration, when is large the methods of Β§3.2 and Β§3.4 may be preferable. Our method also, as presently formulated, does not provide a rigorous estimate of its own accuracy, and if rigorous bounds are sought then the method of Β§3.3, possibly in combination with the logarithmic-convexity bound (9) may be applied instead. For two-dimensional positive matrices it seems likely that an effective bound on the error could be given by adapting the arguments of [20, 24], but in higher dimensions this would require new technical results in order to bound the cardinality of the relative covers arising in the application of [3, Theorem 4.7] to the action of real linear maps on projective slices of complex cones.
8. Acknowledgements
This research was supported by the Leverhulme Trust (Research Project Grant number RPG-2016-194).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Avila, A., Bochi, J., and Yoccoz, J.-C. Uniformly hyperbolic finite-valued SL β ( 2 , β ) SL 2 β {\rm SL}(2,\mathbb{R}) -cocycles. Comment. Math. Helv. 85 , 4 (2010), 813β884.
- 2[2] Bai, Z.-Q. On the cycle expansion for the Lyapunov exponent of a product of random matrices. J. Phys. A 40 , 29 (2007), 8315β8328.
- 3[3] Bandtlow, O. F., and Jenkinson, O. Explicit eigenvalue estimates for transfer operators acting on spaces of holomorphic functions. Adv. Math. 218 , 3 (2008), 902β925.
- 4[4] Barnsley, M. F., and Vince, A. Real projective iterated function systems. J. Geom. Anal. 22 , 4 (2012), 1137β1172.
- 5[5] Bochi, J., and Gourmelon, N. Some characterizations of domination. Math. Z. 263 , 1 (2009), 221β231.
- 6[6] Bochi, J., and Morris, I. D. Continuity properties of the lower spectral radius. Proc. Lond. Math. Soc. (3) 110 , 2 (2015), 477β509.
- 7[7] Brundu, M., and Zennaro, M. Invariant multicones for families of matrices. Ann. Mat. Pura Appl. (4) . To appear.
- 8[8] Cabrelli, C. A., Heil, C., and Molter, U. M. Self-similarity and multiwavelets in higher dimensions. Mem. Amer. Math. Soc. 170 , 807 (2004), viii+82.
