Iterated random functions and regularly varying tails
Ewa Damek, Piotr Dyszewski

TL;DR
This paper studies the tail behavior of solutions to stochastic fixed point equations involving random Lipschitz functions, showing that the tail of the solution is comparable to that of the multiplicative coefficient under certain conditions.
Contribution
It provides new insights into the tail asymptotics of solutions to stochastic fixed point equations with Lipschitz functions, extending results to cases approximated by linear functions.
Findings
The tail of the solution R is comparable to the tail of A.
The tail behavior of R depends on the tail of log(A).
New results are obtained for the random difference equation.
Abstract
We consider solutions to so-called stochastic fixed point equation , where is a random Lipschitz function and is a random variable independent of . Under the assumption that can be approximated by the function we show that the tail of is comparable with the one of , provided that the distribution of is tail equivalent. In particular we obtain new results for the random difference equation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Iterated random functions and regularly varying tails
Ewa Damek, Piotr Dyszewski
Instytut Matematyczny, Uniwersytet Wroclawski, Plac Grunwaldzki 2/4, 50-384 Wroclaw, Poland
[email protected], [email protected]
Abstract.
We consider solutions to so-called stochastic fixed point equation , where is a random Lipschitz function and is a random variable independent of . Under the assumption that can be approximated by the function we show that the tail of is comparable with the one of , provided that the distribution of is tail equivalent. In particular we obtain new results for the random difference equation.
Key words and phrases:
Iterated random functions, Random difference equation, convolution equivalent distribution
Key words and phrases:
stochastic recursions, random difference equation, stationary distribution
2010 Mathematics Subject Classification:
60H25, 60J10
2010 Mathematics Subject Classification:
60H25, 60J10
The first author was partially supported by the NCN Grant UMO-2014/15/B/ST1/00060. The second author was partially supported by the National Science Centre, Poland (Sonata Bis, grant number DEC-2014/14/E/ST1/00588)
1. Introduction
Let be a sequence of independent identically distributed (iid) random Lipschitz functions. We consider the Markov chain defined by
[TABLE]
where is a random variable independent of . Under rather mild moment assumptions, the Markov chain possesses a unique stationary distribution. Suppose that is distributed according to it and let be a generic copy of independent of , then necessarily
[TABLE]
where denotes the equality in distribution. Distributional equations of this form appear in wide range of problems in applied probability. Beginning from the early nineties iterated function systems of i.i.d. Lipschitz maps (IFS) on a complete metric space have attracted a lot of attention: Alsmeyer [1], Arnold and Crauel [3], Brofferio and Buraczewski [6], Buraczewski and Damek [7], Diaconis and Friedman [15], Duflo [16], Elton [18], Henion and Hervé [27], Mirek [35] and they still do. In particular, it seems that modelling them after random difference equation (described below) has been very fruitful, see Alsmeyer [1] and Mirek [35].
The main example we wish to present here, is the so-called random difference equation occurring whenever is just affine transformation, i. e. . Then, the recursive formula for the Markov chain in question which, in this special instance, will be denoted by , takes the simple form
[TABLE]
where is a sequence of iid two-dimensional random vectors. Here the stochastic equation satisfied by is
[TABLE]
where is an independent copy of . It turns out, that due to the explicit expression of the function , the stationary solution can be explicitly represented by
[TABLE]
provided that the series is convergent. The series above can be interpreted as the current value of future payments represented by with discount factors represented by and therefore, it is very often called a perpetuity. Random variables of this form appear also in context of Additive Increase Multiplicative Decrease algorithms [26] or COGARCH processes [32], to name a few. For more detailed discussion on perpetuities and related processes we refer the reader to recent monographs [8, 28].
From the point of view of applications the key information about the distribution of is its tail asymptotic, that is
[TABLE]
We wish to recall several scenarios, were exhibits regularly varying tail. Assume that for the moment. The first example is related to the work of Kesten [29] and Goldie [22], which shows that if
[TABLE]
for a positive , the stationary distribution has power tail, i. e.
[TABLE]
for some implicitly given constant . Here, and in what follows, for two functions , , by we mean . The asymptotic (1.2) follows from the behaviour of , more precisely it is a direct consequence of the first assumption in (1.1). It may as well happen that a heavy tail of is caused by a heavy tail of . More precisely, the work of Grincevićius [25] which was later improved by Grey [24] treats the case
[TABLE]
where and is a slowly varying function, that is for any fixed . Then
[TABLE]
Finally, in the recent work Kevei [30] proves that, if
[TABLE]
for , then
[TABLE]
One important feature the scenarios (1.1)-(1.4) have in common is that either or contributes significantly to the asymptotic of , not both.
In order to get a more detailed information about the structure of the distribution of it is natural to consider the frontiers of the scenarios in question, where both coefficients have influence on the tail asymptotic of . First one, situated between (1.1) and (1.3) was recently investigated by Damek and Kołodziejek [14], is the case when
[TABLE]
which results in
[TABLE]
with some explicitly given slowly varying function .
The second one, being the frontier between (1.3)-(1.4), is
[TABLE]
where means that
[TABLE]
Up to our best knowledge, this case was not studied in the literature apart form two specific cases: independent and treated in [34] and so-called exponential functional of Lévy processes studied in [37]. Our aim is to present a robust approach to treat the scenario (1.5) and its counterpart for the iterated random functions and .
We will work under the assumption that can be well approximated by the affine transformation, that is
[TABLE]
with
[TABLE]
and, among some technical assumptions, that for
[TABLE]
In order to be able to successfully treat the case we will need to ensure that the successive iterations of are well-behaved, i.e. for fixed ,
[TABLE]
for some measurable function . Under the above, the main result of this article states that
[TABLE]
with .
The article is organized as follows. In Section 2 we recall some basic notions related to the class of convolution equivalent distributions. The results in the case of Random Difference Equation are stated in Section 3 and in the case of Iterated Random Functions in Section 4. The proofs are presented in Section 5. The Appendix contains proofs of some classical properties of the convolution equivalent distributions.
2. Convolution equivalent tails
Throughout the paper we would like to benefit from properties of convolution equivalent distributions. We begin by introducing some basic notation. We will consider distribution, say , with right-unbounded support. Write for th-convolution of and for its tail, that is .
Definition 2.1**.**
A distribution with right-unbounded support contained in is said to be tail equivalent if for any fixed , as
[TABLE]
and moreover
[TABLE]
for some . In that case we will write . By slight abuse of notation we will write for random variable , whenever its distribution is a member of .
This class was introduced independently by Chistyakov [9] and Chover et al. [11, 10]. The key feature of the class of convolution equivalent distributions is that only the right tail behaviour is of significance. For this reason it is natural to work with a wider class of distributions supported on the whole real line .
Definition 2.2**.**
We will say that a distribution with right-unbounded support contained in is tail equivalent if satisfies (2.1) and (2.2). If this is the case, we will write .
It is not difficult to see that if, and only if conditioned on the set is in . Equivalently, in term of the distributions
[TABLE]
Next property of distributions form the class will be particularly important for us.
Lemma 2.3**.**
Assume that . If for some distributions , , , then
[TABLE]
where . Moreover, implies . If on the other hand
[TABLE]
then
[TABLE]
and there is a function , such that for every
[TABLE]
Similarly, if
[TABLE]
then
[TABLE]
and for every
[TABLE]
For completeness reasons, the proofs of the above Lemma, and some other discussions regarding the class , can be found in the Appendix. In the case of , for (2.3), one would classically refer to [12].
Condition (2.2) present in Definition 2.1 seems to be and in fact it is technical. Since is the class we are mainly interested in, before we proceed any further we will present some sufficient conditions for . As it was proved by Klüppelberg [31] (see Theorem 2.1), for
[TABLE]
where denotes the class of subexponential densities, namely if
[TABLE]
for any fixed as , and
[TABLE]
Knowing sufficient conditions for , here Theorems 4.15 and 4.16 in [21], we can rewrite those in terms of and obtain the next two Corollaries.
Corollary 2.4**.**
Assume that , where , for any fixed as . If one can find a constant for which for sufficiently large , then provided that .
Corollary 2.5**.**
Suppose we have for . If is eventually concave and one can find a function such that
- •
* but as ,*
- •
* if -insensitive, i.e. as , uniformly in ,*
- •
* as ,*
then if additionally .
Example 2.6**.**
By Corollary 2.4, if for , and then . If on the other hand for and then again but this time by Corollary 2.5 with .
Other sufficient conditions, going beyond Corollaries 2.5 and 2.4 can be found in [19] and [12].
3. Random Difference Equation
We will start with the case when , in order to introduce the set-up to the problem and deliver some enlightening examples. For the sake of transparency, throughout this section we will assume that a.s. The results in full generality, including the case of two-sided will be treated in Section 4. For the needs of this Section, one can just take an iid sequence of two-dimensional random vectors , with , and consider a Markov chain given via
[TABLE]
The only condition we impose on at this point is independence form . By a well-known fact, if
[TABLE]
then the Markov chain possesses a unique stationary distribution which can be represented by a random variable of the form
[TABLE]
see [39] for the above or [23] of necessary and sufficient conditions for the convergence. By the stationary, will be a solution to the stochastic equation
[TABLE]
We would like to investigate in the case, where and have comparable tails. We will work under the assumption that . To state the conditions in Definition 2.2 explicitly, we will consider with regularly varying tail, namely for any satisfying
[TABLE]
as and . Moreover, denoting by an independent copy of , assume that
[TABLE]
for some . The case of , when is the class of subexponential distributions, was treated in [17, 33, 38]. To ensure that Cramér’s condition is not satisfied, assume
[TABLE]
At this point it is worth noting that condition (3.4) implies in particular that for any
[TABLE]
see for example [20]. As a particular consequence, the results of Grey [24] will also not apply directly. However, as we will see, one can use a similar approach as the one presented in [24].
Under the above, the tails of and are weakly equivalent, provided that the tail of is of the same order. Note, that if
[TABLE]
then in particular . In view of (3.6), Minkowski’s inequality entails
[TABLE]
for details we refer to Alsmeyer et al. [2] or to Section 5. Without any further assumptions, we were able to prove, that the tails of and are weakly equivalent. Next Proposition will follow form our main result, presented in the Section 4.
Proposition 3.1**.**
Suppose and that conditions (3.4) - (3.7) hold true. Then the Markov chain converges weakly to . Moreover, as ,
[TABLE]
Furthermore, if , then
[TABLE]
At this point we are obliged to mention that the constants we obtain in the claims of Proposition 3.1 are not optimal. Since our main goal is establishing the precise asymptotic of we will not pursue the optimal constants in Proposition 3.1.
To be able to determine the exact asymptotic of some additional conditions need to be imposed. Namely, assume that
[TABLE]
and
[TABLE]
where are some measurable function and denotes the distribution of . Imposing (3.8) and (3.9) will allow us to investigate the case of dependent and with comparable tails. Note that under the above , so that (3.8) and (3.9) imply (3.7). As one of the consequences coming from combining conditions (3.8) and (3.4) is a bound for function . Namely, we may write for any
[TABLE]
while for ,
[TABLE]
Whence, since , for ,
[TABLE]
where . Thus, for example . In a similar fashion we obtain the bound for of the form
[TABLE]
and as a consequence . Denote
[TABLE]
Assuming the presented conditions, we aim to prove the following result.
Theorem 3.2**.**
Assume (3.4) - (3.9) and that a.s. The Markov chain converges weakly to . Moreover,
[TABLE]
Note that under the assumptions of Theorem 3.2 the same comment may be made regarding the left tail of . More precisely,
[TABLE]
To see a few examples, how (3.8) and (3.9) come into play, we state the following Corollary, which treats the case when the tail of is negligible. This covers the possibility that and are independent, as treated in [34], and the possibility that the tail of is negligible, as treated by Kevei [30]. For simplicity we will assume that so that for any .
Corollary 3.3**.**
Assume and that (3.4) - (3.6) hold and moreover that
[TABLE]
for some . Then, as ,
[TABLE]
Proof.
We will invoke Theorem 3.2. To see why (3.8) holds with , take an arbitrary and consider the following two bounds. For the upper one write
[TABLE]
where the last term on the right-hand side is negligible, since in can be bounded viz.
[TABLE]
For the lower bound one can just write simply that
[TABLE]
where again the last term is negligible. Taking first and then yields the desired result. ∎
In turns out that in the case when the knowledge only of marginals of and is insufficient to determine . We note that by the example in the same vein as the one presented in [17].
Example 3.4**.**
We wish to compare the tails of with two types of input, i.e. two different vectors with the same marginals. Take any positive random variable such that and denote , . Firstly, consider and independent, both distributed as . Then, by Corollary 3.3 for
[TABLE]
one has
[TABLE]
Since the first and the second moment of that can be computed explicitly using (3.10), we have
[TABLE]
For the second input consider with the same distribution as . Then
[TABLE]
can be written as
[TABLE]
Invoking Corollary 3.3 once again yields
[TABLE]
where
[TABLE]
Summarizing, , but the asymptotic of tails of and are in general different, since differs form .
The Example above shows, that in order to determine the exact asymptotic of in the case when the tails of and are comparable, we need to have some information regarding the joint distribution of the vector . One example of such information is encrypted in conditions (3.8) and (3.9).
4. Iterated random functions
Natural direction, in which one can generalize Theorem 3.2 is by allowing to take negative values. Another one consists of replacing the function by a random, Lipschitz function . We aim to obtain both these generalizations in this Section, where we will give a statement of our main result in full generality. We will now consider a Markov chain with more general form than (3.1), namely
[TABLE]
where is a sequence of iid random Lipschitz functions, and is a random variable independent of these functions. Note that, if the functions are of the form , recursion (4.1) boils down to (3.1). Let denote a generic element of . As argued by Elton [18], under some mild moment assumptions on , has a unique stationary distribution which, realized by random variable , satisfies
[TABLE]
Our key assumption concerning the function is being Lipschitz with the Lipschitz constant
[TABLE]
satisfying
[TABLE]
Aiming to build upon observations made in previous section, we would like to argue that posses the same tail asymptotic as if is close to the function . This can be achieved in several ways, for example consider of the form
[TABLE]
where
[TABLE]
as and
[TABLE]
Note that, if unbounded, then necessary . Indeed, writing
[TABLE]
we notice that
[TABLE]
We will assume that satisfies the conditions (3.4)-(3.6), that is for
[TABLE]
as and
[TABLE]
Denoting by an independent copy of , we will also suppose that
[TABLE]
for some . To generalize the condition (3.8) and (3.9) assume for some measurable functions ,
[TABLE]
and
[TABLE]
where denotes the law of . Assuming the above we will prove our main result, which was already foreshadowed by the previous Section.
Theorem 4.1**.**
Assume that is a Lipschitz function satisfying (4.3)-(4.9). Then the Markov chain converges weakly to , which is a unique solution to (4.2). Moreover
[TABLE]
Moreover,
- a)
Suppose additionally that is non-decreasing and then
[TABLE]
- b)
Suppose that (4.10) and (4.11) hold. Then
[TABLE]
One novelty of our result is that it allows the non-linear term in to have a substantial contribution.
Example 4.2**.**
Suppose that has the following form
[TABLE]
where are independent and
[TABLE]
Then, if satisfies the conditions of Theorem 4.1,
[TABLE]
Example 4.3**.**
Consider of the form
[TABLE]
Then (4.2) can be expressed as
[TABLE]
In this special instance, shares a distribution with a supremum of a perturbed multiplicative random walk, that is
[TABLE]
Assume for simplicity that . Then . For we may write
[TABLE]
and so (4.4) and (4.5) are satisfied. If the assumptions of Theorem 4.1 are satisfied, that is among others
[TABLE]
we can infer the asymptotic of the form (4.12) which is a multiplicative equivalent of the result obtained in [36].
Example 4.4**.**
Take given viz.
[TABLE]
where and for some fixed positive . Then solves
[TABLE]
Also in this case, the distribution of has a very particular representation, being the supremum of the perpetuity sequence, that is
[TABLE]
Random variables of this form have connections to the ruin problem, for details see [13]. Since , we know that . For one has
[TABLE]
Again, if the assumptions of Theorem 4.1 are satisfied, which means that among others
[TABLE]
we can infer the asymptotic similar to (4.12).
5. Proofs
In order to establish all of our claims, we will proceed in the following fashion. We will prove the entire Theorem 4.1. From this, Proposition 3.1 and Theorem 3.2 will follow. Firstly note, that convergence in (3.4) is uniform in the following sense
[TABLE]
as for any . See for example Bingham et al. [4] We begin by noting that the convergence of follows form the result of Elton [18]. More precisely, note that (4.3) reads
[TABLE]
and that (4.4) and (4.5) imply that for some ,
[TABLE]
The main result of Elton [18] implies the next Proposition.
Proposition 5.1**.**
Assume that satisfies
[TABLE]
for some . Then the Markov chain converges weakly, to , which is a unique solution to (4.2).
We will now establish a weak tail equivalence of and .
Proposition 5.2**.**
Assume that satisfies (4.3)-(4.9). Then
[TABLE]
If we suppose additionally is non-decreasing, then
[TABLE]
Proof.
For the first claim take small enough for
[TABLE]
Next, pick , for which . Then it is true that for - a.a. ,
[TABLE]
where and . Note that by our assumptions . It is true that for any , if is independent from ,
[TABLE]
which means that
[TABLE]
where denotes the stochastic order, i.e. iff for any . Since, due to independence of and , it is also true that , we can infer by the merit of being positive that
[TABLE]
Inductively, we can show this way that for any ,
[TABLE]
Since converges in probability to [math] as , if we pass to the limit in (5.2) we get
[TABLE]
From now, we will focus on delivering the bound for the tail of . The key observation is that
[TABLE]
which means that is the unique stationary distribution of the Markov chain given via
[TABLE]
where is independent of the sequence of iid two-dimensional random vectors . By Proposition 5.1, the converges weakly to for any choice of . Form here, we will follow an idea presented previously by Grey [24]. Consider , where is an independent copy of and are some large constants. We have
[TABLE]
By Lemma 2.3 we can write for some constant ,
[TABLE]
First, pass with and get
[TABLE]
For large and an appropriate choice of we can ensure and obtain
[TABLE]
This results in
[TABLE]
Whence we can pick , such that for
[TABLE]
Define the law of r. v. via
[TABLE]
Then for any
[TABLE]
To see that this is in fact true, consider two possibilities, first of which is . Then
[TABLE]
For , so that is trivial. Now, inductively we can write for any , since ,
[TABLE]
This completes the proof of the upper bound since
[TABLE]
For the lower bound just note that if then and so
[TABLE]
where
[TABLE]
∎
After establishing it is relatively easy to get the exact asymptotic, provided that one is equipped with (4.10) and (4.11). Note that due to the bound
[TABLE]
we know that by the merit of the last Proposition,
[TABLE]
Lemma 5.3**.**
Suppose (4.3) - (4.11) are satisfied. Denote
[TABLE]
and
[TABLE]
Then for independent of we have
[TABLE]
and
[TABLE]
Proof.
The asymptotic of both probabilities and can be treated in the same fashion. Whence, we will consider only the first one. Pick and large . Decompose the probability of interest in the following fashion
[TABLE]
For the first term write
[TABLE]
Take such that . Since (5.1) holds with with , we can find a constant such that for - a.a.
[TABLE]
Whence, by the dominated convergence Theorem we are allowed to infer that
[TABLE]
Since , to treat the second term write
[TABLE]
where and . Here, we have for some constant ,
[TABLE]
For , let and so
[TABLE]
Notice that both and satisfy assumptions of Lemma 2.3. Indeed,
[TABLE]
By an appeal to Lemma 2.3 we get
[TABLE]
Finally, we treat in exactly the case fashion as and arrive at
[TABLE]
where and . This constitutes
[TABLE]
Take and to obtain the claim. ∎
Using the same decompositions and Fatou’s Lemma instead of the dominated convergence Theorem we also have a Lemma corresponding to the lower limits.
Lemma 5.4**.**
Suppose (4.3) - (4.11) are satisfied. Denote
[TABLE]
and
[TABLE]
Then for independent of we have
[TABLE]
and
[TABLE]
Proof of Theorem 4.1.
In view of Propositions 5.1 and 5.2, only (4.12) needs to be proved. Denote
[TABLE]
The fact that satisfies (4.2) combined with Lemmas 5.4 and 5.3 gives us
[TABLE]
and
[TABLE]
Since and , the two inequalities above imply that and . Thus, another appeal to Lemmas 5.4 and 5.3 yields
[TABLE]
Since this system can be solved explicitly, this proves our Theorem. ∎
Appendix
Here, we gathered some facts related to the classes and that we used in the article. Recall, that we will consider distribution with right-unbounded support. Write for th-convolution of and for its tail, that is . Before we prove Lemma 2.3 we need the following auxiliary result.
Lemma A.1**.**
Suppose that . Then for any fixed , the limit
[TABLE]
exist. Moreover
[TABLE]
Proof.
We proceed as in the proof of Lemma 2.7 in [20]. Let and let be two independent random variables with law . Then
[TABLE]
Hence
[TABLE]
The third term can be managed quite easily as , since by the merit of (2.1),
[TABLE]
For the same reason, by an appeal to the Lebesgue dominated convergence Theorem, we can identify the limit of the first term as
[TABLE]
In view of (2.2) we are allowed to conclude that
[TABLE]
This proves our first claim. The second one follows by letting . ∎
Lemma A.2**.**
Suppose that and that for and . Then for one has
[TABLE]
Proof.
We have
[TABLE]
which can be bounded further by integrating by parts
[TABLE]
which competes the proof. ∎
Proof of Lemma 2.3.
To prove (2.3) suppose . As in the proof of Lemma A.1 we write
[TABLE]
The first term, by the Lebesgue dominated convergence Theorem, tends to
[TABLE]
Note that the second term can be treated in exactly the same fashion. The third one, by Lemmas A.1 and A.2 in negligible, i.e.
[TABLE]
Finally, for the last term one has
[TABLE]
Letting we obtain (2.3). The proof of the fact, that whenever is exactly the same as that of Lemma 2.7 in [20]. The fact that the distributions there are supported on doesn’t play any role. In order to argue in favour of 2.5, fix and take big enough such that . Let be such that for
[TABLE]
Then in view of (A.3)
[TABLE]
Keeping fixed and taking possibly larger we have
[TABLE]
which shows (2.5) and (2.4). (2.6) and (2.7) are obtained in the same way. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] G. Alsmeyer. On the stationary tail index of iterated random lipschitz functions. Stochastic Process. Appl. , 126:209–233, 2016.
- 2[2] G. Alsmeyer, A. Iksanov, and U. Rösler. On distributional properties of perpetuities. Journal of Theoretical Probability , 22(3):666–682, 2009.
- 3[3] L. Arnold and H Crauel. Iterated function systems and multiplicative ergodic theory, in diffusion theory and related problems in analysis ii, m.pinsky and wihstatz, eds. pages 283–305, 1992.
- 4[4] N. H. Bingham, C. M Goldie, and J. L. Teugels. Regular variation , volume 27. Cambridge university press, 1989.
- 5[5] M. Borkovec and C. Klüppelberg. The tail of the stationary distribution of an autoregressive process with arch (1) errors. Annals of Applied Probability , pages 1220–1241, 2001.
- 6[6] S. Brofferio and D. Buraczewski. On unbounded invariant measures of stochastic dynamical systems. Ann. Probab. , 43:1456–1492, 2015.
- 7[7] D. Buraczewski and E. Damek. A simple proof of heavy tail estimates for affine type lipschitz recursions. Stochastic Process. Appl. , 2016.
- 8[8] D. Buraczewski, E. Damek, and T. Mikosch. Stochastic Models with Power-Law Tails . Springer, 2016.
