Persistence of heavy-tailed sample averages: principle of infinitely many big jumps
Ayan Bhattacharya, Zbigniew Palmowski, Bert Zwart

TL;DR
This paper investigates the tail behavior of the first exit time for a heavy-tailed random walk, revealing it follows a lognormal distribution and that large exit times are typically caused by a logarithmic number of big jumps.
Contribution
It establishes the lognormal tail distribution of the exit time and characterizes the typical jump pattern leading to large exit times for heavy-tailed random walks.
Findings
First exit time tail is of lognormal type.
Large exit times are caused by a logarithmic number of big jumps.
The behavior is characterized for random walks with regularly varying step sizes.
Abstract
We consider the sample average of a centered random walk in with regularly varying step size distribution. For the first exit time from a compact convex set not containing the origin, we show that its tail is of lognormal type. Moreover, we show that the typical way for a large exit time to occur is by having a number of jumps growing logarithmically in the scaling parameter.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Markov Chains and Monte Carlo Methods · Mathematical Dynamics and Fractals
Persistence of heavy-tailed sample averages: principle of infinitely many big jumps
Ayan Bhattacharya label=e1][email protected] [
Zbigniew Palmowski label=e2][email protected] [
Bert Zwartlabel=e3][email protected] [ Centrum Wiskunde & Informatica\thanksmarkm1 and Wrocław University of Science and Technology\thanksmarkm2
Centrum Wiskunde & Informatica
P.O. Box 94079
1090 GB Amsterdam, Netherlands
Wrocław University of Science and Technology
Department of Applied Mathematics
Faculty of Pure and Applied Mathematics
wyb. Stanisława Wyspiańskiego 27, 50-370 Wrocław, Poland
Abstract
We consider the sample average of a centered random walk in with regularly varying step size distribution. For the first exit time from a compact convex set not containing the origin, we show that its tail is of lognormal type. Moreover, we show that the typical way for a large exit time to occur is by having a number of jumps growing logarithmically in the scaling parameter.
60F99, 60G10, 60G50, 60G18, 60G52, 60K35, 60K40,
keywords:
[class=MSC]
keywords:
Persistency, Regular variation, Heavy-tailed distribution, Random walk, Large deviation
\startlocaldefs\endlocaldefs
,
and
t1Partially supported by Dutch Science foundation NWO VICI grant # 639.033.413 t2Partially supported by Polish National Science Centre Grant
2018/29/B/ST1/00756 (2019-2022)
1 Introduction
We consider an exit problem for the sample mean of an -valued random walk with zero mean, where the step size has a distribution which is of multivariate regular variation. Specifically, let be an i.i.d. sequence of random variables in () such that has a multivariate regularly varying distribution with index (written as ) where denotes a generic step. Therefore, there exists an increasing sequence of positive real numbers with and a non-null Radon measure on with \mu\big{(}\bar{\mathbb{R}}^{d}\setminus\mathbb{R}^{d}\big{)}=0 such that
[TABLE]
for every satisfying ( denotes the boundary of ) and ( denotes the closure of ). The limit measure necessarily obeys a homogeneity property, that is, there exists such that (where ) for every and . We assume that
[TABLE]
Additionally, we assume that the -valued random vector satisfies
[TABLE]
With , we associate the random walk
[TABLE]
for all In this paper, we investigate the behavior of the survival probability
[TABLE]
as , where is a compact convex set with non-empty interior that does not contain the origin and
[TABLE]
where denotes interior of the set . This assumption implies that for every . On the other hand, (1.3) and the LLN subsequently imply that and our aim is to establish its convergence rate.
Our motivation behind this investigation is two-fold. First of all, is an example of so-called persistence probability, that is the probability that sample average ‘persists’ in the set for at least steps. It can also be interpreted as the survival function of the first time the sample average exits from the set .
Persistence probabilities and related exit problems have recently received a lot of attention in probability theory and theoretical physics. In many situations of interest, for a stochastic process in discrete or continuous time and some exit time , it turns out that the behavior is either polynomial-like, that is , or exponential-like, that is for a non-negative parameter called the persistence exponent (or survival exponent). This exponent usually does not depend on the initial position of the process under consideration. Random walks and Brownian motions have been analysed in [13, 15, 20, 26, 34, 33]. For results on Gaussian processes, see [10, 16, 25], and references therein. If the process under consideration is stationary and one-dimensional, and the set is a shifted half-line, the law of corresponds to a first passage time. In this case, fluctuation theory (see [14]) may be applied; see e.g. the survey [3] for an overview concerning mainly Lévy processes and (integrated) random walks. Other one-dimensional processes have been studied; see for example [21] for autoregressive sequences. Recent work on time-homogeneous Markov chains can be found in [2]. When for all (hence ), the behavior of can be derived from Mogulskii’s theorem, cf. [11, Thm. 5.1.2, p. 176]. For a recent survey on persistence probabilities we refer to [8].
Our investigation distinguishes from the above-mentioned works by focusing on the sample average , which is a time-inhomogeneous -valued Markov chain. As mentioned in [8], the study of sample averages, and more generally occupation measures, is challenging. In the case investigated here, we find out that the asymptotics of is of lognormal type. That is, there exists a constant depending on the shape of the set and such that
[TABLE]
Thus, the behavior of is fundamentally different from the two earlier described cases. We manage to identify explicitly. For example, if and with , then the persistence exponent equals
[TABLE]
In the case , we provide a simple variational characterization of .
An explanation of this untypical asymptotics brings us to our second motivation of this paper, which is to obtain a sharper understanding of the nature of heavy-tailed large deviations. In turns out that the problem we consider exhibits a new qualitative phenomenon in the following sense: we prove that the typical way of getting a large exit time is by having a number of jumps which is growing logarithmic in the scaling parameter . Hence persistency in our case is caused by infinitely many large jumps. In other words, the principle of a single big jump used in a significant number of studies (see [19] and references therein) does not hold here.
In addition, heavy-tailed sample-path large deviations theorems such as recently derived in [29] do not apply either. In [29], a sample-path large deviations result for the rescaled random walk , with and , has been developed in the case . For a large collection of sets , the results in [29] imply that
[TABLE]
as with some rate function . This result can be applied to investigate the probability, for fixed ,
[TABLE]
If is not an integer, it can be shown that
[TABLE]
The intuition, which can be made precise using the conditional limit theorems in [29], is that the most likely way for to stay in the set for is by having large jumps. In the case we are interested in, jumps will not be sufficient for to be persistent. Therefore, has different asymptotics. Moreover, note that it is tempting to proceed heuristically, and take in (1.9). Apart from not being rigorous, the resulting guess of would actually be off by a factor .
There exist several approaches that can be used to derive the existence, as well as expressions of persistence exponents. In the case of more general processes, the Markovian structure is typically exploited. This allows to relate the persistence exponent to an eigenvalue of an appropriate operator, allowing to marshal analytic methods. This idea is related to identifying so-called quasi-stationary distributions (see [4] for the Brownian motion, [6, 13, 23] for random walks and Lévy processes, [9, 17] for time-homogeneous Markov processes and [1, 18, 24] for continuous-time branching processes and Fleming-Viot processes).
Our work is based on constructing a typical path for the random walk and showing that this path, sometimes also called the optimal path, is the most likely way for persistence to occur. For the optimal path is depicted in Figure 1 (where the jumps are coloured by red) and it is constructed in the following way. Fix a positive finite integer . Suppose that the path stays inside the envelope for all and the path is at at time . Because of the zero drift assumption, the random walk stays around as long as possible, that is until time . At time , it makes the first big jump so that it reaches to the maximum height () possible and stays there as long as possible, that is until . Then it again makes a jump. This strategy can be applied recursively, and the resulting path turns out to be the optimal sample path for the event . Suppose that denotes the time of the -th jump whose size is denoted by . Then we will show that a random time can be replaced by for large enough with high probability. Let denote the number of big jumps needed until time , i.e. . Then can be replaced by for large with high probability. As we said above, the optimal path can be represented by the random measure
[TABLE]
where is a Dirac measure putting unit mass at . Moreover, the probability of a jump of size during is of order . Therefore is roughly of order . This produces the required estimate where we write if for some constants and .
The main idea works also in dimension by choosing an ’optimal’ direction that is attaining the supremum , cf. (2.2) below. Using this, we create a convenient inner set of that is big enough to achieve a sharp enough lower bound for . For this inner set, we take a carefully constructed hypercuboid. A key property is then a certain closure property of a class of hypercuboids under a direct sum operation. Another essential feature of our approximation by a sequence of hypercuboids is that we need to allow the fluctuation of the random walk in some directions though the large jumps happen in the optimal direction only; see Figure 2.
This paper is organized as follows. In Section 2, we present the main results Theorem 2.1 (-dimensional random walk with ) and Theorem 2.3 () and their consequences along some important examples. In Theorem 2.1, we have assumed the angular measure to be absolutely continuous (with respect to the Lebesgue measure on the surface of the unit sphere) and so, this result does not apply to the -dimensional random walk with independent coordinates (angular measure becomes purely atomic). So in subsection 2.2, we present the persistence exponent for a multi-dimensional random walk such that the co-ordinates are independent and the exponent of regular variation might not be the same for every co-ordinate. In section 3, we present the proof of Theorem 2.1. The proof is divided into two parts. In subsection 3.1 and subsection 3.2, we derive upper and lower bound for the persistence exponent respectively. We further show that the upper and lower bound match and hence, Theorem 2.1 follows. The auxiliary results needed to derive the lower bound for the persistence exponent are proved in subsection 4.1. In subsection 4.2, we present a sketch of the proof of Theorem 2.3.
2 Main results
In the definition of regular variation on , we have seen that there exists a Radon measure satisfying the homogeneity property. We first consider . The homogeneity property of implies that can also be written as a product measure on where and . The distance between two sets will be denoted by . We need to introduce the polar coordinate transformation to write down the product measure form of . The polar co-ordinate transformation is given by , with . This has inverse transformation given by , where denotes scalar multiplication of the vector and a positive real number . The vector can be interpreted as the direction and is the distance in the direction .
It is known (e.g. Theorem 6.1 in [28]) that (1.1) is equivalent to the existence of a Radon measure on such that
[TABLE]
where and and is a measure on such that for any . We will assume that the spectral (angular) measure is absolutely continuous with respect to the Lebesgue measure on the unit sphere. Note that the spectral measure may not satisfy this assumption: for example it can be atomic if we consider the case where the components of the random vector are independent. Note also that the polar transform is a non-linear transform, that is, the polar transform of a random walk is not a random walk. Thus, the polar transform can not be used directly to get a one-dimensional positive random walk and compute the persistence exponent from this simpler object. But this decomposition helps to understand the limit. Intuitively, it is clear that the persistence exponent must be based on the radial part of the set under consideration.
We write for any measurable subset . We consider a compact and convex set which is bounded away from (). It is clear that is also compact. We can then write where and . It is clear that and are continuous functions of as the boundary of a bounded convex set is connected and for every as is bounded away from . Thus, we can conclude that is a continuous function of . Define
[TABLE]
Then there exists such that as is compact. This may be non-unique, in which case we fix an arbitrary solution throughout the paper. Without loss of generality, we can assume that points in the direction of the positive orthant of . If it is not the case, then we can rotate the axes to ensure that it holds. We are now ready to present the main result of this work.
Theorem 2.1**.**
Assume that the angular measure is absolutely continuous with respect to the Lebesgue measure, positive on the unit sphere and the set with non-empty interior is compact, convex such that . Under the conditions (1.2), (1.3) and (1.5), we have
[TABLE]
Remark 2.2*.*
The persistence exponent and in particular can be computed by developing an alternative representation for . It is not difficult to see that is equal to the largest value of such that where . Since any convex set in is the intersection of a countable number of half-spaces, there exist vectors and constants for such that
[TABLE]
where denotes the inner product of vectors and . Defining the convex function
[TABLE]
the problem of maximizing such that can now be equivalently written as the solution of the convex program
[TABLE]
subject to
[TABLE]
2.1 One-dimensional random walk and interval
For and the set with , we consider a collection of independent copies of the -valued, mean-zero regularly varying random variable such that
[TABLE]
for , such that a tail balance condition
[TABLE]
holds true, where is a slowly varying function. This is equivalent to assumption (1.1) in the case . With , we consider the associated random walk (without using boldface).
Theorem 2.3**.**
*Under the assumptions stated above, *
[TABLE]
for every .
Note that the above theorem is not a straightforward corollary of Theorem 2.1 since the associated angular measure is necessarily atomic in . However, we will briefly show later in the Appendix that its proof follows from the same steps as the proof of Theorem 2.1.
Theorem 2.3 can be used to derive an upper bound for the probability in Theorem 2.1 by projecting a -dimensional random walk in a certain direction. This leads to a natural upper bound for in terms of a persistence probability for a one-dimensional random walk. In particular, for any -dimensional vector ,
[TABLE]
where
[TABLE]
The assumptions on and imply that is an interval of the form . A natural question is now whether the bound
[TABLE]
for defined in (1.6) is sharp. This kind of bounding techniques are often applied in light-tailed large deviations. It can be shown that this bound is sharp if is a Euclidean ball bounded away from the origin. However, if is a rectangle in the positive orthant, then the bound is only sharp if and only if the diagonal connecting the southwest corner and northeast corner of also passes through the origin. We leave these details as an exercise.
2.2 Nonstandard regular variation
Suppose that is a random vector such that ’s are independent and have regularly varying tails with index of regular variation and slowly varying function . This is known by the name of nonstandard regular variation in the theory of regular variation (see [28, Subsect. 6.5.6]). Then exploiting the independence of components of we can get the following easy corollary of Theorem 2.3.
Corollary 2.4**.**
Suppose that the vector is such that ’s are independent and have regularly varying distribution with index of regular variation and each satisfies the assumptions in Theorem 2.3. Then
[TABLE]
Note that this cannot be obtained as a corollary of Theorem 2.1 as if the ’s are not equal. Even if for all , then it is known in the literature (see Section 6.5.1 in [28]) that the angular measure corresponding to the limit measure is purely atomic and concentrated on the axes which does not fall under the assumptions of Theorem 2.1. Moreover, when all ’s are identical, the expression for given in Theorem 2.1 does not coincide with the persistence exponent (2.12).
3 Proof of Theorem 2.1
The proof of Theorem 2.1 will be divided into proving the respective asymptotic lower and upper bounds.
3.1 Upper bound
We will show that
[TABLE]
Step 1. We divide the set of time points into smaller segments. Fix . Then we choose a positive integer such that
[TABLE]
Define , and recursively for all . We also define
[TABLE]
for all . As a consequence, we obtain and . Note that for all . Using these inequalities recursively combined with the fact yields
[TABLE]
The choice of in (3.2) makes the numerator in the second term in the right hand side of (3.4) well defined.
Define for all . Then we have the following bound for :
[TABLE]
using the product formula of conditional probability.
Step 2. Fix . Then it will be shown in Step 4 that there exists a positive integer such that
[TABLE]
where is some positive real number. If , then we can use this property to obtain
[TABLE]
using the independent increment property of the random walk. Combining (3.5) and (3.7), we infer that
[TABLE]
as for all using . Note that . This fact leads to the following form of the upper bound for :
[TABLE]
To bound this expression further, we shall use the following estimate, taken from [22, Lem. 2.1]:
[TABLE]
on . Fix . Note that as . So there exists a positive integer such that
[TABLE]
for all , where is the slowly varying function appearing in the tail distribution function of , and is some appropriately chosen positive finite real number. In addition, we have used the fact that is bounded above as is a slowly varying function and as .
Step 3. Fix . We now use Potter’s bound (see e.g. [27, Prop. 0.8(ii)]) which says that there exists an integer such that for all . Define . Combining the expressions obtained for the upper bound in Step 1 and Step 2, we have
[TABLE]
Using the upper bound for obtained in (3.4), straightforward algebra yields
[TABLE]
The upper bound (3.1) follows by taking logarithms, dividing by , letting , and finally .
Step 4. Here we shall prove the claim stated in (3.6). We first observe that as . This implies the existence of a positive integer such that for all . We consider from now on. It is clear that
[TABLE]
Note that is uniformly continuous in . Using that is compact and every continuous function attains its extrema on a compact set, we conclude that there exists an element such that . Using continuity of the distance function with the compactness of once again, we get
[TABLE]
for some pair of elements . To prove our claim, it is enough to show that there does not exist any pair of elements and such that . We prove this by contradiction, so we first assume that there exists a pair and of elements in such that holds. It follows from the property of the Euclidean norm that and so and are the vectors in in the same direction with . This contradicts the definition of (see (2.2)) as . Hence, the proof is complete.
3.2 Lower bound
The proof of the lower bound
[TABLE]
is much more demanding. Using (2.4), and the discussion following that equation, we define as the solution of
[TABLE]
subject to
[TABLE]
We can equivalently write this as as the solution of the problem
[TABLE]
subject to the constraints
[TABLE]
Since is compact, has compact level sets for levels . Since is continuous on and has a non-empty interior, there exists a such that the subset of is non-empty, and so we see that on in a neighborhood of [math]. Since is a composition of convex functions, it is jointly convex on . Thus, we can apply Theorem 4.2(c) of [7] with , , and to conclude that is continuous in a neighborhood of [math].
Consider the set with . It is clear that is a proper subset of . Note that is a convex and compact set. So there exist an optimal direction and a straight line (in the direction and the ratio of endpoints ) such that the straight line is contained in . It is immediate that . As a consequence of the Theorem 4.2(c) of [7], it follows that as . As , there exists a such that for all . Let us fix . Note that a segment of straight line (with ratio of the endpoints as ) in the direction lies in the interior of . Therefore, we can construct a hypercuboid inside the set (aligned in the direction and the ratio of the endpoints is in the direction ). As , all the constructions needed to obtain the lower bound in (3.61) are possible. Using the same steps, we obtain a lower bound as in (3.61) with replaced by . As can be chosen to be arbitrarily close to [math], we can let and obtain the desired lower bound.
Without loss of generality, we can now assume that for an interior of , that is for some . Indeed, if , then one can consider the set instead and then take at the last step of the proof.
Strategy of the proof: In the first step, we shall construct a hypercuboid (the direction of any point in the hypercuboid lies in the -neighbourhood of in ) which is aligned in the direction and contained in . We further show that converges (in the sense of convergence of sets) to the section of the straight line in in the direction as . As we are concerned about the lower bound of , we replace by . In the second step, we partition the index set into subsets as we did in the upper bound. The main difference here is that the partitions depend on the length of in the direction , and an auxiliary parameter (choice of depends on ). We also construct such that for large enough where is the left end point of . We then discuss the strategy for the lower bound in Step 3 and realize the strategy in the rest of the proof.
Steps 1.(Construction of a hypercuboid inside approximating the chord of in the direction .) Define for some . From the above assumption , we can fix satisfying . This implies that the solid cone has non-empty intersection with . We shall say a hypercuboid is aligned in the direction if the hypercuboid is specified by the orthogonal set of unit vectors with . We define to be the largest hypercuboid contained in . It is clear that as , converges to the straight line . Hence it is clear that converges to . These observations can be used to obtain that converges to using the notion of convergence of sets (as defined in [31, Def. 4.1]). Also, note that (1.5) implies that there exists an such that for some . To see this, suppose that where for all . Since and the sets are nested, we can apply the monotone convergence theorem to conclude that which is a contradiction to (1.5). So we can choose for small enough such that .
To specify the -dimensional hypercuboid , define
[TABLE]
Note that . Moreover, we have chosen in such a way that
[TABLE]
and
[TABLE]
We have
[TABLE]
Steps 2. (Partitioning the index set and construction of .) As in Step 1 of the upper bound, we divide into smaller pieces. Define
[TABLE]
and note that as by (3.20). Fix a constant small enough so that the following inequality holds:
[TABLE]
Note that such a always exists as allows us to choose . Define
[TABLE]
For large enough , we define
[TABLE]
where
[TABLE]
It is easy to check that for large enough . We fix a large integer . Then we have following lower bound for the right hand side of the expression (3.21):
[TABLE]
where
[TABLE]
and
[TABLE]
Step 3. We choose the number in (3.26) such that . We divide the segment into two parts and . The first part of the segment will be allowed to contribute only to the fluctuation of the random walk where the contribution will be at most of order . We will use the independent increment property of the random walk and the generalized Kolmogorov’s inequality (see (4.8)) to show that the probability of this event is close to one for large enough . Observe that the distance between the sets and is of order , which makes a jump of order necessary. This necessary jump will occur in the second part of the segment and this part will also contribute to the fluctuation. To analyze this segment we will introduce sets and such that . Due to the choice of , the jump can occur at any time point in the interval . This strategy, combined with the regular variation of and absolute continuity of the law of with respect to the uniform angular measure, produces the lower bound for the probability in (3.27) which is roughly of order . Then we let and to get the desired constants matching the constants in the upper bound.
To realize the strategy, we need the additional sets
[TABLE]
and
[TABLE]
In the following lemma we introduce their basic properties. Its proof will be given later in the Appendix.
Lemma 3.1**.**
For large enough , we have
[TABLE]
and
[TABLE]
Using this lemma, we get the following lower bound for the -th conditional probability in (3.27):
[TABLE]
for all large enough , where
[TABLE]
and
[TABLE]
We shall deal with each of these terms separately.
Term . Note that
[TABLE]
where
[TABLE]
Moreover, we have
[TABLE]
Using this inclusion, we obtain
[TABLE]
From the independent increment property of the random walk we can conclude that
[TABLE]
Thus, the lower bound obtained in (3.38) equals
[TABLE]
We shall now use the positivity of the angular measure to show that the projections of the random walk in each of the directions are one-dimensional random walks with the same asymptotic tail behaviour. Then, the generalized Kolmogorov inequality (stated in (4.8)) is used to obtain the required lower bound. We shall mention the lower bound in the next proposition which will be proved in the Appendix.
Proposition 3.2**.**
Fix . Under the assumptions in Theorem 2.1, there exists a large integer such that for all , we have
[TABLE]
Thus, from Proposition 3.2 it follows that
[TABLE]
for all large enough ’s.
Term . Note that
[TABLE]
We now define several sets which will be necessary for the rest of the analysis
[TABLE]
Note that the numerator of given in (3.43) has the following lower bound
[TABLE]
Finally, we can combine (3.43) and (3.47) to have
[TABLE]
We now decompose the event into disjoint events , where
[TABLE]
It follows from the definition of the events are exchangeable and hence have the same probabilities. So we have
[TABLE]
We now estimate . Note that
[TABLE]
using the independent increment property of the random walk. It can easily be derived from Proposition 3.2 that
[TABLE]
for large enough . We are now left with the estimate of the probability . To do that we shall use the fact that the random variable has a regularly varying tail. It follows easily from that for large enough where
[TABLE]
It is easy to see that is bounded away from . Using regular variation of the tail of , we get that
[TABLE]
This means that for large enough , we have
[TABLE]
where is a fixed small number. Combining these facts, we get
[TABLE]
for large enough .
Combining (3.50), (3.51) and (3.56), for large enough , we have
[TABLE]
Steps 4. It follows from (3.34), (3.42), and (3.57) that the -th conditional probability in (3.27) can be bounded from below by
[TABLE]
This estimate, combined with the product formula (3.27), yields the following lower bound for the probability (3.21):
[TABLE]
It follows from the definition of in (3.28) that
[TABLE]
We now use Potter’s bound to have for large enough where can be chosen to be arbitrarily small but positive. Now, some straightforward algebra combined with the estimate in (3.60) leads us to the following
[TABLE]
We can now let , and to get the desired constant in the right hand side of (3.61), using the continuity of in .
4 Rest of the proofs
This section is divided into two subsections. In subsection 4.1, we shall first prove the auxiliary results mentioned in subsection 3.2 to derive the lower bound (3.61). In Subsection 4.2, we provide a sketch of the proof of Theorem 2.3.
4.1 Proofs of auxiliary results
Proof of Lemma 3.1.
We first prove (3.32). It is clear from the definition of and that and it is enough to show that . To establish this, we first consider the direction . Note that
[TABLE]
Comparing the interval with the projection of the set along the direction , it follows from that for all and for large enough . We now consider the directions , where . Fix and note that
[TABLE]
as and for all . Comparing this interval with the projection of along the direction , it follows from that for large enough and for all . Hence the inclusion in (3.32) follows.
We proceed with a proof of (3.33). As , it will be enough to show that . Consider first the direction and note that
[TABLE]
Moreover, we have that for all and for large enough . This completes the proof of the inclusion in the direction. Fix now . Then
[TABLE]
as and . Note that for large enough and for all . This completes the proof of the inclusion stated in (3.33).
∎
Proof of Proposition 3.2.
Note that
[TABLE]
Fix for and such that . We claim
[TABLE]
for sufficiently large . To prove it we will use the following lemma.
Lemma 4.1**.**
Let and on with being absolutely continuous with respect to the Lebesgue measure. Then for any direction vector , we have where is a Radon measure on with
[TABLE]
Using Lemma 4.1, note that is a mean [math] random walk with steps for all . For , we will apply the generalized Kolmogorov inequality given in [32]:
[TABLE]
where is some constant and . In this case, as [32] noted, is regularly varying with index (or slowly varying if ). For we can apply the classical Kolmogorov inequality. In both cases we can bound
[TABLE]
where appears due to Potter’s bound applied to the slowly varying part of and is some constant. For sufficiently small, this upper bound gives (4.6) as with .
Similarly, we can prove that
[TABLE]
for large enough . Hence the proof of the proposition follows from the lower bound obtained in (4.5). ∎
Proof of Lemma 4.1.
To prove this lemma, we need to find such that
[TABLE]
for any such that . It is enough to show convergence in (4.9) for the collection of sets as these collection of intervals is a -system (see [28, Lem. 6.1]). We consider the case for . The set with can be handled similarly. If we consider , we get
[TABLE]
as is bounded away from and it can be proved that does not put any mass at the boundary of this set. Thus, the limit exists and satisfies the scaling homogeneity property. To complete the proof it suffices to show that \mu\Big{(}\{\mathbf{x}:\langle\mathbf{u},\mathbf{x}\rangle>1\}\Big{)}>0. We show this by using polar decomposition, invoking our assumption on the angular measure. Note that
[TABLE]
It is now enough to prove that {\rm Leb}\Big{(}\{\mathbf{y}:\langle\mathbf{u},\mathbf{y}\rangle>0\}\Big{)}>0. Note that if , then . This implies that . Finally, we note that is strictly positive, since contains only elements. Hence . ∎
4.2 Proof of Theorem 2.3
The proof is similar to the proof given in Section 3. Therefore, we will provide a brief sketch of the proof below to indicate the similarity and obvious differences between these two cases.
Upper bound. We follow the steps given in Subsection 3.1. We follow Step 1 with in the definition of . Then the one-dimensional analogues of (3.7) and (3.5) lead to the following inequality
[TABLE]
We can again use [22, Lemma 2.1] with to obtain the upper bound in (3.11). Then Step 3 produces the desired upper bound.
Lower bound. As , it follows that
[TABLE]
for any integer . Define and consider which satisfies
[TABLE]
We then define for every , with a fixed large integer and . We then decompose the index set where . We also construct a set such that for large enough . We then enforce for all large enough which yields the lower bound to of the required order. By construction, we make sure that and the distance between the sets and is of the order of magnitude . This event enforces the segment
BZ: the notation with ”:” is overloaded and it doesn’t look neat. Let’s discuss how to fix it.
to travel a distance of order . We then write down in the following product form
[TABLE]
By construction, the set is not accessible to the segment initially. Hence, we find a positive constant such that is accessible to and further decompose the segment into two parts given by and where and . In the first part of the segment, the random walk only contributes to the fluctuation (it can only travel a distance of order where ). The second part of the segment contains one necessary jump of order and the rest of the steps contribute to the fluctuation in an accumulated way.
To realize this strategy, we use the stationarity and the independence of the increments to write down the -th term in the product formula (4.14) in terms of . The generalization of Kolmogorov’s inequality (stated in (4.8)) is used to show that the first part can contribute to the fluctuation with high probability. The probability of the second part containing a jump of magnitude is roughly of order leading to the right constant in Theorem 2.3. Thus the proof follows if we choose the constant and construct in an appropriate way for large enough .
We define
[TABLE]
To realize the strategy fully, we shall design two auxiliary sets and such that for all and for all . We define
[TABLE]
It is easy to check that and satisfy the requirements for large enough (see proof of Lemma 3.1). Therefore, we have the following lower bound on the -th conditional probability in (4.14):
[TABLE]
We shall now derive lower bounds for the terms and separately.
Note that the term can be written as
[TABLE]
where . Observe that on the event , implies . Therefore, we have the following lower bound for the numerator in (4.19):
[TABLE]
We can now use the independence of the segments and , and the distributional identity to obtain the following lower bound for :
[TABLE]
We can now use the generalized Kolmogorov’s inequality when to conclude that the lower bound in (4.21) is close to one if we choose large enough.
We shall derive the exact asymptotics for the term for large enough . We want to create an envelope for the segment so that the segment contains exactly one large jump (of absolute magnitude ) to ensure . To write down the envelope explicitly, we need the following intervals
[TABLE]
For large enough , we have the following inclusion
[TABLE]
We now observe that the left-hand side of the inclusion (4.23) can be decomposed into two independent events using the independent increment property of the random walk. Combining these facts, we obtain the following lower bound for the term :
[TABLE]
We now decompose the event inside the probability in the right hand side of (4.24) into disjoint events by taking into account the location of the large jump in the interval . The following event helps to write down the decomposition
[TABLE]
for every . It is easy to check that implies the event inside the probability in (4.24). We can now use exchangeability of the random variables to see that for every and obtain the following lower bound for :
[TABLE]
For large enough , the last probability in (4.25) is very close to as we have seen earlier in the analysis of term and so, we can ignore that for the further analysis. We can use now regular variation to conclude that
[TABLE]
as . The lower bound now follows from simple algebra (see (3.12) in Step 3 in the proof of (3.1)), and by letting .
Acknowledgement
The authors are thankful to Guido Janssen for an analytic computation which led to a correct guess of the proper normalization of . The authors are thankful to a referee for valuable suggestions which improved the quality of the exposition.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Asselah, A., Ferrari, P.A., Groisman, P. and Jonckheere, M. (2016). Fleming–Viot selects the minimal quasi-stationary distribution: The Galton–Watson case. Ann. Inst. Henri Poincaré Probab. Stat. 52(2) 647–668.
- 2[2] Aurzada, F., Mukherjee, S. and Zeitouni, O. (2017). Persistence exponents in Markov chains. ar Xiv preprint ar Xiv:1703.06447 .
- 3[3] Aurzada, F. and Simon, T. (2015). Persistence probabilities and exponents. In Lévy matters V 183–224. Springer.
- 4[4] Bañuelos, R. and Smits R.G. (1997). Brownian motion in cones. Probab. Theory Related Fields 108(3) 299–319.
- 5[5] Basrak, B., Davis, R. and Mikosch, T. (2002). A characterization of multivariate regular variation. Ann. Appl. Probab. 12(3) 908–920.
- 6[6] Bogdan, K., Palmowski, Z. and Wang, L. (2018). Yaglom limit for stable processes in cones. Electron. J. Probab. 23(11) 1–19.
- 7[7] Bonnans, J. F. and Shapiro, A. (1998). Optimization problems with perturbations: a guided tour. SIAM Rev. 40(2) 228–264.
- 8[8] Bray, A.J., Majumdar, S.N. and Schehr G. (2013). Persistence and first-passage properties in nonequilibrium systems. Adv. in Physics 62(3) 225–361.
