Large deviations for infinite weighted sums of stretched exponential random variables
Frank Aurzada

TL;DR
This paper investigates the probabilities of rare large deviations in infinite weighted sums of independent random variables with stretched exponential tails, extending previous work to more general tail behaviors.
Contribution
It generalizes existing large deviation results to infinite sums with stretched exponential tails, beyond finite exponential moments.
Findings
Derived large deviation probabilities for infinite weighted sums with stretched exponential tails.
Extended previous finite sum results to infinite sums in this tail regime.
Provided new theoretical bounds for rare event probabilities in this setting.
Abstract
We study the large deviation probabilities of infinite weighted sums of independent random variables that have stretched exponential tails. This generalizes Kiesel and Stadtm\"uller (2000), who study the same objects under the assumption of finite exponential moments, and Gantert et al.\ (2014), who study finite weighted sums with stretched exponential tails.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Large deviations for infinite weighted sums of stretched exponential random variables
Frank Aurzada
Abstract
We study the large deviation probabilities of infinite weighted sums of independent random variables that have stretched exponential tails. This generalizes Kiesel and Stadtmüller [12], who study the same objects under the assumption of finite exponential moments, and Gantert et al. [8], who study finite weighted sums with stretched exponential tails.
Keywords: independent, identically distributed random variables; large deviations; stretched exponential random variables; weighted sums
2010 Mathematics Subject Classification: 60F10
1 Introduction
A classical result in probability theory is Cramér’s theorem for the large deviations of sums of independent, identically distributed random variables: If is an i.i.d. sequence with zero mean and for some the moment generating function is finite then
[TABLE]
It is also classical that Cramér’s theorem can be extended to a full large deviation principle; and it can be seen as the starting point of large deviation theory, see e.g. [6, 7].
Whenever the random variables do not have any finite exponential moment, the behaviour of the large deviations is different. This is due to the fact that then the large deviation event is produced by only one variable being unusually large. The classical result here (cf. [14]) is as follows: if is an i.i.d. sequence with stretched exponential tail, , as for some , and finite expectation then
[TABLE]
In this paper, we study weighted sums of i.i.d. random variables. There is quite some literature on large devations of weighted sums and their applications. The most recent general reference is Kiesel and Stadtmüller [12] (also see [1, 2, 3, 4, 5, 9, 10, 15] for further references). However, these papers deal with random variables that do have some finite exponential moment.
The only source, to the knowledge of the author, that deals with weighted sums of random variables that do not have any finite exponential moment is Gantert et al. [8]. There, finite sums of the type are considered when the random variables have stretched exponential tails.
In this note, we treat the case of infinite weighted sums with i.i.d. random variables having stretched exponential tails. Besides filling this gap in the literature, the motivation comes from Baysian statistics: There, one is interested in proving contraction rates for the posterior distribution for nonparametric inverse problems. There, estimates of the type studied here are important, see e.g. Lemma 5.2 in [13], [16], or [11] for results with Gaussian priors, which require large deviation estimates of squared Gaussians, i.e. with exponential moments. We mention that the present results are directly motivated by a forthcoming work of S. Agapiou and P. Mathé in that area for non-Gaussian priors.
The paper is structured as follows. In Section 2, we define the concrete setup for this paper and state our main result. The proofs are given in Section 3.
2 Main result
Let be an array of non-negative numbers (let for all to avoid trivialities). Let be a sequence of non-negative i.i.d. random variables, copies of the random variable with tail behaviour
[TABLE]
for some and .
We are interested in the probability
[TABLE]
The large deviation regime is characterized by the condition that the typical values of lie below , i.e.
[TABLE]
which we shall encode using assumption (4) below.
We can now formulate our main result, which is a “largest jump principle” for the large deviations of weighted sums of stretched exponential random variables. This means that the large deviation event is triggered by one of the terms in the sum being large, namely the one corresponding to the largest weight.
Theorem 2.1
Let be a sequence of non-negative i.i.d. random variables, copies of with tail behaviour (2). Further, let be non-negative numbers with
[TABLE]
and such that and . Then for any
[TABLE]
We stress that we do not need any regularity assumption on the sequence . Note that exists (for any ), because (4) implies that .
Example 2.2
The classical result (1) is retrieved for .
Example 2.3
In a motivating example from Baysian statistics, , which gives the large deviation probability of “remainder” sums: . Here is a positive, summable sequence and is a positive sequence.
Example 2.4
The work of Gantert et al. [8] in the case of non-negative random variables can be recovered as follows. They consider arrays with for . Their condition (B) implies that (4) holds with and that . Note that we do not require to be of order in this work.
Example 2.5
Examples where depends on in a different way are given for instance by moving averages, where
[TABLE]
for positive sequences , , . Such objects were studied by [12] under the assumption of finite exponential moments (cf. the remark on p. 938 in [12]).
Possible extensions of the present results include the case that has a polynomial tail (rather than stretched exponential) or the precise behaviour for the case of a supremum rather than a sum (see Lemma 3.2 below for a partial result). In the spirit of Example 2.3, one could also consider , where is random (cf. e.g. [2] for the case of finite sums). Further, one might want to add a slowly varying factor in (2).
3 Proofs
3.1 Auxiliary results for maxima
We start with two results for the rate of the probability
[TABLE]
which is the obvious analog of (3). We start with a lower bound.
Lemma 3.1
If then for any
[TABLE]
If then .
Proof: The claims follow immediately from the trivial estimate
[TABLE]
where .
We now turn to the corresponding upper bound. We shall prove it under more restrictive assumptions in order to avoid lengthy discussions (note that (4) is not necessary for the sup-problem). The stated lemma will be one ingredient in the proof of the main result.
Lemma 3.2
Assume that (4) holds and that . Then we have for any
[TABLE]
Proof: First note that we can assume w.l.o.g. that , as otherwise it can be absorbed as a constant factor into the sequence . The lower bound already follows from Lemma 3.1. For the upper bound, observe that
[TABLE]
Fix . It remains to use the tail bound for , which shows that the last term bounded from above as follows: For large enough ,
[TABLE]
with some constant . The remainder of the proof consists in a treatment of this sum:
[TABLE]
where we used in the second step that for large and, in the last step, the assumptions that and . Combining this with (6) and (7) shows
[TABLE]
Multiplying by , taking first and then shows the upper bound in the statement.
3.2 Proof of the main result
Here, we give the proofs of the lower and upper bound in Theorem 2.1, respectively.
Proof of the lower bound: Throughout, we use the notation .
Let us first treat the case that . Then the lower bound already follows from assumption (2) together with
[TABLE]
Assume . Then we can fix an with . We begin by noting that
[TABLE]
Since , (4) implies . Therefore, the first term on the right-hand side of (8), by (2), satisfies
[TABLE]
We will show that the second term on the right-hand side of (8) tends to one for fixed and . Combining this with the last formula will finish the proof of the lower bound in the theorem.
Note that, for large enough ,
[TABLE]
The last term tends to one, since by Chebyshev’s inequality
[TABLE]
which tends to zero (because the sum is bounded, by (4), and ), as required.
Proof of the upper bound: The first observation is that we can assume w.l.o.g. that , as can be absorbed as a constant factor into the sequence .
Step 1: Reduction step, main argument, overview.
Set and note that , by assumption. Further, fix such that also . First note that
[TABLE]
and the second term can be treated with Lemma 3.2, which shows that the second term has asymptotic order , as required by the assertion. If we can show that the first term is of the same or lower order, we obtain the statement.
Step 2: Exponential Chebychev inequality for the truncated random variables.
Let us consider the first term: For any , by the Markov inequality,
[TABLE]
Let us deal with the sum. Note that for we have (for small enough). Thus
[TABLE]
Setting we shall use the last estimate with
[TABLE]
Step 3: We show that the second sum in (10) tends to zero for fixed and .
First note that if then – using – we have
[TABLE]
Therefore,
[TABLE]
Further, it is elementary to show (see Lemma 3.3 below) that due to the tail estimate (2), which we use in the form for all and some , where , we have
[TABLE]
for any with .
In our case, and . Therefore, we see that the term on the right-hand side of (11) is bounded from above by
[TABLE]
The second sum in (10) is therefore bounded from above by
[TABLE]
where and . This can be treated as follows: Since for large enough , we have
[TABLE]
Now, is bounded by assumption (4). Further, since , the term tends to zero for fixed and . This finishes the proof of the fact that the second sum in (10) tends to zero.
Step 4: Final computations. Putting Step 3 together with (9) and (10), we have seen that for fixed and
[TABLE]
Multiplying by and using (4), we obtain
[TABLE]
Letting shows the assertion.
During the course of the last proof, we used the following completely elementary lemma.
Lemma 3.3
Let be a non-negative random variable with for all and . Then, for any and any ,
[TABLE]
Proof: Note that
[TABLE]
Acknowledgement. The author is indebted to Sergios Agapiou (University of Cyprus) and Peter Mathé (WIAS Berlin) for bringing this problem to his attention and to Marvin Kettner (Darmstadt) for valuable suggestions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] O. Bonin. Large deviation theorems for weighted sums applied to a geographical problem. J. Appl. Probab. , 39(2):251–260, 2002.
- 2[2] O. Bonin. Large deviation theorems for weighted compound Poisson sums. Probab. Math. Statist. , 23(2, Acta Univ. Wratislav. No. 2593):357–368, 2003.
- 3[3] S. A. Book. Large deviation probabilities for weighted sums. Ann. Math. Statist. , 43:1221–1234, 1972.
- 4[4] S. A. Book. A large deviation theorem for weighted sums. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete , 26:43–49, 1973.
- 5[5] D. Deltuvienė and L. Saulis. Asymptotic expansion of the distribution density function for the sum of random variables in the series scheme in large deviation zones. In Proceedings of the Eighth Vilnius Conference on Probability Theory and Mathematical Statistics, Part I (2002) , volume 78, pages 87–97, 2003.
- 6[6] A. Dembo and O. Zeitouni. Large deviations techniques and applications , volume 38 of Applications of Mathematics (New York) . Springer-Verlag, New York, second edition, 1998.
- 7[7] J.-D. Deuschel and D. W. Stroock. Large deviations , volume 137 of Pure and Applied Mathematics . Academic Press, Inc., Boston, MA, 1989.
- 8[8] N. Gantert, K. Ramanan, and F. Rembart. Large deviations for weighted sums of stretched exponential random variables. Electron. Commun. Probab. , 19:no. 41, 2014.
