On moderate deviations in Poisson approximation
Qingwei Liu, Aihua Xia

TL;DR
This paper investigates the accuracy of Poisson approximation for rare event counts, showing that with proper adjustments, the error estimates for moderate deviations are improved and applicable across various problems.
Contribution
It introduces refined error estimates for moderate deviations in Poisson approximation, applicable to multiple classical problems, with no unspecified constants and easy to implement.
Findings
Poisson tail probabilities better approximate rare event counts than normal tails.
Adjusted estimates improve error bounds for moderate deviations.
Applications include Poisson-binomial, matching, occupancy, birthday, random graphs, and 2-runs.
Abstract
In this paper, we first use the distribution of the number of records to demonstrate that the right tail probabilities of counts of rare events are generally better approximated by the right tail probabilities of Poisson distribution than {those} of normal distribution. We then show the moderate deviations in Poisson approximation generally require an adjustment and, with suitable adjustment, we establish better error estimates of the moderate deviations in Poisson approximation than those in \cite{CFS}. Our estimates contain no unspecified constants and are easy to apply. We illustrate the use of the theorems in six applications: Poisson-binomial distribution, matching problem, occupancy problem, birthday problem, random graphs and 2-runs. The paper complements the works of \cite{CC92,BCC95,CFS}.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
ON MODERATE DEVIATIONS IN POISSON APPROXIMATION
Qingwei Liu111School of Mathematics and Statistics, The University of Melbourne, VIC 3010, Australia, E-mail: [email protected]. Work supported in part by China Scholarship Council. and Aihua Xia222School of Mathematics and Statistics, The University of Melbourne, VIC 3010, Australia, E-mail: [email protected]. Work supported in part by Australian Research Council Grants No DP190100613.
(20missingmissing-missingmissing-missingmissing)
Abstract
In this paper, we first use the distribution of the number of records to demonstrate that the right tail probabilities of counts of rare events are generally better approximated by the right tail probabilities of Poisson distribution than those of normal distribution. We then show the moderate deviations in Poisson approximation generally require an adjustment and, with suitable adjustment, we establish better error estimates of the moderate deviations in Poisson approximation than those in [Chen, Fang & Shao (2013a)]. Our estimates contain no unspecified constants and are easy to apply. We illustrate the use of the theorems in six applications: Poisson-binomial distribution, matching problem, occupancy problem, birthday problem, random graphs and 2-runs. The paper complements the works of [Chen & Choi (1992), Barbour, Chen & Choi (1995), Chen, Fang & Shao (2013a)].
Key words and phrases: Stein-Chen method, Poisson approximation, moderate deviation.
AMS 2020 Subject Classification: Primary 60F05; secondary 60E15.
1 Introduction
An exemplary moderate deviation theorem is as follows (see [Petrov (1975), p. 228]). Let , , be independent and identically distributed (i.i.d.) random variables with and . If for some ,
[TABLE]
then there exist positive constants and depending on and such that
[TABLE]
where is the distribution function of the standard normal, . However, since the pioneering work [Chen (1975)], it has been shown [Barbour, Holst & Janson (1992)] that, for the counts of rare events, Poisson distribution provides a better approximation. For example, the distribution of the number of records [Dwass (1960), Rényi (1962)] in Example 1.1 below can be better approximated by the Poisson distribution having the same mean than by a normal distribution [Deheuvels & Pfeifer (1988)]. Moreover, a suitable refinement of the Poisson distribution can further improve the performance of the approximation [Borovkov (1988), Borovkov & Pfeifer (1996)].
The right tail probabilities of counts of rare events are often needed in statistical inference but these probabilities are so small that the error estimates in approximations of distributions of the counts are usually of no use because the bounds are often larger than the probabilities of interest. Hence it is of practical interest to consider their approximations via moderate deviations in Poisson approximation in a similar fashion to (1.2). However, there is not much progress in the general framework except the special cases in [Chen & Choi (1992), Barbour, Chen & Choi (1995), Chen, Fang & Shao (2013a), Tan, Lu & Xia (2018), Čekanavičius & Vellaisamy (2019)]. This is partly due to the fact that the tail behaviour of a Poisson distribution is significantly different from that of a normal distribution and this fact is observed by [Gnedenko (1943)] in the context of extreme value theory. In particular, [Gnedenko (1943)] concludes that the Poisson distribution does not belong to any domain of attraction while the normal distribution belongs to the domain of attraction of the Gumbel distribution.
Example 1.1
We use the distribution of the number of records to explain the difference of moderate deviations between Poisson and normal approximations. More precisely, let be i.i.d. random variables with a continuous cumulative distribution function. As the value of is always a record, for , we say is a record if . We define the indicator random variable
[TABLE]
that is, if a new record occurs at time and otherwise. Our interest is on the distribution of , denoted by . [Dwass (1960), Rényi (1962)] state that , are independent so
[TABLE]
We use to stand for the Poisson distribution with mean , for , and to stand for the normal distribution with mean and variance .
Let , and we consider approximations of by moderate deviations based on [Barbour, Chen & Choi (1995), Chen, Fang & Shao (2013a)] and . For , figures 2, 2 and 4 are respectively the plots of the ratios , and for the range of . As observed in [Borovkov & Pfeifer (1996)], Poisson and normal approximations to are resp. with order and , the numerical studies confirm that approximation by the Poisson distribution is better than that by normal distribution. In fact, it appears that the speed of convergence of to as is too slow to be of practical use. In the context of normal approximation to the distribution of integer valued random variables, a common practice is to introduce a 0.5 correction, giving the ratios , where is the smallest integer that is not less than . Figure 4 is the plot of the ratios and we can see that the ratios are still far away from the limit of 1. Finally, the difference between Figure 2 and Figure 4 shows that a minor change of the mean of the approximating Poisson can change the quality of moderate deviation approximation significantly, further highlighting the difficulty of obtaining sharp bounds in theoretical studies in the area. **
Example 1.1 shows that the distribution of the counts of rare events often has a heavier right tail than that of the corresponding normal distribution, approximations by the moderate deviations in the normal distribution are generally inferior to those by the moderate deviations in the Poisson distribution. The next example says that the parameter of the approximating Poisson distribution suggested in [Chen & Choi (1992), Barbour, Chen & Choi (1995), Chen, Fang & Shao (2013a)] is not optimal and some adjustment can significantly improve the quality of approximations by the moderate deviations in the Poisson distribution.
Example 1.2
With , let , and , then for a fixed ,
[TABLE]
which systematically deviates from 1 as moves away from 0. The systematic bias can be removed by introducing an adjustment into the approximate models: for a fixed ,
[TABLE]
or equivalently, with ,
[TABLE]
Example 1.2 suggests that it is more suitable to approximate the right tail probabilities by looking at the number of standard variations away from the mean, which is essentially the original idea of the translated (shifted) Poisson approximation [Barbour & Xia (1999), Röllin (2005), Röllin (2007)]. In this paper, we show that it is indeed better to approximate the right tail probabilities via the moderate deviations in the translated Poisson distribution.
Our approach does not rely on the boundedness of the Radon-Nikodym derivative as in [Chen & Choi (1992), Barbour, Chen & Choi (1995)] or the tacit assumption of well-behaved tail probabilities as in [Chen, Fang & Shao (2013a)], see Remark 2.4 for more details. For the case of Poisson-binomial, we show in Proposition 3.2 that our approach works for the case that the maximum of the success probabilities of the Bernoulli random variables is not small, such as the distribution of the number of records.
The paper is organised as follows. We state the main results in the context of local dependence, size-biased distribution and discrete zero-biased distribution in Section 2. The accuracy of our bounds is illustrated in six examples in Section 3. The proofs of the main results are postponed to Section 4 where we also establish Stein’s factors for Poisson moderate deviations in Lemma 4.1.
2 The main results
In this section, we state three theorems on moderate deviations in Poisson approximation, the first is under a local dependent structure, the second is with respect to the size-biased distribution and the last is in terms of the discrete zero-biased distribution.
We first consider a class of non-negative integer valued random variables satisfying the local dependent structure (LD2) in [Chen & Shao (2004)] (see also [Arratia, Goldstein & Gordon (1989)] for its origin). For ease of reading, we quote the definition of (LD2) below.
(LD2) For each , there exists an such that is independent of
and is independent of .
We set , , , and . We write
[TABLE]
As suggested in Example 1.2, we consider approximation to with being not too large and being an integer so that in and is in terms of the number of standard deviations of . In principle, the constant is chosen to minimise the error of approximation, however, our theory is formulated in such a flexible way that other choices of and are also acceptable. The three most useful choices of are , and , where stands for the largest integer in .
Theorem 2.1
With the setup in the preceding paragraph, assume that satisfies (LD2) and, for each , there exists a -algebra such that is measurable. Define
[TABLE]
where is the essential supremum of the random variable . Then for integer , and positive integer , we have
[TABLE]
where, with , ,
[TABLE]
Remark 2.2
Both and can be numerically computed in applications and they can’t be generally improved (see the proofs below). They are better than the “naive” counterparts derived through the total variation bounds in [Barbour & Eagleson (1984), Barbour, Holst & Janson (1992)]. Figure 5 provides details of
[TABLE]
for , from to . We would like to mention that for large and/or large , the tail probabilities are so small that the calculation using MATLAB produces unstable results since accumulated computation errors often exceed the tail probabilities, hence more powerful computational tools are needed to achieve the required accuracy or one has to resort to known approximations to the Poisson right tails and point probabilities.
Remark 2.3
Due to the discrete nature of Poisson distribution, it seems impossible to analytically simplify and at negligible costs for the diverse range of . **
Remark 2.4
If is chosen reasonably close to so that is bounded, then in the bound (2.1) converges to [math] when converges to . Our bound does not rely on the Radon-Nikodym derivative of with respect to , which is the crucial ingredient in [Chen & Choi (1992), Barbour, Chen & Choi (1995)]. On the other hand, the tacit assumption of [Chen, Fang & Shao (2013a)] is that for and in Theorem 2.1 is well-behaved and this assumption is hard to verify. The bound (2.1), although relatively crude, does not rely on this assumption and covers more general cases.**
Corollary 2.5
For the sum of independent non-negative integer valued random variables , let , , , . For any integer , let , , then for ,
[TABLE]
Remark 2.6
We leave in the upper bound (2.1) because the current approach can not remove it from the bound. Nevertheless, it is no more than and converges to zero exponentially fast with suitable choice of . For the sum of independent non-negative integer valued random variables in Corollary 2.5, if is at least less than by a few s, we can use [Chung & Lu (2006), Theorem 2.7] to obtain
[TABLE]
For any non-negative random variable with mean and distribution , the -size biased distribution [Cochran (1977), Arratia & Goldstein (2010)] is given by
[TABLE]
or equivalently by the characterising equation
[TABLE]
Theorem 2.7
Let be a non-negative integer-valued random variable with mean and variance , be an integer, . Then for integer , we have
[TABLE]
where .
Remark 2.8
Theorem 2.7 improves [Chen, Fang & Shao (2013a), Theorem 3] in a number of ways, with less restrictive conditions and no unspecified constants. **
The next theorem is based on the discrete zero-biased distribution defined in [Goldstein & Xia (2006)] and the approach is very similar to that in [Chen, Fang & Shao (2013b)]. For an integer valued random variable with mean and finite variance , we say that has the discrete -zero biased distribution [Goldstein & Xia (2006), Definition 2.1] if, for all bounded functions with ,
[TABLE]
where .
Theorem 2.9
Let be a non-negative integer-valued random variable with mean , variance , be an integer, and have the discrete -zero biased distribution and be defined on the same probability space as . Set and define
[TABLE]
Then, for integer , with , we have
[TABLE]
where .
3 Examples
As many applications of Poisson approximation rely on size biased distributions, we begin with a review of some facts about size biasing.
Size biasing has been of considerable interest for many decades (see [Barbour, Holst & Janson (1992)], [Ross (2011)], [Arratia, Goldstein & Kochman (2013)] and references therein). In the context of the sum of Bernoulli random variables, its size biasing is particularly simple. More precisely, if is a family of Bernoulli random variables with , then the size biased distribution of is
[TABLE]
where
[TABLE]
is a random element independent of having distribution , . Moreover, are said to be negatively related (resp. positively related) [Barbour, Holst & Janson (1992), p. 24] if one can construct such that (resp. ) for all . When are negatively related, we have
[TABLE]
where and . On the other hand, if are positively related, then
[TABLE]
3.1 Poisson-binomial trials
Let be independent Bernoulli random variables with , , and . When , the large deviation of is investigated in [Chen & Choi (1992), Barbour, Chen & Choi (1995)] with precise asymptotic order. We give two results for this particular case without the assumption being small, the first is direct consequences of the general results in Section 2 and the second is based on our approach using a more fine-tuned analysis and well-studied properties of the tail behaviour of .
Proposition 3.1
Recalling and in (2.2) and (2.3), for any integer , we have
[TABLE]
and, with and ,
[TABLE]
Proof The claim (3.4) is a consequence of Theorem 2.7 with and , as shown in (3.2).
The bound (3.5) is a special case of Corollary 2.5. Since is unimodal, [Mattner & Roos (2007), Corollary 1.6] says that
[TABLE]
On the other hand, , hence the upper bound (3.5) is an immediate consequence of Corollary 2.5 and (2.4).
One can also use Theorem 2.9 to obtain the same bound. More precisely, according to the construction of the discrete zero-biased distribution suggested in [Goldstein & Xia (2006)], let be a random variable independent of with distribution for , then we can write , giving . We then apply (3.6) to bound as
[TABLE]
and a routine calculation gives , hence (3.5) follows from (2.6) and (2.4).
Proposition 3.2
Define
[TABLE]
then for any integer with , we have
[TABLE]
The proof relies on more information of the solutions of Stein’s equation and it is postponed to the end of Section 4. The bound (3.7) improves [Chen, Fang & Shao (2013a), (3.1)] in two aspects: it contains no unspecified constants and it does not require being small. For the distribution of the number of records, the large deviation results in [Barbour, Chen & Choi (1995)] do not apply. However, recalling that , we apply Proposition 3.2 with the harmonic series and the Riemann zeta function to get the following estimate.
Corollary 3.3
For any integer with , we have
[TABLE]
where is Euler’s constant.
Remark 3.4
We conjecture that, with and , the bound in (3.5) can be significantly improved and the better estimate is likely dependent on the Radon-Nikodym derivative bound . **
3.2 Matching problem
For a fixed , let be a uniform random permutation of , be the number of fixed points in the permutation.
Proposition 3.5
For the random variable defined above and any integer , we have
[TABLE]
Proof of Proposition 3.5 In this case, the size-biased distribution can be coupled with as follows [Chatterjee, Diaconis & Meckes (2005)]. Let be uniformly distributed on , independent of , and define
[TABLE]
Set , one can easily verify that has the size-biased distribution of . Also, we can check that , giving . Let , using the above construction of , we can conclude that takes values in and . Since we have , and . On the other hand, allows us to get rid of the second term in (2.5). By Theorem 2.7 with , , the claim follows.
Remark 3.6
The bound (3.8) contains no unknown constants and improves the bound of [Chen, Fang & Shao (2013a), §3.3]. **
3.3 Occupancy Problem
The occupancy problem has a long history dating back to the early development of probability theory. General references on this subject can be found in classics, e.g., [Feller (1968), Vol 1, Chapter 2] and [Barbour, Holst & Janson (1992), Chapter 6].
The occupancy problem can be formulated as follows. Let balls be thrown independently of each other into boxes uniformly. Let be the indicator variable of the event that -th box being empty, so the number of empty boxes can be written as . Noting that , direct computation gives
[TABLE]
[TABLE]
Proposition 3.7
For the random variable defined above and any integer , we have
[TABLE]
where .
Proof of Proposition 3.7 For the sake of completeness, we provide the following proof which is essentially a repeat of [Barbour, Holst & Janson (1992), p. 23]. From the construction of -size biased distribution in (3.1), we can construct a coupling as follows. Let be uniform on , that is, we randomly pick one box with equal probability. If the selected box is not empty, we redistribute all balls in the box randomly into the other boxes with equal probability . Define as the indicator of the event that the box being selected is , and after the redistribution, box is empty. With this coupling in mind, one can verify that is negatively related so it follows from (3.2) that
[TABLE]
Now, applying Theorem 2.7 with yields (3.9).
3.4 Birthday problem
The classical birthday problem is essentially a variant of the occupancy problem. For this reason, we throw balls independently and equally likely into boxes and let be the indicator random variable of the event that ball and ball fall into the same box. The number of pairs of balls going into the same boxes (i.e., the number of pairs of people having the same birthdays) can be written as . Define , so [Chatterjee, Diaconis & Meckes (2005)] give the following construction of : label the balls from to , randomly choose two balls and and move ball into the box that is in, then is the number of pairs of balls before the move while is the number of pairs of balls after the move. Let be the event that and are from the same box. When occurs, so ; otherwise, and are from different boxes with and balls respectively, giving
[TABLE]
Hence,
[TABLE]
This, together with Theorem 2.7 and , gives the following Proposition.
Proposition 3.8
For the random variable defined above and any integer , we have
[TABLE]
where .
3.5 Triangles in the Erdős-Rényi random graph
Let be an Erdős-Rényi random graph on vertices with edge probability . Let be the complete graph on vertices, and be the set of all triangles in . For , let be the indicator that there is a triangle in at , i.e.
[TABLE]
Therefore the number of triangles in can be represented as . Clearly, is independent of if and don’t share a common edge. By analysing the numbers of shared edges, we obtain (see [Ross (2011), p. 255])
[TABLE]
[TABLE]
Proposition 3.9
For the random variable defined above and any integer , we have
[TABLE]
where .
Proof of Proposition 3.9 The following proof is a special version of the general argument in [Barbour, Holst & Janson (1992), p. 89]. Since and are independent if and have no common edges, a size biased distribution of can be constructed as follows. Let
[TABLE]
then . Here the union of graphs is in the sense of set operation of their vertices and edges. Let be a random element taking values in with equal probability and be independent of , then we can write . Because for all , (3.3) implies
[TABLE]
The claim follows from Theorem 2.7 with .
Remark 3.10
Since , if , then the error bound (3.10) is of the same order .
3.6 2-runs
Let be i.i.d. random variables with , . For each , define and, to avoid edge effects, we define for . The number of -runs in the Bernoulli sequence is defined as , then and variance .
Proposition 3.11
For any integer ,
[TABLE]
With , , then for any integer ,
[TABLE]
Proof For (3.11), we apply Theorem 2.7 with ,
[TABLE]
a uniform random variable on independent of , and
[TABLE]
giving
[TABLE]
Apropos of (3.12), we make use of Theorem 2.1. To this end, let , , , then [Barbour & Xia (1999), Lemma 5.1] with or for gives
[TABLE]
On the other hand, , ,
[TABLE]
and , , , hence and (3.12) follows from Theorem 2.1 by collecting these terms.
4 The proofs of the main results
The celebrated Stein-Chen method [Chen (1975)] is based on the observation that a non-negative random variable if and only if for all bounded functions , leading to a Stein identity for Poisson approximation as
[TABLE]
where . Since plays no role in Stein’s equation, we set and for . The following Lemma plays the key role in the proofs of the main results and it enables us to circumvent checking the moment condition (1.1) which seems to be inevitable in the existing procedure for proving moderate deviation theorems.
Lemma 4.1
For fixed , let . With , and , the solution of the Stein equation (4.1) has the following properties:
(i) , where ;
(ii) is negative and decreasing in ; and positive and decreasing in ;
(iii) and , where
[TABLE]
and
[TABLE]
(iv) and ;
where and are defined in (2.2) and (2.3).
For , death rates are bigger than the birth rate, so it seems intuitively obvious that is stochastically less than or equal to for such . In view of representation (4.11) and as shown in (4.6), this is equivalent to , leading to the following conjecture.
Conjecture 4.2
We conjecture that for all and the gap increases exponentially as a function of .**
Proof of Lemma 4.1 We build our argument on the birth-death process representation of the solution
[TABLE]
where is a birth-death process with birth rate , unit per capita death rate and initial state [Barbour (1988), Barbour & Brown (1992), Brown & Xia (2001)]. For convenience, we adopt the notation in [Brown & Xia (2001)]: for , define
[TABLE]
and
[TABLE]
Applying Lemmas 2.1 and 2.2 of [Brown & Xia (2001)] with birth rate , death rate , and , we have
[TABLE]
and for ,
[TABLE]
where, as in Theorem 2.1,
[TABLE]
One can easily simplify (4.3) to get
[TABLE]
which, together with (4.4) and the balance equations
[TABLE]
implies
[TABLE]
It follows from [Brown & Xia (2001), Lemma 2.4] that for ,
[TABLE]
which, together with (4.7), ensures
[TABLE]
hence, and combining (4.4), (4.5) and (4.6) gives as claimed in (i).
Apropos of (ii), because of (4.9) and (4.10), it remains to show that is decreasing in the two ranges. To this end, we will mainly rely on the properties of the solution (4.2). Let be an exponential random variable with mean and independent of birth-death process , then can be represented as
[TABLE]
hence we obtain from (4.2) and the strong Markov property in the second last equality that
[TABLE]
This enables us to give another representation of (4.8) as
[TABLE]
and so
[TABLE]
For , using the strong Markov property again in the equalities below, we have
[TABLE]
where the inequality follows from
[TABLE]
Similarly, for , is stochastically less than or equal to , so
[TABLE]
Hence, for and , which concludes the proof of (ii).
In terms of (iii), we use (ii) to obtain
[TABLE]
Likewise,
[TABLE]
Since (iv) is clearly an immediate consequence of (iii), (2.2) and (2.3), the proof of Lemma 4.1 is complete.
Proof of Theorem 2.1 As in the proof of Lemma 4.1, we set and , then
[TABLE]
Define
[TABLE]
then it follows from (4.1) that
[TABLE]
For the estimate of , from , we know that , thus
[TABLE]
which gives
[TABLE]
For the estimate of , denoting , we have
[TABLE]
Using Lemma 4.1 (ii), we have is negative for all except , which implies and
[TABLE]
and
[TABLE]
hence
[TABLE]
By taking
[TABLE]
we have from (4.14) that
[TABLE]
where the third last equality is because and is independent of , and the last equality is due to the assumption that is measurable. Using (4.15) in (4.16), we obtain
[TABLE]
Now, combining Lemma 4.1 (iii), (iv), (4.12), (4.13) and (4.17) gives (2.1).
Proof of Corollary 2.5 Under the setting of the local dependence, the claim follows from Theorem 2.1 by taking .
Proof of Theorem 2.7 Recall the Stein representation (4.12) and the estimate (4.13), it remains to tackle (4.14). However,
[TABLE]
thus
[TABLE]
Hence, combining (4.12), (4.13) and (4) completes the proof.
Proof of Theorem 2.9 Again, we make use of the Stein representation (4.12) and the estimate (4.13) so that it suffices to deal with (4.14). To this end, we have
[TABLE]
However, with ,
[TABLE]
and a similar argument for (4.15) ensures
[TABLE]
hence
[TABLE]
The claim follows from combining (4.12), (4.13) and (4.19) and using Lemma 4.1 (iii), (iv).
Proof of Proposition 3.2 The first inequality of (3.7) is a direct consequence of [Hoeffding (1956)]. For the second inequality, let and be the solution of the Stein identity (4.1) with , set , , the following argument is standard (see [Barbour, Holst & Janson (1992), p. 6]) and we repeat it for the ease of reading:
[TABLE]
For any non-negative integer valued random variable such that the following expectations exist, the summation by parts gives
[TABLE]
On the other hand, [Barbour, Chen & Choi (1995), Proposition 2.1] ensures
[TABLE]
so using Lemma 4.1 (ii), we have
[TABLE]
However, by (4.2), since is the stationary distribution of , , leading to
[TABLE]
where are i.i.d. random variables independent of . Combining (4.20), (4.21) and (4.22), we have
[TABLE]
For the first term of (4.23), using [Barbour, Holst & Janson (1992), Proposition A.2.1 (ii)], we obtain
[TABLE]
For the second term of (4.23), we use the crude estimate of (see [Barbour, Holst & Janson (1992), Lemma 1.1.1] or Remark 2.2), so applying [Barbour, Holst & Janson (1992), Proposition A.2.1 (ii)] again,
[TABLE]
The bound (3.7) follows by collecting (4.23), (4.24) and (4.25).
Acknowledgements We thank the anonymous referees for suggesting the “naive bound” in Remark 2.2 and comments leading to the improved version of the paper. We also thank Serguei Novak for email discussions about the quality of the bounds presented in the paper versus the “naive bound”.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Arratia & Goldstein (2010)] Arratia, R. & Goldstein, L. (2010). Size bias, sampling, the waiting time paradox, and infinite divisibility: when is the increment independent? ar Xiv:1007.3910.
- 2[Arratia, Goldstein & Gordon (1989)] Arratia, R., Goldstein, L. & Gordon, L. (1989). Two moments suffice for Poisson approximations: The Chen-Stein method. Ann. Probab. 17 , 9–25.
- 3[Arratia, Goldstein & Kochman (2013)] Arratia, R., Goldstein, L. & Kochman, F. (2013). Size bias for one and all. ar Xiv preprint ar Xiv:1308.2729.
- 4[Barbour (1988)] Barbour, A. D. (1988). Stein’s method and Poisson process convergence. J. Appl. Probab. 25 (A), 175–184.
- 5[Barbour & Brown (1992)] Barbour, A. D. & Brown, T. C. (1992). Stein’s method and point process approximation. Stochastic Process. Appl. 43 , 9–31.
- 6[Barbour, Chen & Choi (1995)] Barbour, A. D., Chen, L. H. Y. & Choi, K. P. (1995). Poisson approximation for unbounded functions, I: Independent summands. Statist. Sinica 2 , 749–766.
- 7[Barbour & Eagleson (1984)] Barbour, A. D. & Eagleson, G. K. (1984). Poisson Convergence for Dissociated Statistics. Journal of the Royal Statistical Society. Series B (Methodological) 46 , 397–402.
- 8[Barbour, Holst & Janson (1992)] Barbour, A. D., Holst, L. & Janson, S. (1992). Poisson Approximation. Oxford Univ. Press.
