Relaxation of monotone coupling conditions: Poisson approximation and beyond
Fraser Daly, Oliver Johnson

TL;DR
This paper relaxes the monotonicity assumptions in size-bias couplings to establish explicit Poisson approximation bounds for various dependent random variables, broadening applicability to noisy models and other approximations.
Contribution
It introduces methods to relax monotonicity conditions in size-bias couplings, enabling Poisson approximation bounds based on first two moments for more general dependent variables.
Findings
Effective bounds for models with noise contamination
Poisson approximation for associated or negatively associated variables
Extensions to Poincaré inequality and normal approximation
Abstract
It is well-known that assumptions of monotonicity in size-bias couplings may be used to prove simple, yet powerful, Poisson approximation results. Here we show how these assumptions may be relaxed, establishing explicit Poisson approximation bounds (depending on the first two moments only) for random variables which satisfy an approximate version of these monotonicity conditions. These are shown to be effective for models where an underlying random variable of interest is contaminated with noise. We also give explicit Poisson approximation bounds for sums of associated or negatively associated random variables. Applications are given to epidemic models, extremes, and random sampling. Finally, we also show how similar techniques may be used to relax the assumptions needed in a Poincar\'e inequality and in a normal approximation result.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Relaxation of monotone coupling conditions: Poisson approximation and beyond
Fraser Daly111Department of Actuarial Mathematics and Statistics and the Maxwell Institute for Mathematical Sciences, Heriot-Watt University, Edinburgh EH14 4AS, UK. E-mail: [email protected]; Tel: +44 (0)131 451 3212; Fax: +44 (0)131 451 3249
Oliver Johnson222School of Mathematics, University of Bristol, University Walk, Bristol BS8 1TW, UK. Email: [email protected]; Tel: +44 (0)117 928 8632
Abstract It is well-known that assumptions of monotonicity in size-bias couplings may be used to prove simple, yet powerful, Poisson approximation results. Here we show how these assumptions may be relaxed, establishing explicit Poisson approximation bounds (depending on the first two moments only) for random variables which satisfy an approximate version of these monotonicity conditions. These are shown to be effective for models where an underlying random variable of interest is contaminated with noise. We also give explicit Poisson approximation bounds for sums of associated or negatively associated random variables. Applications are given to epidemic models, extremes, and random sampling. Finally, we also show how similar techniques may be used to relax the assumptions needed in a Poincaré inequality and in a normal approximation result.
MSC 2010 Primary: 62E17. Secondary: 60E15, 60F05, 62E10
1 Introduction
It is well-known that in many situations exploiting negative or positive dependence structure is an effective way to establish Poisson approximation results. For example, Barbour, Holst and Janson [3] treat many applications of Poisson approximation for sums of (dependent) Bernoulli random variables which are negatively related, that is, satisfy
[TABLE]
for all and increasing functions .
The Poisson approximation bounds given in [3] under this negative relation assumption have the advantage of only depending on the first two moments of . In general, such bounds require much more detailed information about the in order to be evaluated. More precisely, let be the total variation distance, defined for non-negative integer-valued random variables and by
[TABLE]
where . Then if is a sum of negatively related Bernoulli random variables with , and has a Poisson distribution with mean , then Corollary 2.C.2 of [3] gives the bound
[TABLE]
This upper bound is considerably simpler to evaluate in practice than more general Poisson approximation bounds, many of which involve, for example, for each and . Thus, if negative relation can be shown to hold, it is worthwhile taking advantage of it. Straightforward Poisson approximation bounds are also available when are positively related, a property which is defined analogously to (1), but with the inequality reversed.
Within a more general approximation framework, Daly, Lefèvre and Utev [10] recently showed that the upper bound of (2) continues to hold when is a non-negative, integer-valued random variable with mean , under the assumption that
[TABLE]
where has the -size-biased distribution, defined by
[TABLE]
and ‘’ denotes the usual stochastic ordering. If is a sum of negatively related Bernoulli random variables, then is also shown to satisfy (3), so (3) is referred to as a negative dependence condition for . Daly, Lefèvre and Utev [10] also show simple Poisson approximation results under an analogous positive dependence condition expressed in terms of monotonicity of the size-biased coupling.
Daly and Johnson [9] proved a simple Poincaré inequality for under the same condition (3), (again, with an upper bound depending only on the first two moments of ). Boundedness and monotonicity conditions of a similar type are also exploited in several of the normal approximation theorems discussed by Chen, Goldstein and Shao [6].
Our aim in this work is to demonstrate how the strict condition of assumptions such as (3) may be relaxed. The structure of the remainder of the paper is as follows. In Section 2 we derive Poisson approximation bounds (Theorem 2.2) for random variables which come close (in a certain sense) to satisfying either (3) or its positive dependence analogue. This relaxation of these conditions is motivated by recent work of Cook, Goldstein and Johnson [7], who established a concentration inequality which can control the spectral gap of random graphs. In Section 2.2 we use Theorem 2.2 to derive explicit Poisson approximation results for models where an underlying random variable of interest , satisfying (3), or its positive dependence analogue, is contaminated by noise. In Section 3, we show how these results can be applied to the Martin-Löf epidemic model and extremes of associated random variables.
In Section 4 we consider Poisson approximation for sums of associated or negatively associated random variables, illustrated by an application to simple random sampling. In Section 5, we show how such relaxed conditions imply a Poincaré inequality. Finally, in Section 6 we give an analogous relaxation of strict boundedness or monotonicity conditions for continuous random variables, to obtain normal approximation results.
2 Poisson approximation results
In this section, we begin by proving a Poisson approximation bound (Theorem 2.2 below) under conditions in the spirit of [7], which relax strict monotonicity assumptions such as (3). As we will see, the upper bounds we obtain have a form similar to that of (2). The condition (8) below is employed directly by [7], where a random variable satisfying this condition is said to be ‘-bounded with probability for the upper tail’. The analogous condition (10) was not used by [7], but is in the same spirit.
We then examine situations in which these Poisson approximation results may be applied. Section 2.2 considers models where we have an underlying random variable , satisfying a property such as (3), which is then contaminated with independent noise.
2.1 A Poisson approximation theorem
Our main Poisson approximation result is given in Theorem 2.2 below, and is proved using the Stein–Chen method. For an introduction to the Stein–Chen method for Poisson approximation, the interested reader is referred to [3], [11] and references therein.
Throughout this section, given a parameter and a set , we write for the solution to the Stein–Chen equation
[TABLE]
where and . By convention, we take for each . Writing for any function , we recall the standard bound [3, Eq. (1.17)]
[TABLE]
For any non-negative, integer-valued random variable , we write the upper tail
[TABLE]
Lemma 2.1**.**
For any non-negative, integer-valued random variable with
[TABLE]
for any . Here, and for the rest of the paper, represents the size-biased version of defined in (4).
The proof of Lemma 2.1 is deferred until Appendix A. We now apply this representation to prove the following.
Theorem 2.2**.**
Let be a non-negative, integer-valued random variable with and .
- (i)
Suppose there is a coupling of and such that
[TABLE]
for all . Then, for any ,
[TABLE] 2. (ii)
Suppose there is a coupling , , and a non-negative integer-valued random variable (which may be dependent on ) such that
[TABLE]
for all . Then for any
[TABLE]
Proof.
- (i)
Under the assumptions of the first part of the theorem,
[TABLE]
Note that (11) is equivalent to the stochastic ordering , where is Bernoulli with mean independent of all else, and hence generalizes (3). This stochastic ordering assumption was considered by [10], and the upper bound of part (i) now follows from their Proposition 3. For completeness we give a self-contained proof here. An analogous proof will be used for part (ii), in the case of positive dependence, for which no corresponding bound is available elsewhere.
We now apply the representation of Lemma 2.1. Taking modulus signs and using the triangle inequality and (6), we have that
[TABLE]
where . Since by (4) , the proof is complete. 2. (ii)
We use a similar argument to the above. We have
[TABLE]
from which an analogous argument to part (i) gives
[TABLE]
∎
Remark 2.3**.**
Taking in Theorem 2.2, we recover the results we expect under the stochastic ordering assumptions of [10]. For example, with and , the upper bound of Theorem 2.2(i) reduces to (2).**
Example 2.4**.**
Let and be independent. In the zero-inflated Poisson case where and , the argument of Example 3.6 of [7] shows that we may apply our Theorem 2.2(i), with the and we have defined here. Indeed, direct calculation gives
[TABLE]
so that (8) holds with equality. The bound (9) is . Since we may choose and obtain
[TABLE]
the bound (9) is exact in this case.**
2.2 Models contaminated with noise
In this section, we show how the assumptions of Theorem 2.2 are satisfied by a random variable , made up of a random variable which satisfies a monotone coupling assumption such as (3), together with some independent noise. In this case, we expect that will be close to Poisson as long as is close to Poisson and the noise is small. This is confirmed in the explicit bounds we derive below.
We consider separately the cases corresponding to parts (i) and (ii) of Theorem 2.2, beginning with part (i), the negatively dependent case.
Consider first the random variable , where
- •
and can be coupled such that almost surely. Note that this is possible if satisfies (3), which holds, for example, if may be written as a sum of negatively related Bernoulli random variables (see [3] for applications where this situation arises naturally). We let .
- •
is a non-negative, integer-valued random variable independent of with mean .
- •
is a Bernoulli random variable, independent of all else.
Theorem 2.5**.**
With this choice of , we may apply Theorem 2.2(i) with .
Proof.
Following, for example, Corollary 2.1 of [6], we may construct by replacing either or by its size-biased version, with the term to replace chosen with probability proportional to its mean. We obtain
[TABLE]
since and are equal in law, where may be constructed independently of , and is a Bernoulli random variable, independent of all else, with .
For any event and indicator variable , we know that
[TABLE]
Using this result to condition firstly on and then on , we have
[TABLE]
by our assumptions on , hence (8) is satisfied with taking this value. ∎
Now, to demonstrate the application of Theorem 2.2(ii) in this context, we will consider the random variable , where
- •
there exists a random variable (which may depend on and ) such that . In the case where is a sum of positively related Bernoulli random variables, such a random variable exists (see [10]). Again, the reader is referred to [3] for a wealth of applications involving such sums. We assume we have constructed in such a way that almost surely. This is possible under our stochastic ordering assumption.
- •
X, and are as above.
Theorem 2.6**.**
With this choice of , we may apply Theorem 2.2(ii) with .
Proof.
We still have the representation (12) for . Proceeding as before, using (13), by conditioning firstly on and then on , we have
[TABLE]
Hence, (10) is satisfied with taking this value. ∎
3 Applications
3.1 The Martin-Löf epidemic model
We show in this section how our framework implies a Poisson approximation result for the number of survivors in an epidemic model which is based on the Martin-Löf [17] model, but with the addition of an independent ‘catastrophe’ which causes the entire population to become infected. A Poisson approximation result was derived by Ball and Barbour [2] for the usual Martin-Löf model, and we base our argument on theirs.
We begin by describing the random graph model used in the construction of the epidemic. The random directed graph consists of vertices. Independently for each vertex , we choose a subset of vertices (distinct from ) to connect by a directed edge emanating from vertex . The value of is chosen from some given distribution, and then, conditional on , the set is chosen uniformly at random from the -subsets of vertices with .
The Martin-Löf epidemic is then constructed by choosing some initial set of infected vertices; the set of remaining vertices is , the set of initial susceptibles. The epidemic then proceeds in discrete time by recursively defining the set of infected vertices and susceptible vertices at time using the equations
[TABLE]
for . The epidemic ends when for some .
Ball and Barbour prove a Poisson approximation theorem, with an explicit rate, for , the ultimate number of susceptible vertices remaining, in this model. We will use their result to prove an analogous result in a modified version of this model.
To avoid the notational burden associated with the most general version of this model, for most of this section we will concentrate on the Reed–Frost model, in which each has a binomial distribution. Following [2], we let and consider the choice for some . We return to the more general model at the end of the section.
We will consider Poisson approximation for , where has a Bernoulli distribution with mean , independent of all else. The event represents a catastrophe in which the entire population is infected, which happens with (small) probability , independent of the dynamics of the epidemic model. By the triangle inequality, we then write
[TABLE]
where is the number of isolated vertices (i.e., the number of vertices that cannot be reached from any other vertex) in the random directed graph described above, and
[TABLE]
so that is a mixed Poisson distribution. The first term on the right-hand side of (14) is equal to
[TABLE]
by the argument leading to Corollary 2.5 of [2]. Similarly, the final term on the right-hand side of (14) is . We use our Poisson approximation results from above to bound the middle term on the right-hand side of (14).
Consider first the case where the are fixed constants. We note that may be written as a sum of negatively related Bernoulli random variables (see Theorem 1 of [2]). So, taking in our Theorem 2.5, we may use our Theorem 2.2(i) (with the choices and ) to get that in this case
[TABLE]
Taking expectations on the right-hand side of (15), in order to derive a bound in the case where the are random variables, we may follow the arguments leading to Corollary 2.5 of [2] to obtain
[TABLE]
Hence, from (14) we have:
Proposition 3.1**.**
[TABLE]
In particular, if , we obtain the order , the same order as in Corollary 2.5 of [2] for the usual Reed–Frost model. That is, if the probability of catastrophe is small enough, it does not affect the order of the Poisson approximation bound obtained for the ultimate number of susceptible vertices remaining.
Remark 3.2**.**
We can also use the arguments leading to Theorem 3 of [2] to give a bound in the more general Martin–Löf epidemic model discussed above. Under the assumptions of Theorem 3 of [2], and with defined therein, we obtain
[TABLE]
where .**
3.2 Extremes of associated random variables
We recall the following definition of association, introduced by [12].
Definition 3.3**.**
The random variables are associated if
[TABLE]
for all increasing functions and .
Association is a notion of positive dependence that we will return to in Section 4. In this section, our interest is in Poisson approximation for sums of associated random variables.
Suppose that are (for simplicity) identically distributed, associated random variables and define for some , so that counts the number of the that exceed the threshold , and write . Poisson approximation in this situation is considered in Section 8.3 of [3], using the observation that is a sum of positively related Bernoulli random variables to obtain the bound
[TABLE]
We consider now the effect of some (independent) ‘contamination’ of our sequence on this Poisson approximation result.
Suppose that are identically distributed, as before. Furthermore, let
- •
be associated, for some , and
- •
be independent of , with arbitrary dependence among these random variables.
Let , , and . Note that the expected number of random variables exceeding the threshold is the same as before. In light of Theorem 2.6, we apply Theorem 2.2(ii) with the choices , , and such that (as in the analysis in [3]) to obtain
Proposition 3.4**.**
[TABLE]
We now compare the bounds (17) and (18) in a concrete example.
Example 3.5**.**
Let have independent uniform distributions, and define , . These random variables are associated. In their Example 8.3.2, [3] shows that , if . We hence follow [3] and choose . Example 8.3.2 of [3] also shows that
[TABLE]
and so the bound (17), for the original model without contamination, becomes
[TABLE]
Now consider the ‘contaminated’ model, with as above for . We let be independent of , but each with the same marginal distribution as . Writing
[TABLE]
where we used the inequality (19), the bound (18) becomes
[TABLE]
*Consider the case where is fixed and . In this case, the upper bound of (20) is of order . If and , then the upper bound (21) for the model with contamination is of this same order.
4 Association and negative association
We now turn our attention to the application of Theorem 2.2 to derive Poisson approximation results for sums of associated or negatively associated random variables. We recall the definition (16) of association, and the following definition of negative association, introduced by [15].
Definition 4.1**.**
The random variables are said to be negatively associated if
[TABLE]
for all non-decreasing functions and , and all such that .
We also refer the reader to [4], [8] and references therein for further discussion of the association and negative association properties, their applications, and some approximation results for sums of associated or negatively associated random variables.
We consider firstly Poisson approximation results for , where are negatively associated, non-negative integer-valued random variables. For each , we choose and define . In the setting of compound Poisson approximation considered by [8], for example, these sets represent a ‘neighbourhood of dependence’ of , containing those indices such that is strongly dependent (in some sense) on . Here, however, we are free to make any choice of these sets . In the examples we consider below, we will choose for each , for simplicity, though our Poisson approximation results apply with an arbitrary choice of these sets.
Given these sets for , we define
[TABLE]
and . Letting be a random variable, independent of all else, with for , Lemma 3.1 of [8] shows that . So, since there exists a coupling such that almost surely, and noting that almost surely, we may apply Theorem 2.2(i) with the choice
[TABLE]
by the definition of the size-bias distribution. We emphasise again that this applies for any choice of the sets , . For simplicity, we state explicitly in Corollary 4.2 below the bound we obtain from Theorem 2.2(i) in this setting with the choices and for each , in which case for each .
Corollary 4.2**.**
Let , where are negatively associated, non-negative integer-valued random variables, with . Let . Then
[TABLE]
where .
Remark 4.3**.**
In the setting of Corollary 4.2, if the are Bernoulli random variables then we have . Since negatively associated Bernoulli random variables are known to be negatively related (see page 78 of [11], for example), we know that in this setting, and so the results of [10] may be applied. In this case, our Corollary 4.2 gives the same bound as [10].**
Given the above remark, we may think of as measuring (in a certain sense) how close the are to having Bernoulli distributions, with an increasing resulting in a smaller upper bound in Poisson approximation. We illustrate this with a simple example.
Example 4.4**.**
In the setting of Corollary 4.2, if are identically distributed with , and for some with , then we have , and obtain a good Poisson approximation bound when is small.**
We have so far discussed only the approximation of sums of negatively associated random variables. We now turn our attention to sums of associated random variables, where we use similar techniques to the above. We let be associated, non-negative integer-valued random variables, and define (for ) as above, again with any choice of the sets allowed for each . We also let be as above, again independent of all else, and write .
We further define the random variable , independent of all else, with for . Write . By Lemma 3.2 of [8], we have that , and so, analogously to the case of negative association, we may apply Theorem 2.2(ii) with this choice of and with . An analogue of Corollary 4.2 thus also applies in this setting.
As before, if the are Bernoulli random variables then we have . Since associated Bernoulli random variables are positively related (see, for example, page 77 of [11]), the results of [10] may also be applied here, and we obtain the same bound as [10] in this special case. Again, we are not limited to considering Bernoulli random variables, and may use the value of to measure how close the are to being Bernoulli.
4.1 Application to simple random sampling
Let be (not necessarily distinct) non-negative integers. Suppose we take a random sample of size without replacement from this collection of numbers, and let denote this sample. The random variables are negatively related (see Section 3.2 of [15]), and we will consider Poisson approximation of using Corollary 4.2. Straightforward calculations give and
[TABLE]
Noting that, by exchangeability, for each , we may take
[TABLE]
and apply Corollary 4.2 with these choices.
In the case where each is either 0 or 1, has a hypergeometric distribution, for which good Poisson approximation bound are well-known; see, for example, Theorem 6.A of [3] for an upper bound obtained using the Stein–Chen method. This bound is obtained by writing as a sum of negatively related Bernoulli random variables. Note that our result generalises this bound: in the case where each is either 0 or 1, we may take , and we recover the upper bound given by [3].
5 A Poincaré inequality
Next, we show how the assumptions of Section 2 may be employed to prove a Poincaré inequality, relaxing strict monotonicity assumptions.
Definition 5.1**.**
Define the discrete Poincaré constant for a (non-negative, integer-valued) random variable by
[TABLE]
where the supremum is taken over the set
[TABLE]
We note the well-known lower bound
[TABLE]
obtained by choosing .
Theorem 1.1 of [9] proves that if satisfies (3), then . In Theorem 5.3 below, we weaken this condition, assuming only the conditions of Theorem 2.2(i), to prove an analogous result. The bound can be expressed in terms of the failure rate of , defined below.
Definition 5.2**.**
For a discrete random variable , define the failure (or hazard) rate
[TABLE]
and write , where the infimum is taken over the support of .
Throughout this section, we let have a Bernoulli distribution with mean , independent of all else.
Theorem 5.3**.**
Let be a non-negative, integer-valued random variable with . Let be such that , then
[TABLE]
Proof.
Our argument is based on that of [9]. As there, we also employ the kernel function defined by Klaassen [16]:
[TABLE]
Then Lemma 5.2 of [9] gives, for ,
[TABLE]
Following the argument on page 517 of [9], using the stochastic ordering assumption we make here, the second term on the right-hand side of (23) may be bounded by .
For the first term on the right-hand side of (23), we have that
[TABLE]
where this final equality follows from Eq. (15) of [9]. Again employing our stochastic ordering assumption, we obtain
[TABLE]
We therefore have the bound
[TABLE]
The theorem then follows. ∎
Remark 5.4**.**
In the case we obtain the bound of Theorem 1.1 of [9], as we would expect. If there is a coupling of and such that (8) holds for all , then the proof of Theorem 2.2(i) shows that the assumptions of Theorem 5.3 above hold, and we have our upper bound on .**
Example 5.5**.**
We return to the setting of Example 2.4, and let . Then and
[TABLE]
Since for all , we may use the increasing failure rate (IFR) property of the Poisson distribution to obtain the following bound from Theorem 5.3:
[TABLE]
*Expanding (24) in , we obtain In this case, the lower bound (22) becomes , showing that (24) is close to sharp for close to 1 and small.
Applying Theorem 5.3 relies on both controlling the failure rate of and finding a suitable . We conclude this section by showing that we can bound , and hence , under the standard -log-concavity condition (see [5]).
Corollary 5.6**.**
Let be a non-negative, integer-valued random variable with . Assume that there exists such that
[TABLE]
for all . Then
[TABLE]
Proof.
We first show that, if , . With this, we will then see that the bound follows from Theorem 5.3. That is, we begin by showing that, as in (11), for all , i.e.,
[TABLE]
We observe that, summing the collapsing sum in (25) from to , we obtain:
[TABLE]
for all . Fixing some , we have by (27) that
[TABLE]
and the result (26) follows with .
Now, (25) implies that is log-concave, which in turn implies the IFR property, so that . Substituting this into Theorem 5.3 we obtain our upper bound on .
∎
Notice that if is Poisson with mean , we can take in (25), to recover the fact that and deduce the standard Poisson Poincaré inequality . Note also that in [14], the (stronger) fact that is proved under the -log-concavity condition (25), but this requires use of the discrete Bakry-Émery theory of [5].
6 Normal approximation
Finally, we show how our assumptions will carry over to normal approximation, again based on ideas used in Stein’s method. Several different coupling constructions are used in Stein’s method for normal approximation; see [6] for an introduction to these, and to the area more generally. In line with our work in Section 2, we will consider normal approximation using the size-biased coupling; again we will consider a non-negative random variable and write for its size-biased version. In this setting, a natural assumption under which normal approximation results for have been established is boundedness of ; see Theorems 5.6 and 5.7 of [6], for example.
We prove an analogous normal approximation theorem which allows the random variable to be contaminated with some independent noise, as we did in the Poisson approximation setting in Section 2.2. Our bound will be stated in terms of the Kolmogorov distance, defined by
[TABLE]
and for a non-negative random variable we will write
[TABLE]
Before stating the main theorem of this section, we note that for any non-negative random variable , , so that it is always possible to couple and such that almost surely. We also note that the existence of a size-biased coupling such that is equivalent to the assertion that ; see Section 7 of [1].
Theorem 6.1**.**
Let and be independent, non-negative random variables, with a.s. for some . Assume also that
- •
* and are coupled such that for some , and*
- •
* and are coupled such that .*
Let , and . Define and
[TABLE]
Then
[TABLE]
Proof.
Abusing notation, let and, for fixed , let be the solution to the Stein equation , where is the distribution function of the standard normal distribution. Note the standard bound (see Lemma 2.3 of [6], for example).
Now, following the proof of Theorem 5.7 of [6], we write
[TABLE]
The absolute value of the final term on the right-hand side of (28) is bounded, as in Theorem 5.7 of [6], by
[TABLE]
Note that we can write , where is a Bernoulli variable with mean independent of all else; see Corollary 2.1 of [6]. Conditioning on , we then have
[TABLE]
by the assumptions of our theorem.
We use our representation of , and the independence of and , to write the first term on the right-hand side of (28) as
[TABLE]
Conditioning on and , and applying the bound , this may be bounded by . ∎
6.1 Application: the lightbulb process
Consider the following model, motivated by a pharmaceutical study of dermal patches designed to activate particular receptors, though often phrased in terms of lightbulbs being switched on and off; see [13] and references therein. We begin with lightbulbs, all switched off. At time (for ), exactly of the lightbulbs are chosen, uniformly at random, and their state switched. One random variable of interest is , the number of lightbulbs switched on after time .
Goldstein and Zhang [13, Theorem 1.1] prove a bound for normal approximation of . Combining their bound with our Theorem 6.1, we see the effect of ‘contaminating’ by , which (for simplicity) we define to have a distribution, independent of . Other types of contamination can also be investigated in this framework.
Also for simplicity, we will restrict attention to the case where is even. In this case we have , and we let denote the variance of , which is equal to ; see [13] for further details, and for a discussion of the differences between the cases where is even and is odd. Goldstein and Zhang [13] also provide a coupling such that a.s., and show that
[TABLE]
Straightforward calculations (in the spirit of Section 5.3 of [6]) show that . Hence, letting , the bound of Theorem 6.1 becomes
Proposition 6.2**.**
[TABLE]
where .
So, for example, if and this bound is of the same order, , as the bound on given by Theorem 1.1 of [13], where .
Appendix A Proof of Lemma 2.1
Let . Substituting the Stein–Chen equation (5), we have
[TABLE]
where we deal with the first term in (A) since for any function with , summation by parts gives
[TABLE]
The result follows taking , since we know by assumption.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Arratia and P. Baxendale (2015). Bounded size bias coupling: a Gamma function bound, and universal Dickman-function behavior. Probab. Theory Related Fields 162 , 411-429.
- 2[2] F. Ball and A. D. Barbour (1990). Poisson approximation for some epidemic models. J. Appl. Probab. 27 , 479–490.
- 3[3] A. D. Barbour, L. Holst and S. Janson (1992). Poisson Approximation . Oxford University Press, Oxford.
- 4[4] M. V. Boutsikas and M. V. Koutras (2000). A bound for the distribution of the sum of discrete associated or negatively associated random variables. Ann. Appl. Probab. 10 , 1137–1150.
- 5[5] P. Caputo, P. Dai Pra and G. Posta (2009). Convex entropy decay via the Bochner-Bakry-Emery approach. Ann. Inst. Henri Poincaré Probab. Stat. 45 , 734–753.
- 6[6] L. H. Y. Chen, L. Goldstein and Q.-M. Shao (2011). Normal Approximation by Stein’s Method . Springer, Berlin.
- 7[7] N. Cook, L. Goldstein and T. Johnson (2017). Size biased couplings and the spectral gap for random regular graphs. Ann. Probab. , to appear.
- 8[8] F. Daly (2013). Compound Poisson approximation with association or negative association via Stein’s method. Electron. Commun. Probab. 18 (30), 1–12.
