On Negatively Dependent Sampling Schemes, Variance Reduction, and Probabilistic Upper Discrepancy Bounds
Michael Gnewuch, Marcin Wnuk, Nils Hebbinghaus

TL;DR
This paper explores negative dependence in sampling schemes to improve variance and discrepancy bounds, providing new explicit bounds and comparing different negative dependence notions.
Contribution
It introduces new pre-asymptotic bounds with explicit constants for discrepancy measures under negative dependence conditions.
Findings
Negative dependence can lead to improved variance bounds.
Explicit constants are derived for star discrepancy bounds.
Several negatively dependent sampling schemes are exemplified.
Abstract
We study some notions of negative dependence of a sampling scheme that can be used to derive variance bounds for the corresponding estimator or discrepancy bounds for the underlying random point set that are at least as good as the corresponding bounds for plain Monte Carlo sampling. We provide new pre-asymptotic bounds with explicit constants for the star discrepancy and the weighted star discrepancy of sampling schemes that satisfy suitable negative dependence properties. Furthermore, we compare the different notions of negative dependence and give several examples of negatively dependent sampling schemes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On Negatively Dependent Sampling Schemes, Variance Reduction, and Probabilistic Upper Discrepancy Bounds
Marcin Wnuk Mathematisches Seminar, Universität Osnabrück, Germany ([email protected]).
Michael Gnewuch Institut für Mathematik, Universität Osnabrück, Germany ([email protected]).
Nils Hebbinghaus Institut für Informatik, Christian-Albrechts-Universität zu Kiel, Germany ([email protected]).
Abstract
We study some notions of negative dependence of a sampling scheme that can be used to derive variance bounds for the corresponding estimator or discrepancy bounds for the underlying random point set that are at least as good as the corresponding bounds for plain Monte Carlo sampling.
We provide new pre-asymptotic bounds with explicit constants for the star discrepancy and the weighted star discrepancy of sampling schemes that satisfy suitable negative dependence properties. Furthermore, we compare the different notions of negative dependence and give several examples of negatively dependent sampling schemes, including mixed sequences.
1 Introduction
Plain Monte Carlo (MC) sampling is a method frequently used in stochastic simulation and multivariate numerical integration. Let be independent random points, each uniformly distributed in the -dimensional unit cube . For an arbitrary integrable random variable (or function) we consider the MC estimator (or quadrature)
[TABLE]
for the expected value (or integral)
[TABLE]
An advantage of the MC estimator is that already under the very mild assumption on to be square integrable, it converges to for with convergence rate . Even though the convergence rate is not very impressive, it has the invaluable advantage that it does not depend on the number of variables .
However, there are many dependent sampling schemes (i.e., random sample points , , that are still uniformly distributed in , but not necessarily independent any more) known that are superior to plain MC sampling with respect to certain objectives. An example are suitably randomized quasi-Monte Carlo (RQMC) point sets. They ensure, for instance, higher convergence rates for numerical integration of sufficiently smooth functions, they lead to much smaller asymptotic discrepancy measures, their sample points do not tend to cluster and have more evenly distributed lower dimensional projections (see, e.g., [5, 6, 19, 22]). It would be desirable to be able to construct dependent sampling schemes that have some of these or other favorable properties, and that are, with respect to other objectives, at least as good as MC sampling schemes.
Recently, in this direction some research had been done. In [20] Christiane Lemieux showed that a negative dependence property of RQMC points ensures that the variance of the corresponding RQMC estimator for functions that are monotone with respect to each variable is never larger than the variance of the corresponding MC estimator . She also proved that a different negative dependence property yields that the variance of the RQMC estimator for an arbitrary bounded quasi-monotone is never larger than the variance of . Those negative dependence properties rely solely on the marginals and the bivariate copulas of the RQMC points (i.e., on the distribution of single points and on the common distribution of pairs of points). Related results can be found in [33].
In a different line of research the second and the third author of this book chapter showed in [12, 14] that a specific negative dependence property of RQMC points guarantees that they satisfy the same pre-asymptotic probabilistic discrepancy bounds (with explicitly revealed dependence on the number of points as well as on the dimension ) as MC points. Here the negative dependence property relies on the common distribution of all sample points. Related results can be found in [7].
For more extensive motivations of both lines of research we refer to the elaborate introductions of [20] and [7, 12], respectively. The aim of this book chapter is to survey and compare the approaches mentioned above and to provide several new results.
This chapter is organized as follows: In Section 2 we introduce some notions of negatively dependent sampling schemes and discuss how one can benefit from them. In Section 3 we provide new probabilistic upper discrepancy bounds for sampling schemes. The discrepancy measures we consider are the star discrepancy and the weighted star discrepancy. These bounds are “plug-in results” in the following sense: One just has to check whether a sampling scheme satisfies the sufficient negative dependence condition and – if this is the case – obtains immediately a probabilistic discrepancy bound with explicitly given constants. In the Section 4, we give several examples of sampling schemes that satisfy the one or the other notion of negative dependence, including a generalized notion of stratified sampling schemes and mixed randomized sequences. Finally, in the Section 5 we elaborate on relations between different notions of negative dependence.
We finish the introduction by stating some notation. Let If not stated otherwise we are always considering a randomized point set consisting of points. For we write if All other inequalities are also to be understood componentwise. Moreover, . Via we denote the set of boxes (“corners”) anchored at [math]
[TABLE]
and by the set of boxes anchored at
[TABLE]
We write for the set of differences of boxes anchored at [math],
[TABLE]
For we denote the set by , stands for the d-dimensional Lebesgue measure on in case we just write If not specified, all random variables are defined on a probability space .
2 Review of Notions of Negative Dependance of Sampling Schemes
2.1 -Negative Dependence of Binary Random Variables and Sampling Schemes
The concept of negative dependence was introduced by Lehmann [18] for pairs of random variables. In the literature one finds several contributions on rather demanding notions of negative dependence as, e.g., negative association introduced in [16]; a survey can be found in [30]. Sufficient for our purpose is the following notion for Bernoulli or binary random variables, i.e., random variables that only take values in .
Definition 2.1**.**
Let . We call binary random variables upper -negatively dependent if
[TABLE]
and lower -negatively dependent if
[TABLE]
We call -negatively dependent if both conditions (2) and (3) are satisfied. If , we usually suppress the explicit reference to .
-Negative dependence is usually called negative orthant dependence, cf. [4].
Notice that, in particular, independent binary random variables are negatively dependent. Furthermore, it is easily seen that for and the notions of upper and lower -negative dependence are equivalent, cf. [18].
We are interested in binary random variables , , of the form , where is a Lebesgue-measurable subset of (whose characteristic function is denoted by ), and are randomly chosen points in .
We will use the following bound of Hoeffding-type; for a proof see, e.g., [14].
Theorem 2.2**.**
Let , and let be -negatively dependent binary random variables. Put . We have
[TABLE]
Definition 2.3**.**
A randomized point set is called a sampling scheme if every single is distributed uniformly in and the vector is exchangeable, meaning that for any permutation of it holds that the law of is the same as the law of
The assumption of exchangeability is only of technical nature and, if we consider as a randomized point set, may be always obtained in the process of symmetrization. Indeed, let be a point set such that every is uniformly distributed in and let be a random uniformly chosen permutation of Then is already a sampling scheme.
2.2 Pairwise Negative Dependence and Variance Reduction
Definition 2.4**.**
We say that a sampling scheme is pairwise negatively dependent if for every it holds that the random variables
[TABLE]
are negatively dependent. In other words, a sampling scheme is pairwise negatively dependent if for every we have
[TABLE]
[TABLE]
Note that (5) implies (6) and vice versa, therefore one of the conditions is in fact redundant. In [20] this is known as negatively upper orthant dependent - or NUOD - sampling schemes.
Our interest lies in numerical integration of functions from some class using RQMC. A QMC quadrature is just a quadrature consisting of nodes, such that the evaluation in every node is given the same weight By randomizing the set of nodes we obtain an RQMC quadrature. Let be an RQMC estimator of based on the sampling scheme i.e.
[TABLE]
Moreover, let be an estimator of based on a Monte Carlo sample consisting of points (i.e. the integration nodes are chosen independently and uniformly from , see (1)).
It turns out that randomized QMC quadratures based on pairwise negatively dependent sampling schemes may lead to variance reduction in comparison to the simple MC quadratures. Here we describe shortly one of such cases, namely when integrands are bounded quasimonotone functions. The following exposition is based on [20].
To define what a quasimonotone function is we need first to introduce the notion of quasivolume. For and a function we write to represent the evaluation of at the point where for and otherwise. The quasivolume of over an interval is given by
[TABLE]
We say that the function is quasimonotone if
[TABLE]
for every interval Note that if we define a content then quasimonotonicity of means exactly that for any axis-parallel rectangle it holds
Apart from pairwise negative dependence there are a few similar notions which are also of interest. Let If for every and every measurable
[TABLE]
we say that the sampling scheme is conditionally negatively quadrant dependent (conditionally NQD). Here denotes the orthogonal projection of onto its first coordinates. Note that the conditional NQD property holds in particular if are independent and for every and every we have
[TABLE]
in which case we talk of a coordinatewise independent NQD sampling scheme. Christiane Lemieux showed in [20, Corollary 2] that conditionally NQD sampling schemes provide RQMC estimators of integrals with variance no bigger then the variance of the MC estimator if the integrand is monotone in each coordinate.
The following is basically a combination of Proposition Remark and Corollary from [20].
Theorem 2.5**.**
Let and be a sampling scheme. Then if either
The function is bounded and or is quasimonotone and is pairwise negatively dependent, 2. 2.
The function is monotone in each coordinate and is conditionally negatively quadrant dependent,
it holds
[TABLE]
In Section 5 we discuss relations between the introduced notions of negative dependence.
Let us note that the aforementioned paper provides actually more general results. Interested reader will find details in Sections 3 and 4 of [20].
For examples of pairwise negatively dependent and conditionally NQD sampling schemes see Sections 4.2 and 4.3.
2.3 Negatively Dependent Sampling Schemes and Discrepancy
Definition 2.6**.**
We say that a sampling scheme is negatively dependent if for every the random variables
[TABLE]
are -negatively dependent. In other words for every we require
[TABLE]
[TABLE]
Note that differently from the case of pairwise negative dependence, for one indeed needs to check both inequalities as they do not, in general, imply one another. If and we usually talk just of negatively dependent sampling schemes. Moreover, if (7) is satisfied we speak of upper negatively dependent sampling schemes and if (8) is satisfied we speak of lower negatively dependent sampling schemes.
To motivate the interest in negatively dependent sampling schemes we introduce the notion of discrepancy. Discrepancy is meant to quantify how far is a finite point set consisting of points from being equidistributed in It plays an important role in fields like numerical integration, computer graphics, empirical process theory and many more. Let and We define the discrepancy function for the point set at the point via
[TABLE]
and the star discrepancy by
[TABLE]
Making a connection to numerical integration we note one of the versions of the Koksma-Hlawka inequality, which states that for every point set consisting of points it holds
[TABLE]
where is the Hardy-Krause variation of . The inequality is actually sharp, cf. [24].
It has been shown in [12] that - - negatively dependent sampling schemes have with large probability star discrepancy of the order More precisely the following theorem holds.
Theorem 2.7**.**
*Let and . Let be a negatively --dependent sampling scheme.
Then for every *
[TABLE]
holds with probability at least Moreover, for every
[TABLE]
Notice that these bounds depend only mildly on or In particular, --negatively dependent sampling schemes satisfy the same preasymptotic discrepancy bound as Monte Carlo point sets do. For more details see [12].
In Remark 4.12 we present a bound similar to (10) under a bit different assumptions that can be applied to so-called mixed randomized sequences.
3 New Probabilistic Discrepancy Bounds
3.1 Bound on the Star Discrepancy for Negatively Dependent Sampling Schemes
Proving that a given sampling scheme is --negatively dependent may turn out to be a difficult task. One of the problems lies in the fact that elements of may in general not be represented as Cartesian products of one-dimensional intervals, cf. also Remark 4.12. With this in mind we would like to weaken the assumptions on the sampling scheme In the following result we show that by requiring the sampling scheme only to be --negatively dependent one already gets with high probability a discrepancy of the order
Theorem 3.1**.**
Let and . Let be a --negatively dependent sampling scheme in Then for every
[TABLE]
holds with probability at least where Moreover, for every
[TABLE]
where
The proof of Theorem 3.1 requires some preparation. To “discretize” the star discrepancy, we define –covers as in [8]: for any a finite set of points in is called a –cover of , if for every there exist such that and . The number denotes the smallest cardinality of a –cover of .
The following theorem was stated and proved in [10].
Theorem 3.2**.**
For any and we have
[TABLE]
Notice that due to Stirling’s formula we have and so the cardinality of the cover may be bounded from above by Furthermore, it is easy to verify that in the case the identity
[TABLE]
is established with the help of the -cover .
With the help of -covers the star discrepancy can be approximated in the following sense.
Lemma 3.3**.**
Let be an -point set, , and be a -cover of . Then
[TABLE]
The proof of Lemma 3.3 is straightforward, cf., e.g., [8, Lemma 3.1].
Now we are ready to prove Theorem 3.1.
Proof.
For to be chosen later let be a cover consisting of at most elements. Such a exists due to Theorem 3.2 and discussion thereafter.
Define
[TABLE]
Now Lemma 3.3 gives us
[TABLE]
For every and put
[TABLE]
Let Due to Hoeffding’s inequality applied to random variables (applicable since is - negatively dependent) we obtain for every
[TABLE]
With the help of a simple union bound we get
[TABLE]
Using the above we would like to find a bound on discrepancy of the sampling scheme which holds with probability at least We are looking for such that
[TABLE]
Put Inequality (15) holds true if
[TABLE]
Our problem boils now down to finding possibly small for which
[TABLE]
Specifying to be of the form
[TABLE]
we get that (16) is satisfied if
[TABLE]
Expanding in dependence of it suffices to find for which
[TABLE]
and one easily sees that this is satisfied for given in the statement of the theorem. To prove 11 one only needs to plug in into (14) and then consider the two cases and separately. ∎
3.2 Bound on the Weighted Star Discrepancy for -negatively Dependent Sampling Schemes.
One of the reasons why the QMC integration may be successfully applied in many high-dimensional problems is the fact that quite often only a small number of coordinates is really important. This observation led to the introduction of weighted function spaces and weighted discrepancies by Sloan and Woźniakowski in [31]. The above concepts are closely related to the theory of weighted spaces of Sobolev type, in particular the integration error in those spaces obeys a Koksma-Hlawka type upper bound, which may be phrased using the norm of the function and the weighted star discrepancy.
By weights we understand a set of non-negative numbers where is interpreted as the weight of the coordinates from Let denote the cardinality of For we write to denote the point in agreeing with on the coordinates from and having all the other coordinates set to
The weighted star discrepancy of a point set and weights is defined by
[TABLE]
The following theorem is similar in flavor to the Theorem from [2].
Theorem 3.4**.**
Let and let be a sampling scheme, such that for every its projection on the coordinates in is - - negatively dependent. Then for any weights and any it holds
[TABLE]
with probability at least Moreover, for it holds
[TABLE]
Proof.
We shall only prove the statement (17), the statement (18) follows then by simple calculations. For and put
[TABLE]
Here denotes the projection of on the coordinates from By Theorem 2.7 it holds
[TABLE]
Now
[TABLE]
∎
4 Examples of Negatively Dependent and Pairwise Negatively Dependent Sampling Schemes
Many sampling schemes, such as randomly shifted and jittered rank-1 lattices (cf. Section 4.2) and Latin hypercube sampling (cf. Section 4.3), are multidimensional generalizations of the one-dimensional simple stratified sampling. Simple startified sampling is defined in the following way: let be a uniformly chosen permutation of and let be independent random variables distributed uniformly on Moreover, is independent of We put
[TABLE]
Effectively, one is considering the partition of the unit interval and in every element of the partition putting one point, independently of all the other points. The simple lemma is a useful tool for our investigations and may be found e.g. in [33].
Lemma 4.1**.**
Simple stratified sampling is pairwise negatively dependent.
4.1 Negative Dependence of Generalized Stratified Sampling
We partition into sets with Let be a random vector distributed uniformly on
[TABLE]
Given the value of we place one point for each with uniformly and independently of all other points inside Symmetrizing this construction yields a sampling scheme which we call generalized stratified sampling (note that every single is uniformly distributed in . Here “generalized” has to be understood in the sense that there are possibly more strata then points.
Example 4.2**.**
There are many natural choices for the strata. The simplest one would be stripes of the form with However, one could also choose, e.g., elementary cells (i.e., fundamental parallelepipeds) of a rank-1 lattice (cf. [17]), see Figure 1.
To show that generalized stratified sampling is negatively dependent we need first a simple lemma.
Lemma 4.3**.**
Let and let
[TABLE]
The function
[TABLE]
takes on its maximum in the point
Proof.
We shall prove the statement by induction on The case is straightforward by Lagrange multipliers theorem. Suppose we have already shown the statement for and we would like to prove it for Firstly let us fix the value of It holds
[TABLE]
By the induction assumption for a fixed value of the last term is maximal when for we have where we put Plugging it into the above formula we obtain
[TABLE]
which we need to maximize with respect to It holds
[TABLE]
where and Now we have
[TABLE]
The derivative vanishes for at and Since and is a local maximum the claim follows. ∎
Theorem 4.4**.**
Let be a generalized stratified sampling as described above and be measurable. Then for every it holds
[TABLE]
In particular, generalized stratified sampling is - negatively dependent for any system of measurable subsets of .
Proof.
Fix as in the statement of the theorem and define
[TABLE]
Note that For we have
[TABLE]
By Lemma 4.3 it follows
[TABLE]
∎
Remark 4.5**.**
Without further information on the strata we cannot make any conclusions about pairwise negative dependence of generalized stratified sampling. As an example consider a stratified sampling scheme defined by two strata in One may choose and in such a way that and see Figure 2. In this case however
[TABLE]
and the sampling scheme is not pairwise negatively dependent.
On the other hand if we consider strata with then this practically boils down to the one-dimensional case and so the corresponding sampling scheme is pairwise negatively dependent, cf. Lemma 4.1.
4.2 Pairwise Negative Dependence and Conditional NQD Property of Randomly Shifted and Jittered Rank-1 Lattices
The exposition follows closely [33]. Let be prime. By we denote . Moreover, We also put and similarly
A discrete subgroup of the dimensional torus is called a lattice. A set is a rank-1 lattice if for some it admits a representation
[TABLE]
In this case is called a generating vector of the lattice.
Note that our definition differs from the usual one in that we allow only for generating vectors from and not from which saves us from considering some degenerate cases.
We want now to define a sampling scheme based on rank-1 lattices which we call randomly shifted and jittered rank-1 lattice. To this end let be a rank-1 lattice with generating vector chosen randomly uniformly from Let be distributed uniformly on be uniformly distributed on and be a uniformly chosen permutation of Moreover, let all of the aforementioned random variables be independent. We put
[TABLE]
We call the sampling scheme a randomly shifted and jittered rank-1 lattice (RSJ rank-1 lattice). Putting it in words: we first take a rank-1 lattice with a random generator and symmetrize it. Then we shift the lattice uniformly on the torus, where the shift has resolution . In the last step we jitter every point independently of all the other points in a cube of volume .
The following is Theorem 3.4. from [33].
Theorem 4.6**.**
Let be prime, . RSJ rank-1 lattice in is a coordinatewise independent NQD sampling scheme.
In particular, RSJ rank-1 lattice is a pairwise negatively dependent and a conditionally NQD sampling scheme, which means that both alternative conditions for from Theorem 2.5 hold.
In contrast to generalized stratified sampling (cf. Theorem 4.4) and Latin hypercube sampling (see Theorem 4.7), RSJ rank-1 lattice is for and in general not - negatively dependendent, see Subsection 5.1.
4.3 Negative Dependence, Conditional NQD Property, and Pairwise Negative Dependence of Latin Hypercube Sampling
Let be independent uniformly chosen permutations of and be independent random variables distributed uniformly on and independent also of the permutations. A sampling scheme is called a Latin hypercube sampling if the th coordinate of the th point is given by
[TABLE]
What one intuitively does is the following: one cuts into slices given by
[TABLE]
and puts points in such a way that in every slice there is exactly one point.
It is worth mentioning that for Latin hypercube sampling is exactly the same as RSJ rank-1 lattice (namely simple stratified sampling). For the joint distribution of a pair of points is the same for Latin hypercube sampling as for RSJ rank-1 lattice. But if we sample more than two points, then the joint distributions already differ, see [33].
Negative dependence of Latin hypercube Sample has been studied in [12] and pairwise negative dependence has been investigated in [33].
Theorem 4.7**.**
Latin hypercube Sample in is a sampling scheme which is
- (i)
* - - negatively dependent,*
- (ii)
* - negatively dependent,*
- (iii)
coordinatewise independent NQD.
In the above, statements and follow from Theorem from [12], and statement is Theorem from [33].
In particular from it follows that LHS is pairwise negatively dependent as well as conditionally NQD.
4.4 Pairwise Negative Dependence of Scrambled (0,m,s)-Nets.
The so called -nets belong to the most regular deterministic point sets. First defined by Niederreiter in [23], they have been subject of extensive research. For a nice introduction on -nets and their randomization, see [22].
Let us fix a base For and an interval of the form
[TABLE]
is called an elementary interval (in base b). Moreover, for and vectors and (where for every we require ) we define an -dimensional elementary interval via
[TABLE]
A -net is any such that for any elementary interval with there are exactly point in It is easily seen that a -net consists of exactly points. Specific constructions of -nets are known.
Scrambling of depth is a bijective function such that for any elementary interval with the image is again an elementary interval of volume
Now let us focus on the case Taking a -net and applying to it a random scrambling of depth one obtains a randomized point set. Scramblings are defined in such a way that for any scrambling of depth and a -net , the point set is again a -net. By an appropriate choice of randomized scrambling one may make to be a sampling scheme. In this case we call a scrambled -net. Scrambling as a way of randomization of -nets has been studied by A.B. Owen, e.g., in [29].
In a recent article [21] C. Lemieux and J. Wiart have shown the following theorem (which follows from Corollary from the aforementioned article).
Theorem 4.8**.**
Scrambled -nets are pairwise negatively dependent sampling schemes.
4.5 Mixed Randomized Sequences
As already mentioned, part of the success of RQMC stems from the fact that in many high-dimensional practical integration problems only a small number of coordinates is of real importance. It stands to reason that one tries to use it to his avail by constructing quadratures in which one uses RQMC on the “important” coordinates and simple (usually much cheaper) Monte Carlo for the rest of the coordinates. This method is sometimes referred to as padding and the resulting sequences of integration nodes are called mixed sequences. Let us give a formal definition.
Definition 4.9**.**
Let with . Let be a sequence in , and let be a sequence in . The -dimensional concatenated sequence is called a mixed sequence. If is a sequence of independent uniformly distributed random points, one also says that results from by padding by Monte Carlo and calls a hybrid-Monte Carlo sequence. If and are both randomized sequences, we call a mixed randomized sequence.
Padding by Monte Carlo was introduced by Spanier in [32] to tackle problems in particle transport theory. He suggested to use a hybrid-Monte Carlo sequence resulting from padding a deterministic low-discrepancy sequence. Hybrid-Monte Carlo sequences showed a favorable performance in several numerical experiments, see, e.g., [26, 27]. The latter papers also provided theoretical results on probabilistic discrepancy estimates of hybrid-Monte Carlo sequences which have been improved in [3, 11]. Favorable discrepancy bounds for padding Latin hypercube sampling (LHS) by Monte Carlo were provided in [12]. Padding a sequence by LHS (instead of by Monte Carlo) was considered earlier by Owen [28, Example 5].
A related line of research, initiated in [25], is to study the discrepancy of concatenated sequences that result from two deterministic sequences. More recent results can, e.g., be found in [13, 9, 15] and the literature mentioned therein.
The following proposition shows that concatenating two mutually independent negatively dependent sampling schemes results again in a (higher dimensional) negatively dependent sampling scheme. A weaker version of the next proposition may be found in [14]; cf. Lemma 5 there.
Proposition 4.10**.**
Let such that . Let , be Borel measurable sets. Let be a sampling scheme in and a sampling scheme in . Furthermore, let .
- (i)
If the random variables , , and , , are upper negatively - and - dependent, respectively, and mutually independent, then the random variables , , induced by the random vectors in , are upper negatively -dependent.
- (ii)
If the random variables , , and , , are lower negatively - and -dependent, respectively, and mutually independent, then the random variables , , induced by the random vectors , … ,* in , are lower negatively -dependent.*
Proof.
Let us first prove statement (i). Obviously we have for
[TABLE]
We now prove statement (ii). Take any and set . Suppose first that is a hybrid-Monte Carlo sequence, i.e. is a Monte Carlo sampling scheme. Due to our assumptions in statement (ii) we obtain
[TABLE]
Now let be any sampling scheme in such that the random variables are lower -negatively dependent and let be a Monte Carlo sampling scheme in ; we assume both sampling schemes to be mutually independent to . Analogously as in the previous case we obtain
[TABLE]
It follows from the case of hybrid-Monte Carlo sequences that
[TABLE]
∎
Remark 4.11**.**
It follows easily on closer examination of the proof that for the statement (i) of Proposition 4.10 to hold true we need only and to be negatively - respectively -upper dependent point sets, not necessarily sampling schemes. Moreover, if in (ii) we assume that is a Monte Carlo sampling scheme we also do not need to assume that is a sampling scheme.
Remark 4.12**.**
Let , be systems of measurable sets in and , respectively. Let be an --negative dependent sampling scheme in and an --negative dependent sampling scheme in ; both sampling schemes should be mutually independent. Furthermore, let be the resulting concatenated sampling scheme in , i.e., , .
- (i)
If and , we obtain from Proposition 4.10 that the mixed randomized sequence is --negatively dependent, which implies that we may directly apply Theorem 3.1 to obtain a probabilistic discrepancy bound for .
- (ii)
If and , we obtain from Proposition 4.10 that is -negatively dependent with respect to the set system
[TABLE]
Hence Theorem 2.7 is unfortunately not directly applicable to . Nevertheless, one may prove a counterpart of Theorem 2.7 with slightly worse constants that relies on negative dependence with respect to . Namely, one may show for every that
[TABLE]
The bound is based on the following simple observation: To estimate the local discrepancy of in a test box , the strategy used in [12] (and earlier in [2]) is to decompose into finitely many disjoint differences of boxes such that . This gives
[TABLE]
Now let us consider a fixed index . Then we find such that and . Furthermore, we may write and with and . Then we may represent as disjoint union
[TABLE]
Thus
[TABLE]
where . Now large deviation inequalities of Bernstein- and Hoeffding-type can be used to obtain for each of the random variables , the same upper bound as for the local discrepancy of a --negative dependent sampling schemes in the proof of [12, Theorem 4.3]. This, combined with (20) und (21), results in a probabilistic discrepancy bound for that is as most as twice as big as the one from Theorem 2.7; for further details see [12, Proof of Theorem 4.3].
5 Relations Between Notions of Negative Dependence
It may be easily seen that the coordinatewise independent NQD property implies the pairwise negative dependence property as well as the conditional NQD property. It turns out that this is the only valid implication between the considered notions of negative dependence. In this section we give examples showing that other implications do not hold.
5.1 Pairwise Negative Dependence and Negative Dependence
Neither the pairwise negative dependence of a sampling scheme implies the negative dependence, nor the other way round.
Example 5.1**.**
We first show an example of a negatively dependent sampling scheme which is not pairwise negatively dependent. To this end consider a sampling scheme consisting of just two points with joint CDF given by
[TABLE]
It is easy to see that is continuous, qusi-monotone, and , which implies that is a CDF of a sampling scheme. Moreover,
[TABLE]
so the sampling scheme is - negatively dependent. Notice that due to it is equivalent to saying that the sampling scheme is - negatively dependent. However, for instance
[TABLE]
Example 5.2**.**
To see that even the stronger coordinatewise independent NQD property does not imply the negative dependence property consider RSJ rank-1 lattice defined in Subsection 4.2. On the one hand, according to Theorem 4.6, RSJ rank-1 lattice is coordinatewise independent NQD. On the other hand, let us consider the situation for and a large to be chosen later. We put Obviously
[TABLE]
We also have
[TABLE]
the inequality follows since for the diagonal configuration of the points (i.e. for some permutation of ) there is one triple of points always lying in Notice that any generating vector of the form and any shift of the form results in a diagonal configuration. Now for large enough it holds
[TABLE]
5.2 Conditional NQD and Pairwise Negative Dependence
Example 5.3**.**
First we show an example of a pairwise negatively dependent sampling scheme which is not conditionally NQD. Let denote the slots. Now we are considering a sampling scheme such that given the slots the points are distributed uniformly within the slots and are independent. Denote and set
[TABLE]
[TABLE]
[TABLE]
It is easy to see that is not conditionally NQD, e.g.
[TABLE]
Showing that is pairwise negatively dependent requires simple but tedious calculations and as such will be omitted. Intuitively it is clear, since the sampling scheme gives high probability to diagonal arrangements (i.e. ).
Example 5.4**.**
Now we show an example of a sampling scheme which is conditionally NQD but not pairwise negatively dependentd. To this end let be two independent random variables distributed uniformly on We consider a sampling scheme given by Let and be measurable. Sampling scheme is conditionally NQD since
[TABLE]
On the other hand is not pairwise negatively dependent. To see this note that
[TABLE]
and
[TABLE]
Taking for some the point satisfying and yields the claim.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. Aistleitner , Covering numbers, dyadic chaining and discrepancy , J. Complexity, 27 (2011), pp. 531–540.
- 2[2] , Tractability results for the weighted star-discrepancy , J. Complexity, 30 (2014), pp. 381–391.
- 3[3] C. Aistleitner and M. T. Hofer , Probabilistic error bounds for the discrepancy of mixed sequences , Monte Carlo Methods Appl., 18 (2012), pp. 181–200.
- 4[4] H. W. Block, T. H. Savits, and M. Shaked. , Some concepts of negative dependence , Ann. Probab., 10 (1982), pp. 765–772.
- 5[5] J. Dick, F. Y. Kuo, and I. H. Sloan , High dimensional integration – the quasi-Monte Carlo way , Acta Numerica, 22 (2013), pp. 133–288.
- 6[6] J. Dick and F. Pillichshammer , Digital nets and sequences , Cambridge University Press, Cambridge, 2010.
- 7[7] B. Doerr, C. Doerr, and M. Gnewuch , Probabilistic lower discrepancy bounds for Latin hypercube samples , in Contemporary Computational Mathematics – a Celebration of the 80th Birthday of Ian Sloan, J. Dick, F. Y. Kuo, and H. Woźniakowski, eds., Springer-Verlag, 2018, pp. 339–350.
- 8[8] B. Doerr, M. Gnewuch, and A. Srivastav , Bounds and constructions for the star discrepancy via δ 𝛿 \delta -covers , J. Complexity, 21 (2005), pp. 691–709.
