Bootstrap inference for the finite population total under complex sampling designs
Zhonglei Wang, Jae Kwang Kim, Liuhua Peng

TL;DR
This paper introduces a unified bootstrap method for complex survey designs, improving finite population total inference accuracy over traditional methods, especially with small samples.
Contribution
It develops a bootstrap approach applicable to complex sampling schemes like Poisson and probability-proportional-to-size sampling, using studentization and multinomial bootstrapping.
Findings
The proposed bootstrap method is second-order accurate.
It outperforms Wald-type methods in coverage rate with limited samples.
Simulation results confirm improved inference accuracy.
Abstract
Bootstrap is a useful tool for making statistical inference, but it may provide erroneous results under complex survey sampling. Most studies about bootstrap-based inference are developed under simple random sampling and stratified random sampling. In this paper, we propose a unified bootstrap method applicable to some complex sampling designs, including Poisson sampling and probability-proportional-to-size sampling. Two main features of the proposed bootstrap method are that studentization is used to make inference, and the finite population is bootstrapped based on a multinomial distribution by incorporating the sampling information. We show that the proposed bootstrap method is second-order accurate using the Edgeworth expansion. Two simulation studies are conducted to compare the proposed bootstrap method with the Wald-type method, which is widely used in survey sampling. Results…
| Design | Method | |||||
|---|---|---|---|---|---|---|
| C.R. | C.L. | C.R. | C.L. | |||
| Poisson | Bootstrap | 0.90 | 15.5 | 0.90 | 3.6 | |
| Wald-type | 0.84 | 12.1 | 0.88 | 3.6 | ||
| SRS | Bootstrap | 0.90 | 13.0 | 0.89 | 2.8 | |
| Wald-type | 0.83 | 9.1 | 0.89 | 2.8 | ||
| PPS | Bootstrap | 0.88 | 10.3 | 0.90 | 2.6 | |
| Wald-type | 0.83 | 7.5 | 0.89 | 2.6 | ||
| Design | ||||||||
|---|---|---|---|---|---|---|---|---|
| Bootz | Bootz | |||||||
| Poisson | -0.5 | 0.37 | 0.31 | 0.36 | 0.32 | 0.31 | 0.32 | |
| -0.25 | 0.45 | 0.40 | 0.44 | 0.41 | 0.40 | 0.41 | ||
| -0.1 | 0.50 | 0.46 | 0.49 | 0.47 | 0.46 | 0.47 | ||
| 0 | 0.54 | 0.50 | 0.53 | 0.51 | 0.50 | 0.51 | ||
| 0.1 | 0.58 | 0.54 | 0.57 | 0.55 | 0.54 | 0.55 | ||
| 0.25 | 0.64 | 0.60 | 0.63 | 0.61 | 0.60 | 0.61 | ||
| 0.5 | 0.73 | 0.69 | 0.73 | 0.70 | 0.69 | 0.70 | ||
| SRS | -0.5 | 0.37 | 0.31 | 0.34 | 0.32 | 0.31 | 0.32 | |
| -0.25 | 0.45 | 0.40 | 0.42 | 0.41 | 0.40 | 0.41 | ||
| -0.1 | 0.50 | 0.46 | 0.48 | 0.47 | 0.46 | 0.47 | ||
| 0 | 0.54 | 0.50 | 0.52 | 0.51 | 0.50 | 0.51 | ||
| 0.1 | 0.58 | 0.54 | 0.56 | 0.55 | 0.54 | 0.55 | ||
| 0.25 | 0.63 | 0.60 | 0.62 | 0.61 | 0.60 | 0.61 | ||
| 0.5 | 0.73 | 0.69 | 0.71 | 0.70 | 0.69 | 0.70 | ||
| PPS | -0.5 | 0.37 | 0.31 | 0.34 | 0.33 | 0.31 | 0.32 | |
| -0.25 | 0.45 | 0.40 | 0.42 | 0.42 | 0.40 | 0.41 | ||
| -0.1 | 0.50 | 0.46 | 0.48 | 0.47 | 0.46 | 0.47 | ||
| 0 | 0.54 | 0.50 | 0.52 | 0.51 | 0.50 | 0.51 | ||
| 0.1 | 0.57 | 0.54 | 0.56 | 0.55 | 0.54 | 0.55 | ||
| 0.25 | 0.63 | 0.60 | 0.62 | 0.61 | 0.60 | 0.61 | ||
| 0.5 | 0.72 | 0.69 | 0.71 | 0.70 | 0.69 | 0.70 | ||
| Design | Method | C.R. | C.L. | |
|---|---|---|---|---|
| Poisson | (5,10) | Bootstrap | 0.90 | 114.08 |
| Wald-type | 0.85 | 98.58 | ||
| (10,30) | Bootstrap | 0.90 | 73.92 | |
| Wald-type | 0.88 | 68.66 | ||
| PPS | (5,10) | Bootstrap | 0.89 | 17.56 |
| Wald-type | 0.85 | 14.57 | ||
| (10,30) | Bootstrap | 0.90 | 9.40 | |
| Wald-type | 0.86 | 8.24 |
| Design | Bootz | ||||
|---|---|---|---|---|---|
| Poisson | (5,10) | -0.5 | 0.35 | 0.31 | 0.35 |
| -0.25 | 0.43 | 0.40 | 0.43 | ||
| -0.1 | 0.48 | 0.46 | 0.49 | ||
| 0 | 0.53 | 0.50 | 0.52 | ||
| 0.1 | 0.58 | 0.54 | 0.56 | ||
| 0.25 | 0.62 | 0.60 | 0.62 | ||
| 0.5 | 0.73 | 0.69 | 0.72 | ||
| (10,30) | -0.5 | 0.34 | 0.31 | 0.33 | |
| -0.25 | 0.43 | 0.40 | 0.42 | ||
| -0.1 | 0.48 | 0.46 | 0.48 | ||
| 0 | 0.52 | 0.50 | 0.52 | ||
| 0.1 | 0.56 | 0.54 | 0.56 | ||
| 0.25 | 0.61 | 0.60 | 0.62 | ||
| 0.5 | 0.72 | 0.69 | 0.71 | ||
| PPS | (5,10) | -0.5 | 0.32 | 0.31 | 0.32 |
| -0.25 | 0.41 | 0.40 | 0.41 | ||
| -0.1 | 0.47 | 0.46 | 0.47 | ||
| 0 | 0.51 | 0.50 | 0.51 | ||
| 0.1 | 0.55 | 0.54 | 0.55 | ||
| 0.25 | 0.61 | 0.60 | 0.62 | ||
| 0.5 | 0.71 | 0.69 | 0.71 | ||
| (10,30) | -0.5 | 0.32 | 0.31 | 0.32 | |
| -0.25 | 0.41 | 0.40 | 0.41 | ||
| -0.1 | 0.46 | 0.46 | 0.47 | ||
| 0 | 0.50 | 0.50 | 0.50 | ||
| 0.1 | 0.54 | 0.54 | 0.54 | ||
| 0.25 | 0.60 | 0.60 | 0.60 | ||
| 0.5 | 0.69 | 0.69 | 0.69 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurvey Sampling and Estimation Techniques · Bayesian Methods and Mixture Models · Advanced Statistical Methods and Models
Bootstrap inference for the finite population total under complex sampling designs
Zhonglei Wang Wang Yanan Institute for Studies in Economics and School of Economics, Xiamen University, Xiamen, Fujian 361005, P.R.C.
Jae Kwang Kim Department of Statistics, Iowa State University, Ames, IA 50011, U.S.A.; Email: [email protected]
Liuhua Peng School of Mathematics and Statistics, the University of Melbourne, Victoria 3010, Australia
Abstract
Bootstrap is a useful tool for making statistical inference, but it may provide erroneous results under complex survey sampling. Most studies about bootstrap-based inference are developed under simple random sampling and stratified random sampling. In this paper, we propose a unified bootstrap method applicable to some complex sampling designs, including Poisson sampling and probability-proportional-to-size sampling. Two main features of the proposed bootstrap method are that studentization is used to make inference, and the finite population is bootstrapped based on a multinomial distribution by incorporating the sampling information. We show that the proposed bootstrap method is second-order accurate using the Edgeworth expansion. Two simulation studies are conducted to compare the proposed bootstrap method with the Wald-type method, which is widely used in survey sampling. Results show that the proposed bootstrap method is better in terms of coverage rate especially when sample size is limited.
Keywords: Confidence interval, Edgeworth expansion, Multinomial distribution, Second-order accurate.
1 Introduction
Bootstrap, first proposed by Efron (1979), is a simulation-based approach for accessing uncertainty of estimates and for constructing confidence intervals. Bootstrap is widely used in that it is easy to implement and is second-order accurate under mild conditions (Hall, 1992, §3.3). However, classical bootstrap methods are not applicable under most sampling designs since the independent or identical distributed assumption may fail.
Under complex sampling, bootstrap methods have been proposed to handle variance estimation. In survey sampling, one of the most popular bootstrap approaches is the rescaling bootstrap method proposed by Rao and Wu (1988) under stratified random sampling, and they demonstrated that their bootstrap- intervals are second-order accurate if the variance component is known. Such a variance, however, is seldom known in practice. Rao et al. (1992) generalized the rescaling bootstrap method to cover the non-smooth statistics, but they did not discuss the second-order accuracy. Sitter (1992a) considered a mirror-match bootstrap method for sampling designs without replacement and discussed the second-order accuracy based on the known population variance as Rao and Wu (1988). Sitter (1992b) extended the without-replacement bootstrap method (Gross, 1980) to complex sampling designs and compared the proposed method with the rescaling bootstrap method (Rao and Wu, 1988) and the mirror-match bootstrap method (Sitter, 1992a). Shao and Sitter (1996) proposed a bootstrap method for the case when survey data are subject to missingness. Sverchkov and Pfeffermann (2004) proposed to use a multinomial distribution to reconstruct the finite population to estimate the mean square error. Beaumont and Patak (2012) proposed a generalized bootstrap method for variance estimation under Poisson sampling. Antal and Tillé (2011) proposed one-one resampling methods to estimate the variance for some complex sampling designs. Mashreghi et al. (2016) gave a comprehensive overview of the bootstrap methods in survey sampling for variance estimation.
In survey sampling, the literature on bootstrap-based approaches for interval estimation is very limited. Bickel and Freedman (1984) first considered interval estimation under stratified random sampling. Booth et al. (1994) generalized the method of Bickel and Freedman (1984) to show that the constructed confidence interval for a smooth function of the finite population mean is second-order accurate. However, all of the theoretical results, including that of Rao and Wu (1988) are restricted to stratified random sampling. Although Beaumont and Patak (2012) discussed a generalized bootstrap method for survey sampling with special attention to Poisson sampling, they did not provide rigorous results for the second-order accuracy of their methods.
In this paper, we focus on interval estimation under complex sampling. The goal of this study is to develop a unified bootstrap method to approximate the sampling distribution of the design-based estimator under some popular sampling designs, including Poisson sampling, simple random sampling (SRS) and probability-proportional-to-size (PPS) sampling. The proposed bootstrap methods apply multinomial distributions to generate the bootstrap finite populations by incorporating the sampling information, and the same sampling design is conducted to obtain a bootstrap sample from each bootstrap finite population. A similar idea has been successfully applied to SRS by Gross (1980) and Chao and Lo (1985). Our bootstrap methods differ from that proposed by Sverchkov and Pfeffermann (2004) in the sense that the finite population is iteratively bootstrapped, and an asymptotically pivotal statistic is used to make statistical inference for the finite population total. We also study the theoretical properties of the proposed bootstrap methods for different sampling designs using the Edgeworth expansion. We summarize our contributions in this paper below:
We have proposed a unified bootstrap method for interval estimation under some popular complex sampling designs, including Poisson sampling, SRS and PPS sampling. A simulation study also confirms that the proposed method works even under two-stage cluster sampling. 2. 2.
For three commonly used sampling designs, we have provided a rigorous proof for the second-order accuracy of the proposed bootstrap methods and shown that the estimation error is (DiCiccio and Romano, 1995) under mild conditions. Wald-type method is widely used in survey sampling, so the proposed bootstrap method is an important contribution since it provides more accurate inference compared with the Wald-type method under mild conditions. Besides, to our knowledge, we are the first to provide the Edgeworth expansion of a studentized estimator under Poisson sampling.
The remaining part of the paper is organized as follows. Sampling designs and design-based estimators under consideration are briefly reviewed in Section 2. In the following three sections, we propose bootstrap methods for Poisson sampling, SRS and PPS sampling, respectively, and theoretical properties are also investigated. Two simulation studies are conducted to compare the proposed bootstrap method with the Wald-type method in Section 6. Some concluding remarks are made in Section 7.
2 Sampling designs and estimates
In survey sampling, the finite population is often assumed to be fixed, and the randomness is due to the sampling design. Let be the finite population of size , and we are interested in making inference for the finite population total . For simplicity, we assume that the elements in are scalers. To avoid unnecessary details, we also assume that the population size is known, so it is equivalent to make statistical inference for the finite population mean .
We consider three commonly used sampling designs, including Poisson sampling, SRS and PPS sampling. For without-replacement sampling designs, such as Poisson sampling and SRS, is the sampling indicator with indicating that the -th element is in the sample and 0 otherwise, and is the first-order inclusion probability of the -th element for , where the expectation is taken with respect to the sampling design. Let be the set of first-order inclusion probabilities, and it is assumed to be known. Poisson sampling generates a sample based on independent Bernoulli experiments, one for each element in the finite population. That is, for , where is a Bernoulli distribution with success probability , and a sample is . Let be a realized sample size and be the expected sample size under Poisson sampling. For SRS, a without-replacement sample of size is selected with equal probabilities, and we can show for under SRS. Denote to be the Horvitz-Thompson estimator (Horvitz and Thompson, 1952) of under Poisson sampling, and the corresponding one is under SRS. The sample size is random under Poisson sampling, but it is fixed under SRS. Without loss of generality, assume that the first elements are sampled under Poisson sampling or SRS, and the design-unbiased variance estimators are and , respectively, where is the sample variance of , and .
PPS sampling generates a sample of size by independently and identically selecting an element from times with selection probabilities , where is the known selection probability of for and . Replicates may occur in the sample under PPS sampling, and the population total is estimated by the Hansen–Hurwitz estimator (Hansen and Hurwitz, 1943), which is denoted as , where , and if , and is the index of the selected element for the -th draw. A design-unbiased variance estimator is .
Throughout the paper, assume that the (expected) sample size is less than the population size. Since we study a sequence of finite populations and inclusion probabilities in the following three sections, assume that and are indexed by implicitly, and samples are generated independently for different finite populations. We use the notation “” to indicate that and have the same asymptotic order. That is, is equivalent to and .
3 Bootstrap method for Poisson sampling
We propose the following bootstrap method to approximate the sampling distribution of under Poisson sampling.
- Step 1.
Based on the sample , generate from a multinomial distribution with trials and a probability vector , where and
[TABLE]
for . Denote and , and they consist of copies of and , respectively. Let the bootstrap finite population total be . 2. Step 2.
For , generate independently from a binomial distribution with trials and a success probability . The bootstrap sample consists of replicates of under Poisson sampling. Denote and , where is the bootstrap variance estimator. 3. Step 3.
Repeat the two steps above independently times.
Step 1 corresponds to generating a bootstrap finite population and bootstrap first-order inclusion probabilities by incorporating the sampling information. Based on and , Step 2 is used to generate a bootstrap sample, from which a bootstrap replicate of is obtained. Instead of sampling from the bootstrap finite population directly, Step 2 provides a more efficient way to generate a sample using under Poisson sampling. In Step 2, we center by the bootstrap population total not by . The reason is that the finite population should be fixed, and the randomness is due to Poisson sampling. Thus, the statistic should be centered using the corresponding population total . If we center by , it causes additional variability due to generating different bootstrap finite populations. The same argument applies for the other two sampling designs. We use the empirical distribution of to approximate that of and make inference for .
Before discussing the theoretical properties of the proposed bootstrap method, we introduce some mild conditions on and .
- (C1)
There exist constants and such that , and satisfies
[TABLE]
for . 2. (C2)
The sequence of finite populations and first-order inclusion probabilities satisfy
[TABLE]
where , and is a positive constant. 3. (C3)
The following condition holds for finite populations, that is,
[TABLE]
where is a positive constant. 4. (C4)
Denote for , and let be the integer part of . Then, there exist constants and such that, for any subset of ,
[TABLE]
uniformly in , where is the imaginary unit.
We briefly comment on these conditions. Condition (C1) is commonly used in survey sampling (Fuller, 2009); the first part of (C1) is a mild restriction on the expected sample size, and the second part regulates the first-order inclusion probabilities. Condition (C2) rules out the degenerate case of the Horvitz–Thompson estimator under Poisson sampling. The moment condition in (C3) guarantees the convergence of the variance estimators and other quantities, and it is also required for SRS and PPS sampling that we will discuss in the following two sections. To illustrate the existence of and satisfying (C1) and (C2) simultaneously, consider , so (C1) holds, where and , for example. Then, we have If converges as , then (C2) holds. Condition (C4) is a counterpart of non-lattice assumption and is useful in deriving Edgeworth expansions. Specifically, for any subset of , condition (C4) ensures that have subsequences of length with different spans; see Feller (2008, §16.6) for more discussion on a similar assumption.
Denote to be a probability space, where and are the -algebra and the probability measure on associated with Poisson sampling, respectively. That is, , and , where , is the power set of , and for . Let be the product space and be the product--algebra; see Klenke (2014, §14.1) for details about the notations. By Corollary 14.33 of Klenke (2014), there exists a uniquely determined probability measure on such that , where for and .
Lemma 3.1**.**
Suppose that (C1)–(C3) hold. Then,
[TABLE]
as almost surely .
Let and . Then,
[TABLE]
In addition, denote and . Then,
[TABLE]
Lemma 3.1 shows some basic properties of the finite population quantities and their design-based estimators. Specifically, the ratio almost surely by (C2). Under Poisson sampling, is the third central moment of , and is a quantity involved in the Edgeworth expansion of the distribution of .
Theorem 3.1**.**
Assume that conditions (C1)–(C4) hold. Let be the cumulative distribution function of under Poisson sampling. Then,
[TABLE]
Furthermore,
[TABLE]
uniformly in , where is the cumulative distribution function of the standard normal distribution with the probability density function .
we make brief comments on the notation in (5) of Theorem 3.1. The probability on the left side of (5) is not random. However, we use estimators in the Edgeworth expansion to make it easier to compare (5) with the result in the following theorem, so instead of , it is reasonable to use on the right side of (5). Similar argument can be made for Edgeworth expansions under the other two sampling designs.
In order to establish the Edgeworth expansion for the conditional distribution of , we need the following assumption, which is similar to condition (C4) but with replaced by . We isolate (C4) and (C5) since (C5) is not needed for Theorem 3.1.
- (C5)
There exist constants and such that, for any subset of with cardinality ,
[TABLE]
uniformly in .
The next theorem presents the Edgeworth expansion for the distribution of based on the proposed bootstrap method.
Theorem 3.2**.**
Suppose that conditions (C1)–(C5) hold. Let be the cumulative distribution function of conditional on the bootstrap finite population . Then,
[TABLE]
uniformly in .
By comparing (5) in Theorem 3.1 with (6) in Theorem 3.2, we show that the proposed bootstrap method is second-order accurate, but the Wald-type method, which is based on the asymptotic normality of , is not if and are nonzero by noting the fact that and are design-unbiased estimators of and , respectively. Typically, the cumulative distribution function is hard to study analytically, so we use an empirical distribution to approximate it.
Now, consider establishing confidence intervals for the population total . An approximate two-sided confidence interval at significance level based on the Wald-type method can be constructed as
[TABLE]
where and are the and quantiles of the standard normal distribution, respectively. According to Theorem 3.1, though the upper and lower confidence limits of (7) have error rates of order , this two-sided confidence interval has error rate of order since is an even function of , and the order term in the Edgeworth expansion of cancel in the error rate. However, the order term leads to an error rates of order for one-sided confidence intervals based on the normal approximation.
The confidence interval of based on the proposed bootstrap methods is
[TABLE]
where and are the and quantiles of . By Theorem 3.2, the coverage error of (8) is of order . Moreover, the upper and lower limits of (8) have error rates , which outperforms the confidence interval (7) based on Wald-type method. In addition, the one-sided confidence interval by the proposed bootstrap method is more accurate than the one-sided confidence interval obtained by the Wald-type method. Furthermore, as discussed in Section 3.6 of Hall (1992), an asymmetric equal-tailed confidence interval may convey important information. The same arguments can be used for the other two sampling designs.
4 Bootstrap method for SRS
We propose the following procedure to make statistical inference for under SRS.
- Step 1.
Generate from , where for . Then, contains copies of for , and the bootstrap finite population total is . 2. Step 2.
Generate a bootstrap sample of size , denoted as , from using SRS. Then, we can obtain , where , , , and . 3. Step 3.
Repeat the two steps above independently times.
The three steps for SRS are similar to those under Poisson sampling, but we do not need since for . Different from that under Poisson sampling, the bootstrap sample is generated directly from . One commonly used algorithm to generate a sample of size under SRS is to select elements sequentially from the finite population without replacement. If , the computational complexity of selecting each element is . Besides the above bootstrap procedure, we propose the following one. It can be shown that these two procedures are equivalent under SRS, but the computational complexity of the latter is for selecting each element.
- Step 1’.
The same as Step 1 above. 2. Step 2’.
Initialize and for . 3. Step 3’.
Generate a bootstrap sample of size from under SRS.
- Step 3.1’.
Initialize . 2. Step 3.2’.
Select an index, say , from with selection probability for . 3. Step 3.3’.
Update if . Set if , and if , where for two sets and . 4. Step 3.4’.
Set , and go back to Step 3.2’ until . 5. Step 3.5’.
Obtain , where , , , and . 4. Step 4’.
Repeat the above three steps independently times.
We list some necessary conditions for studying the theoretical properties of the proposed bootstrap method under SRS.
- (C6)
There exist and such that and as . 2. (C7)
The finite population satisfies
[TABLE]
where , and is a positive constant. 3. (C8)
The distribution converges weakly to a strongly non-lattice distribution , where assigns probability to .
Condition (C6) is a counterpart of (C1), and it is used to rule out the trivial case when the sample size equals to that of the finite population. Condition (C7) regulates the variance of with respect to the distribution , and it concentrates our discussion on the non-degenerate case under SRS. The non-latticed assumption in (C8) is used to make the discussion easier, and a distribution is strongly non-latticed if for all ; see Babu and Singh (1984) for details.
We can use a similar argument made in Section 3 to show that there exists a probability measure on the product space equipped with the product -algebra .
Lemma 4.1**.**
Suppose that (C3), (C6) and (C7) hold. Then,
[TABLE]
where is the third central moment of with respect to the distribution . Besides,
[TABLE]
as almost surely . In addition,
[TABLE]
where , and is the sample mean.
Lemma 4.1 is the counterpart of Lemma 3.1 under SRS, and it shows the convergence properties of the sample variance and third central moment under mild conditions. We do not use scaling factors in (9)–(11) since the quantities involved are with respect to the distribution .
Theorem 4.1**.**
Suppose that (C3) and (C6)–(C8) hold. Then,
[TABLE]
uniformly in , where is the cumulative distribution function of .
Theorem 4.1 shows the Edgeworth expansion for the distribution of , and this result is obtained by one result in Section 2 of Babu and Singh (1985). Instead of using and as done by Babu and Singh (1985), we use their estimators in (12) based on Lemma 4.1.
Theorem 4.2**.**
Suppose that (C3) and (C6)–(C8) hold. Then, we have
[TABLE]
uniformly in , where is the cumulative distribution function of conditional on the bootstrap finite population .
Theorem 4.2 shows the Edgeworth expansion for the distribution of obtained by the proposed bootstrap method. By comparing (12) in Theorem 4.1 with (13) in Theorem 4.2, we have shown the second-order accuracy of the proposed bootstrap method.
5 Bootstrap method for PPS sampling
We consider PPS sampling in this section and propose the following bootstrap method to approximate the sampling distribution of .
- Step 1.
Obtain from a multinomial distribution , where and for . Then, consists of copies of , and the bootstrap finite population total is . The bootstrap selection probabilities are where , and consists of copies of for . 2. Step 2.
Based on , generate a sample of size by independently and identically selecting an element from times with selection probabilities . Then, we have , where , and if the index of the -th draw is , and is the counterpart of based on the bootstrap sample. 3. Step 3.
Repeat the two steps above independently times.
To implement the proposed bootstrap method for PPS sampling, the bootstrap selection probability should be standardized before drawing a sample. Similarly to the previous two sections, we use the empirical distribution of to make statistical inference for .
The computational complexity of selecting an element in Step 2 is . An equivalent way of carrying out the proposed bootstrap method under PPS sampling is described below, and its computational complexity is for selecting an element.
- Step 1’.
The same as Step 1 above. 2. Step 2’.
Obtain an independent and identical sample of size from , and the selection probability of is for . Denote to be the number of ’s in the sample. Then, we have , where and . 3. Step 3’.
Repeat the above three steps independently times.
The following regularity conditions are required to validate the proposed bootstrap method under PPS sampling.
- (C9)
There exists such that , and the selection probabilities satisfy
[TABLE]
for , where and are positive constants. 2. (C10)
The sequence of finite populations and selection probabilities satisfy
[TABLE]
where , and is a positive number. 3. (C11)
The distribution is non-lattice, where assigns probability to for .
Condition (C9) regulates the sample size and selection probabilities, and (C10) rules out the degenerate case under PPS sampling. To show (C9) and (C10) can be satisfied simultaneously, take for . Then, (C9) holds with , and , where . Thus, converges if both and converge as . Since corresponds to the PPS sampling procedure, condition (C11) focuses our attention to the non-lattice case.
Based on a similar argument made under Poisson sampling, there exists a probability measure on equipped with the product -algebra under PPS sampling.
Lemma 5.1**.**
Suppose that (C3), (C9) and (C10) hold. Then,
[TABLE]
as almost surely , where is the sample variance of . Let and , then
[TABLE]
Lemma 5.1 shows convergence properties of estimators of the variance and third central moment. The next theorem deals with the Edgeworth expansion for the distribution of .
Theorem 5.1**.**
Suppose that (C3), (C9)–(C11) hold. Then,
[TABLE]
uniformly in , where is the cumulative distribution function of under PPS sampling.
Based on the result in Theorem 5.1, the Wald-type method may provide inefficient inference results for compared with the proposed bootstrap method if the sample size is small and .
Theorem 5.2**.**
Suppose that (C3), (C9)–(C11) hold. Then, we have
[TABLE]
uniformly in , where is the cumulative distribution function of conditional on the bootstrap finite population .
Theorem 5.2 shows the Edgeworth expansion for the cumulative distribution function of based on the proposed bootstrap method. By comparing (16) in Theorem 5.1 with (17) in Theorem 5.2, we have shown that the proposed bootstrap method is second-order accurate under PPS sampling.
6 Simulation study
6.1 Single-stage sampling designs
We conduct a simulation study based on single-stage sampling designs in this section. A finite population is generated by
[TABLE]
for , where is an exponential distribution with a scale parameter , and the population size is , which is assumed to be known. The size measure is simulated by for , where . The expected sample size is . We are interested in constructing a 90% confidence interval for the finite population mean by survey data under the following sampling designs, and its true value is around 9.7.
Poisson sampling. The first-order inclusion probability is for , and its expected sample size is . 2. 2.
SRS with sample size . 3. 3.
PPS sampling. The selection probability for this design is for , and the sample size is .
Based on a sample, denote to be the design-unbiased variance estimator of , where is the design-unbiased estimate of under a specific sampling design. We consider the following methods to construct the 90% confidence interval.
- Method I.
Proposed bootstrap method by setting . Denote and to be the 5%-th and 95%-th sample quantiles of obtained by the proposed bootstrap method, where , and are the bootstrap counterparts of , and in the -th repetition. Then, a 90% confidence interval for can be constructed by
[TABLE]
- Method II.
Wald-type method. A Wald-type 90% confidence interval for is obtained by
[TABLE]
where and are the 5%-th and 95%-th quantiles of the standard normal distribution.
We conduct Monte Carlo simulations for each sampling design, and the two methods are compared in terms of the coverage rate and the length of the constructed 90% confidence interval. Table 1 summarizes the simulation results. When the sample size is small, the proposed bootstrap method is more preferable in the sense that its coverage rates are closer to 0.9 compared with the Wald-type method under the three sampling designs. The confidence interval constructed by the proposed bootstrap method is wider compared with that by the Wald-type method. As the sample size increases to , the performance of the two methods is approximately the same in the sense that the coverage rates of both methods are close to 0.9, and confidence interval lengths are approximately the same.
In addition, we also compare the two methods in terms of approximating the probability , which is obtained by 10 000 Monte Carlo simulations. We set as done by Lai and Wang (1993). Table 2 summarizes the simulation results. For both sample sizes, the proposed bootstrap method can approximate the target distribution well, but the performance of the Wald-type method is not as good as the proposed one when sample size is small.
6.2 Two-stage sampling designs
In this section, we test the performance of the proposed method under two-stage sampling designs. A finite population is generated by
[TABLE]
for and , where is a Poisson distribution with a rate parameter , , is the minimum cluster size, and is the number of clusters in the finite population. The finite population size is , and the cluster sizes range from 43 to 129. We assume that the finite population size and cluster sizes are known. We are interested in constructing a 90% confidence interval for the finite population mean , where the true value of is approximately 70.5.
We consider two different sampling designs for the first stage; one is Poisson sampling, and the other one is PPS sampling. The first-order inclusion probability (selection probability) of the -th cluster is proportional to its cluster size under Poisson (PPS) sampling for . SRS is conducted within each selected cluster independently in the second stage. The expected sample size of the first-stage sampling is , and that of the second-stage sampling is . In this simulation, we consider two scenarios for the sample sizes, that is, and .
The derivations of the design-unbiased estimator and its variance estimator under the two-stage sampling designs in this simulation study are presented in Appendix 9.2. We consider the following methods to construct the 90% confidence intervals for the parameters of interest.
- Method I.
The proposed method extended to a two-stage sampling design. This method is approximately the same as that mentioned in Section 6.1 with the following two steps to bootstrap the finite population, and we set for this method.
- Step 1.
Use the proposed method to bootstrap the clusters by treating them as “elements”, and the original sample within each selected cluster are replicated accordingly. 2. Step 2.
For each bootstrap cluster, apply the proposed method to bootstrap the cluster finite population independently.
- Method II.
Wald-type method, and it is the same as the one discussed in Section 6.1.
We conduct Monte Carlo simulations for each scenario. Table 3 summarizes the coverage rate and average length of the constructed 90% confidence interval for the finite population mean. The coverage rates of the proposed bootstrap method are closer to 0.9 even when the sample size is limited. However, the coverage rates of the commonly used Wald-type method are not as good as the proposed bootstrap method. Specifically, the coverage rates of the Wald-type method are only around 0.86 for three scenarios, and it improves to 0.88 when sample size is large under Poisson sampling. The confidence intervals of the proposed bootstrap method are wider than those of the Wald-type method when sample size is small.
As in Section 6.1, we also compare those two methods in terms of approximating , which is obtained by 10 000 Monte Carlo simulations. We set . Table 4 summarizes the simulation results. For both sample sizes, the proposed bootstrap method can approximate the target distribution well, but the performance of the Wald-type method is not as good as the proposed one especially when the sample size is small.
7 Conclusion
In this paper, we propose bootstrap methods for Poisson sampling, SRS and PPS sampling, and we show that the proposed bootstrap methods are second-order accurate. The first step of the proposed bootstrap methods corresponds to an inverse sampling procedure by incorporating the sampling information. Since the proposed bootstrap method is based on an asymptotically pivotal statistic, it is necessary to estimate the variance of the design-unbiased estimator. Simulation results show that the proposed bootstrap method provides more conservative confidence interval than the Wald-type method when the sample size is small, and the 90% confidence interval constructed by the proposed bootstrap method has a better coverage rate. Although the proposed bootstrap method is discussed under the single-stage sampling designs, simulation shows that it works well under some two-stage sampling designs, and Edgeworth expansion for two-stage sampling designs are under investigation. It may be extended to other complex sampling designs when the asymptotic distribution of the design-unbiased estimator exists, but the second-order accuracy may not be guaranteed. Besides, the proposed bootstrap method can be easily parallelized in practice.
8 Acknowledgment
We would like to thank Dr. J. N. K. Rao for the suggestion to discuss the simple random sampling and the two anonymous reviewers for the detailed and constructive comments.
9 Supplement
9.1 Proofs
For the purpose of clarity, we explicitly express , , , and for , , , and to highlight that they are indexed by , and the same notation is used for other quantities without further mentioning. Denote and to be the expectation and variance with respect to the probability measure of a specific sampling design, say under Poisson sampling, and to be the conditional mean and variance with respect to the multinomial distribution in the first steps of the proposed bootstrap method conditional on the realized sample , and and to be the expectation and variance with respect to the sampling design in the second step conditional on the bootstrap finite population .
Proof of Lemma 3.1.
Denote , then n_{0}N^{-2}\big{(}\hat{V}_{N,Poi}-V_{N,Poi}\big{)}=\sum_{i=1}^{N}X_{N,i}^{(1)}. Let be the event \big{\{}\big{\lvert}\sum_{i=1}^{N}X_{N,i}^{(1)}\big{\rvert}>\epsilon\big{\}} for , where and is the set of positive integers.
By the Borel-Cantelli Lemma (Athreya and Lahiri, 2006, Thereom 7.2.2), to show (1), it is enough to prove
[TABLE]
for . By the Markov’s inequality (Athreya and Lahiri, 2006, Proposition 6.2.4), we have
[TABLE]
where the last equality holds since for , and is independent of for with .
Consider
[TABLE]
where is a positive constant determined by (C1).
Next, consider
[TABLE]
where is a positive constant.
Based on some algebra and (C3), we have
[TABLE]
By (A.2), (A.3) and (A.4), we have
[TABLE]
for any fixed , where the last inequality holds by (C3). Since by (C1), we have proved (1) based on (A.1).
For , we have
[TABLE]
where the first inequality holds by and , the second inequality holds by (C1), and the last equality holds by (C3).
Mentioned that
[TABLE]
and
[TABLE]
we can obtain that by the Markov’s inequality. The results concerning and can be proved similarly and is omitted here. Thus, we finalize the proof of Lemma 3.1. ∎
The following lemmas are useful in establishing Theorem 3.1 and 3.2.
Lemma 9.1**.**
Denote for . Let and be the characteristic function (c.f.) of , where is the imaginary unit. Then under conditions (C1)–(C3),
[TABLE]
for all , where \nu_{N,Poi}^{(3)}=\sum_{i=1}^{N}|y_{N,i}|^{3}(1-\pi_{N,i})\big{\{}(1-\pi_{N,i})^{2}\pi_{N,i}^{-2}+1\big{\}}. Furthermore,
[TABLE]
for all |t|\leq\min\left(\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2},V_{N,Poi}^{3/2}/\left(4\nu_{N,Poi}^{(3)}\right)\right), where is a positive constant and recall that .
Proof.
As , and for . In addition,
[TABLE]
and
[TABLE]
Thus, , by (C2) and Lemma 3.1. In addition, \nu_{N,Poi}^{(3)}=\sum_{i=1}^{N}|y_{N,i}|^{3}(1-\pi_{N,i})\big{\{}(1-\pi_{N,i})^{2}\pi_{N,i}^{-2}+1\big{\}}=O(n_{0}^{-2}N^{-3}), which implies By the fact that , for . By Lemma 5.1 of Petrov (1995),
[TABLE]
for all .
It remains to show (A.7). Denote as the characteristic function of . Note that for any complex numbers , , , it follows that
[TABLE]
By Lemma 11.4.3 of Athreya and Lahiri (2006),
[TABLE]
for all |t|\leq\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2}, where is a positive constant and the last inequality is by the fact that
[TABLE]
Similarly, by Lemma 11.4.3 of Athreya and Lahiri (2006),
[TABLE]
for all |t|\leq\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2}.
Thus, if |t|\leq\min\left(\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2},V_{N,Poi}^{3/2}/\left(4\nu_{N,Poi}^{(3)}\right)\right),
[TABLE]
Mentioned that
[TABLE]
for some positive constant .
Finally, by (9.1), (9.1) and (A.10), it follows that
[TABLE]
for all |t|\leq\min\left(\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2},V_{N,Poi}^{3/2}/\left(4\nu_{N,Poi}^{(3)}\right)\right). This concludes the proof of this Lemma. ∎
Lemma 9.2**.**
Denote
[TABLE]
then under conditions (C1)–(C3),
[TABLE]
for all |t|\leq\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2}, where , , ,
[TABLE]
and satisfies
[TABLE]
with being a positive constant and \nu_{N,Poi}^{(3)}=\sum_{i=1}^{N}|y_{N,i}|^{3}(1-\pi_{N,i})\big{\{}(1-\pi_{N,i})^{2}\pi_{N,i}^{-2}+1\big{\}}.
Proof.
First, write
[TABLE]
for , where
[TABLE]
and is a positive constant. The last but one inequality is due to the fact that for any real number , . As a consequence, for any ,
[TABLE]
where
[TABLE]
with being a positive constant.
Denote for . By the same technique as in the proof of Lemma 5.1 of Petrov (1995), we can show that
[TABLE]
Using the inequality for all complex number , we can obtain that
[TABLE]
By Lemma 11.4.3 of Athreya and Lahiri (2006),
[TABLE]
for all |t|\leq\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2}. Thus, we obtain
[TABLE]
for all |t|\leq\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2}.
Finally, we have
[TABLE]
for all |t|\leq\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2} and satisfies
[TABLE]
where is a positive constant. ∎
Lemma 9.3**.**
Denote , and , then under conditions (C1)–(C3),
[TABLE]
where
[TABLE]
and recall that . In addition, satisfies
[TABLE]
Proof.
Denote . Mentioned that and
[TABLE]
we have that . In addition,
[TABLE]
By some algebra,
[TABLE]
Use the notations of , and , we have
[TABLE]
where
[TABLE]
It remains to show
[TABLE]
and
[TABLE]
For (A.11), it is a directly consequence of
[TABLE]
and
[TABLE]
For (A.12), as
[TABLE]
where , we have that
[TABLE]
Thus, we finish the proof of this lemma. ∎
Lemma 9.4**.**
Assume condition (C3) holds. Then for any positive integer satisfies as and , there exists a subset such that
[TABLE]
Proof.
We prove this lemma by contradiction.
First, we split the population into subsets, with the first subsets as and the last subset as . Here denotes the integer part of . Assume that (A.13) is not satisfied by any subset of with cardinality . Then
[TABLE]
for all . This implies
[TABLE]
as .
This result is contradicted with condition (C3). Thus, there exists at least one subset of with cardinality satisfies (A.13). ∎
Lemma 9.5**.**
Denote , and . Let be the cumulative distribution function (cdf) of under Poisson sampling, where . Then, under conditions (C1)–(C4),
[TABLE]
uniformly in , where
[TABLE]
and
[TABLE]
Proof.
According to Lemma 9.3, , where , , are defined in Lemma 9.3 and satisfies
[TABLE]
Thus, it suffices to show that
[TABLE]
where
[TABLE]
Define
[TABLE]
As is nonrandom and , by the fact that
[TABLE]
it is enough to prove that
[TABLE]
Denote and let be the characteristic function (c.f.) of , that is
[TABLE]
In addition, denote
[TABLE]
By Esseen’s smoothing lemma (Petrov, 1995, Theorem 5.1), for any arbitrary ,
[TABLE]
where is chosen to satisfy . Thus, it suffices to prove
[TABLE]
Recall , where , then
[TABLE]
where and
[TABLE]
Denote
[TABLE]
then
[TABLE]
So it is sufficient to show that
[TABLE]
A simple calculation yields , and
[TABLE]
This implies that for where as , . By the inequality that for any real number , we write
[TABLE]
According to Lemma 9.1 and Lemma 9.2,
[TABLE]
and
[TABLE]
for all |t|\leq\min\left(\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2},V_{N,Poi}^{3/2}/\left(4\nu_{N,Poi}^{(3)}\right)\right), where satisfies
[TABLE]
Recall that , it can be easily verified that
[TABLE]
for |t|\leq\min\left(b_{N}n_{0}^{1/2},\big{\{}\max_{1\leq i\leq N}E(X_{N,i}^{2}\mid\mathcal{F}_{N})\big{\}}^{-1/2},V_{N,Poi}^{3/2}/\left(4\nu_{N,Poi}^{(3)}\right)\right), where
[TABLE]
and is a positive constant.
Under the assumption that for a positive constant , we have . Then, . It follows that for ,
[TABLE]
It is obvious that
[TABLE]
it remains to establish
[TABLE]
Denote
[TABLE]
then and .
Take . According to Lemma 9.4, we assume that for sufficient large without loss of generality. Define
[TABLE]
By simple algebra,
[TABLE]
By the inequality that for all real , we have
[TABLE]
where is a positive constant. This clearly indicates that
[TABLE]
In view of the fact that are the only terms in that depend on , for a positive constant ,
[TABLE]
by condition (C4). In addition, there exists a constant such that
[TABLE]
Finally, for a positive constant ,
[TABLE]
as . Therefore, we finish the proof of this lemma. ∎
Proof of Theorem 3.1.
According to Lemma 3.1, . This cooperates with (C2) indicates that
[TABLE]
In addition, by (1) and (2) of Lemma 3.1, we can prove that
[TABLE]
Similarly, we can show that according to Lemma 3.1. Finally, by (C2) and Lemma 3.1,
[TABLE]
and
[TABLE]
Combine these results with Lemma 9.5 , we have proved Theorem 3.1.
∎
The proof of Theorem 3.2 uses the following lemma.
Lemma 9.6**.**
Let be a multinomial random vector with distribution , where and for . Denote to be the bootstrap finite population generated from the realized sample and the random vector with each indicates the number of replicates of in . Define as the number of distinct , in . Then, as ,
[TABLE]
where is the integer part of and is the probability measure for the first step of the proposed bootstrap method conditional on the realized sample .
Proof of Lemma 9.6.
We prove this lemma under two different case scenarios: and .
First, consider the case of , so we have that . By the strong law of large numbers, with probability . It suffices to show that
[TABLE]
Mentioned that
[TABLE]
where the last inequality is due to that for .
Next, consider the case of and it is sufficient to prove that
[TABLE]
When , and . Thus,
[TABLE]
which implies (A.20) immediately.
When , and . Using Stirling’s formula, there exists a positive constant such that
[TABLE]
where the second inequality uses the fact that . Mentioned that and , we have
[TABLE]
for . Thus,
[TABLE]
Finally,
[TABLE]
which finalize the proof of this lemma. ∎
Proof of Theorem 3.2.
We first show
[TABLE]
almost surely . Denote to be the event \big{\{}N^{-1}\lvert\sum_{i=1}^{N}(\pi_{N,i}^{-1}I_{N,i}-1)\lvert>\epsilon\big{\}}, where is a fixed positive number. Similar to the argument used in proving Lemma 3.1, we have
[TABLE]
where is a positive constant with respect to and . This immediately implies
[TABLE]
for any arbitrary positive and we have proved (A.21) by the Borel-Cantelli Lemma.
Next, for any ,
[TABLE]
where the first inequality holds by (C1) and the last inequality holds by (C3). Thus, by (A.22) and the Markov’s inequality, we have
[TABLE]
for . In addition, as
[TABLE]
we have
[TABLE]
In the first step of our proposed bootstrap method, are independently and identically distributed (i.i.d.) with \mathbb{P}_{*}(y_{N,i}^{*}=y_{N,j})=\rho_{N,j}=\pi_{N,j}^{-1}\big{(}\sum_{\ell=1}^{n}\pi_{N,\ell}^{-1}\big{)}^{-1}. Mentioned that, for a positive constant ,
[TABLE]
where the first equality holds by the property of the proposed bootstrap method, the inequality holds by (A.21) and (C1), and the last equality holds by (A.23). Thus, by (A.25) and Markov’s inequality, we have
[TABLE]
Similarly, we can prove that for any subset of the bootstrap finite population, say, and all ,
[TABLE]
here can be any positive integer less than .
Denote , then by Lemma 3.1, (A.21) and Condition (C2),
[TABLE]
in probability. In addition, for a positive constant ,
[TABLE]
where . Thus, we have
[TABLE]
Denote T_{N,Poi}^{*}=\big{(}\hat{V}_{N,Poi}^{*}\big{)}^{-1/2}\big{(}\hat{Y}_{N,Poi}^{*}-Y_{N}^{*}\big{)}, where ,
[TABLE]
and
[TABLE]
here is the bootstrap counterpart of and conditional on the bootstrap finite population .
In view of (A.26) and (A.28), by similar arguments used in the proof of Lemma 9.3, we can show that
[TABLE]
where
[TABLE]
and satisfies
[TABLE]
where is the counterpart of conditional on the bootstrap finite population .
Let be the cumulative distribution function of conditional on the bootstrap finite population , we proceed to prove
[TABLE]
uniformly in , where
[TABLE]
and
[TABLE]
Denote and let be the characteristic function (c.f.) of conditional on the bootstrap finite population , that is,
[TABLE]
Regarding the proof of Lemma 9.5, it is enough to show that
[TABLE]
for any arbitrary in order to finalize (9.1). Here is chosen to satisfy with defined as
[TABLE]
Denote and
[TABLE]
then , and
[TABLE]
Take , we prove (A.30) under two different case scenarios: and .
First, consider the case of , we have that . According to Lemma 9.6, we assume , without loss of generality. Then, by (A.28), condition (C5) and the fact that
[TABLE]
we arrive at
[TABLE]
By (A.27), we have that . Similar to the proof of Lemma 9.3, define
[TABLE]
then . Furthermore, for positive constants and ,
[TABLE]
in probability and this immediately implies
[TABLE]
Next, consider the case of . According to Lemma 9.6, we assume , without loss of generality. Then, use the same technique as (A.31), we can show that
[TABLE]
The following procedure is similar to the case of . Thus, we arrive at
[TABLE]
It remains to show that
[TABLE]
and
[TABLE]
Mentioned that
[TABLE]
and
[TABLE]
it suffices to prove that
[TABLE]
in probability conditional on the series of realized samples. For (A.32), it is a consequence result of and Lemma 3.1. For (A.33), consider
[TABLE]
almost surely, where the second equality holds by (A.21). Next, consider
[TABLE]
almost surely, where the second inequality holds by (A.21) and the last equality holds by (C3). By (2) in Lemma 3.1, (A.35) and (A.36), we have proved (A.33), and the proof of (A.34) is similar. This concludes the proof of Theorem 3.2. ∎
Lemma 9.7**.**
Let be pairwise distinct positive integers, which are no larger than . Suppose that (C6) holds. Under SRS, we have
[TABLE]
Proof of Lemma 9.7.
Consider
[TABLE]
where the last inequality holds by the fact that for . Thus, we have proved (A.37).
Denote to be the number of elements that equal to 1 in set . Under SRS, we have
[TABLE]
Under SRS, we have for . Consider
[TABLE]
where the last equality holds by the facts that and if . Thus, we have proved (A.38) by (A.42).
Consider
[TABLE]
which proves (A.39).
Similar with the case for two terms, we have the following results under SRS. That is,
[TABLE]
Consider
[TABLE]
After some algebra, the first three terms of (A.43) are
[TABLE]
Together with (A.43) and (A.44), we have
[TABLE]
where the last equality holds by (C6) and the fact that the second term in the first equality converges to 0. Thus, we have shown (A.40) by (A.45).
For four terms under SRS, we have
[TABLE]
Now, consider
[TABLE]
Consider
[TABLE]
where the last equality is valid by (C6). Together with (A.46) and (A.47), we have
[TABLE]
Thus, we have proved (A.41). ∎
Proof of Lemma 4.1.
Based on basic algebra and (C3), we can show (9), so the proof is omitted here.
Note that
[TABLE]
To show (10), it is enough to show that
[TABLE]
almost surely.
First, we show (A.48), and we have
[TABLE]
Based on (C6) and the proof of Lemma 3.1, it is enough to show that
[TABLE]
By some basic algebra and (C3), we have
[TABLE]
where and are defined similarly as for (A.4).
Consider
[TABLE]
where the last equality holds by Lemma 9.7, (A.4) and (A.50) to (A.52). Thus, we have proved (A.48). Similarly, we can prove (A.49).
Note that
[TABLE]
To show (11), by (A.48) and (A.49), it remains to show
[TABLE]
in probability.
Note that
[TABLE]
Consider
[TABLE]
where (A.54) holds by the sampling design, is the finite population variance of , and the second equality of (A.55) holds by (C3). Together with (A.54) and (A.55), we have proved (A.53). ∎
Proof of Theorem 4.2.
First, we show that
[TABLE]
Consider
[TABLE]
Together with (A.57) and (A.58), we have proved (A.56) using the Markov inequality.
By (C8), there exists a strongly non-latticed distribution such that
[TABLE]
as for . Next, we show that
[TABLE]
in probability as for , where is the bootstrap finite population.
By Euler’s formula, we have
[TABLE]
It is enough to show that
[TABLE]
We only show the result (A.61), and (A.62) can be obtained in a similar manner.
By the first step of the proposed bootstrap method, we have
[TABLE]
where the inequality of (A.64) holds by the negative correlation among and the fact that . Using similar argument in (A.54) and (A.55), we can have shown (A.61) by results in (A.63) and (A.64). By (A.61) and (A.62), we have proved (A.60).
Thus, by (C6), (A.56) and (A.60), we have
[TABLE]
almost surely conditional on the generated bootstrap finite population, where and are the bootstrap central third moment and variance.
Based on Lemma 4.1, it remains to show that
[TABLE]
in probability. By some algebra, it is equivalent to show
[TABLE]
in probability.
Consider
[TABLE]
where the last equality of (A.71) is derived by the Markov inequality and a similar procedure for (A.58). Thus, we have proved (A.67) by (A.70) and (A.71). Similarly, we can prove (A.68) and (A.69). Therefore, we have shown (A.65) and (A.66), which concludes the proof of Theorem 4.2. ∎
Proof of Lemma 5.1.
Mentioned that and for ; , we have
[TABLE]
for all positive , where the second equality holds by (C9), and the last equality holds by (C3).
By the strong law of large numbers,
[TABLE]
almost surely and
[TABLE]
almost surely.
Note that
[TABLE]
By (A.73) and (A.74), we have proved (14).
Notice that
[TABLE]
where , and the last equality of (A.75) holds by (C9) and (A.72). In addition, for , we have
[TABLE]
and
[TABLE]
By Markov’s inequality,
[TABLE]
for , from which we can prove that . Thus, we complete the proof of Lemma 5.1. ∎
Proof of Theorem 5.1.
Rewrite
[TABLE]
where . By (A.72), we have . Using the results of Hall (1987), as the distribution of is non-lattice, we have
[TABLE]
where and . Finally, according to Lemma 5.1, we have shown Theorem 5.1. ∎
Proof of Theorem 5.2.
Mentioned that for ; and are independent. For any such that ,
[TABLE]
Thus, by SLLN,
[TABLE]
with probability 1 for all .
In the first step of our proposed bootstrap method, are independently and identically distributed (i.i.d.) with \mathbb{P}_{*}(y_{N,i}^{*}=y_{N,a,j})=\rho_{N,j}=p_{N,a,j}^{-1}\big{(}\sum_{\ell=1}^{n}p_{N,a,\ell}^{-1}\big{)}^{-1} for ; . Let be the probability measure for the first step of the proposed bootstrap method conditional on the realized sample . Then, we have
[TABLE]
from which, we get that conditional on the series of realized samples,
[TABLE]
Recall that , where is the number of repetitions of the -th realized sample in the proposed bootstrap method. Next, we show
[TABLE]
Consider
[TABLE]
where the equality of (A.81) holds by (C9). By (A.80) and (A.81), we have shown (A.79) according to (A.76) with .
Similarly, we can show
[TABLE]
for .
Let be the quantities for the -th selected element from the bootstrap finite population . Denote for . Then \mathbb{P}_{PPS}^{*}\big{(}Z_{N,i}^{*}=C_{N}^{*}(p_{N,k}^{*})^{-1}y_{N,k}^{*}\big{)}=(C_{N}^{*})^{-1}p_{N,k}^{*} for ; , where is the counterpart of conditional on the bootstrap finite population .
Conditional on the bootstrap finite population , denote as the distribution of , where , and with .
Consider
[TABLE]
where the results of (A.83) and (A.84) are based on (A.72).
Recall that is the expectation with respect to the sampling design conditional on the bootstrap finite population and consists of copies of for . Consider
[TABLE]
where the fourth equality holds by (A.82), and last equality holds by Lemma 5.1, (C9), (A.83) and (A.84).
Consider the characteristic function of and . Specifically, the characteristic function of is
[TABLE]
and the characteristic function of , conditional on the bootstrap finite population , is
[TABLE]
To show that the distribution of is non-lattice in probability conditional on the bootstrap finite population , it is enough to show that, for any fixed ,
[TABLE]
in probability as . By remarking that
[TABLE]
First,
[TABLE]
in probability as . Second,
[TABLE]
It suffices to show that
[TABLE]
We can show (9.1) since (A.79) and (A.82) hold, and are independent and identically distributed random variables. Similarly, we can show (A.89). By (A.77), (C11), and the fact that the distribution of is non-lattice in probability, we have the following result by Hall (1987):
[TABLE]
uniformly in , where .
Based on a similar argument made for (A.85), we have
[TABLE]
Together with Lemma 5.1, (A.90) to (A.92) and (C10), we have proved Theorem 5.2. ∎
9.2 Design-unbiased estimates for the two-stage sampling designs
For the two-stage sampling designs in the second simulation study, Poisson sampling and PPS sampling are used in the first stage, and an SRS design is independently conducted within each selected cluster in the second stage. For the sampling designs in the first stage, denote to be the first-order inclusion probability for Poisson sampling, and to be the selection probability for PPS sampling. In this section, we comply to the notation convention in Section 1.2.8 of Fuller (2009) to discuss the variance estimation under the two-stage sampling designs.
For the two-stage sampling design, where Poisson sampling is applied in the first stage, the design-based estimator of is
[TABLE]
where is the index of the selected clusters in the first stage, is an design-unbiased estimate of the cluster total under the SRS design, and is the index set of the sample within the selected cluster indexed by . It can be shown that the same form holds when PPS sampling is used in the first stage.
First, we discuss the variance estimator of for the two-stage sampling design where Poisson sampling is used in the first stage. As shown in Section 1.2.8 by Fuller (2009), the variance of can be decomposed into two parts. That is,
[TABLE]
where and
Consider
[TABLE]
where the equality holds since the SRS design is independently conducted within each selected cluster, , is the finite population variance within the -th cluster, and is the finite population mean of the -th cluster. Since the sample variance is an unbiased estimator of , where is the estimated cluster mean, the first term of (A.93) can be estimated by
[TABLE]
where .
For the second term of (A.93), consider
[TABLE]
Since Poisson sampling is used in the first stage, we have
[TABLE]
which can be estimated by Notice that
[TABLE]
By (A.96) and (A.97) and the fact that is an unbiased estimator of , the second term of (A.93) can be estimated by
[TABLE]
By (A.95) and (A.98), the variance of can be estimated by
[TABLE]
when Poisson sampling is used in the first stage.
Next, we use variance decomposition (A.93) to derive the variance estimator of under the two-stage sampling design where PPS sampling is applied in the first stage. The result shown in (A.94) holds, and we can still use (A.95) to approximate .
Consider
[TABLE]
where the equality holds by the property of PPS sampling, and . Based on (A.97), we can estimate by
[TABLE]
Consider
[TABLE]
where . Thus, we can estimate by
[TABLE]
By (A.95), (A.99) and the two approximations above, we can obtain the variance estimate of by
[TABLE]
for the two-stage sampling design with PPS sampling is used in the first stage.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1)
- 2Antal and Tillé (2011) Antal, E. and Tillé, Y. (2011). A direct bootstrap method for complex sampling designs from a finite population, J. Amer. Statist. Assoc. 106 (494): 534–543.
- 3Athreya and Lahiri (2006) Athreya, K. B. and Lahiri, S. N. (2006). Measure Theory and Probability Theory , Springer Science & Business Media, New York.
- 4Babu and Singh (1984) Babu, G. J. and Singh, K. (1984). On one term edgeworth correction by efron’s bootstrap, Sankhya A 46 (2): 219–232.
- 5Babu and Singh (1985) Babu, G. J. and Singh, K. (1985). Edgeworth expansions for sampling without replacement from finite populations, J. Multivariate Anal. 17 (3): 261–278.
- 6Beaumont and Patak (2012) Beaumont, J. F. and Patak, Z. (2012). On the generalized bootstrap for sample surveys with special attention to Poisson sampling, Int. Stat. Rev. 80 (1): 127–148.
- 7Bickel and Freedman (1984) Bickel, P. J. and Freedman, D. A. (1984). Asymptotic normality and the bootstrap in stratified sampling, Ann. Statist. 12 (2): 470–482.
- 8Booth et al. (1994) Booth, J. G., Butler, R. W. and Hall, P. (1994). Bootstrap methods for finite populations, J. Amer. Statist. Assoc. 89 (428): 1282–1289.
