Why stratification may hurt, & how much

Chris A.J. Klaassen; Andries J. Lenstra

arXiv:0802.2087·stat.ME·February 15, 2008

Why stratification may hurt, & how much

Chris A.J. Klaassen, Andries J. Lenstra

PDF

Open Access

TL;DR

This paper investigates conditions where stratified sampling can be less effective than simple random sampling, providing bounds and explanations for this counterintuitive phenomenon.

Contribution

It offers a detailed analysis of when and why stratification may be detrimental, including bounds and theoretical insights into its potential drawbacks.

Findings

01

Stratification can sometimes worsen sampling efficiency.

02

Bounds for the extent of stratification's negative impact are provided.

03

An explanation for the conditions leading to stratification's failure is given.

Abstract

There are circumstances under which stratified sampling is worse than simple random sampling, even if the allocation of the sample sizes is optimal. This phenomenon was discovered more than sixty years ago, but is not as widely known as one might expect. We provide it with lower and upper bounds for its badness as well as with an explanation.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Statistical Process Monitoring · Advanced Statistical Methods and Models · Machine Learning and Algorithms