CLT and Edgeworth Expansion for m-out-of-n Bootstrap Estimators of The Studentized Median
Imon Banerjee, Sayak Chakrabarty

TL;DR
This paper rigorously analyzes the m-out-of-n bootstrap for sample quantiles, establishing CLTs, Edgeworth expansions, and Berry-Esseen bounds, with applications to robust inference in complex stochastic models.
Contribution
It provides the first parameter-free CLT and Edgeworth expansion for m-out-of-n bootstrap estimators of the median, with tight assumptions and practical applications.
Findings
Proves a data-driven CLT for bootstrap quantile estimators.
Derives an Edgeworth expansion with explicit convergence rates.
Demonstrates applicability to Markov chain-based statistics.
Abstract
The m-out-of-n bootstrap, originally proposed by Bickel, Gotze, and Zwet (1992), approximates the distribution of a statistic by repeatedly drawing m subsamples (with m much smaller than n) without replacement from an original sample of size n. It is now routinely used for robust inference with heavy-tailed data, bandwidth selection, and other large-sample applications. Despite its broad applicability across econometrics, biostatistics, and machine learning, rigorous parameter-free guarantees for the soundness of the m-out-of-n bootstrap when estimating sample quantiles have remained elusive. This paper establishes such guarantees by analyzing the estimator of sample quantiles obtained from m-out-of-n resampling of a dataset of size n. We first prove a central limit theorem for a fully data-driven version of the estimator that holds under a mild moment condition and involves no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Statistical Methods and Inference · Machine Learning and Algorithms
