Hyperparameter Selection for Subsampling Bootstraps
Yingying Ma, Hansheng Wang

TL;DR
This paper introduces a theoretically grounded method for selecting hyperparameters in subsampling bootstrap techniques, significantly improving estimator efficiency without extra computational cost.
Contribution
It develops a hyperparameter selection approach based on theoretical analysis, optimizing the efficiency of subsampling estimators in massive data analysis.
Findings
Hyperparameter selection improves estimator efficiency.
The method is validated through simulations and real data.
No additional computational cost is required.
Abstract
Massive data analysis becomes increasingly prevalent, subsampling methods like BLB (Bag of Little Bootstraps) serves as powerful tools for assessing the quality of estimators for massive data. However, the performance of the subsampling methods are highly influenced by the selection of tuning parameters ( e.g., the subset size, number of resamples per subset ). In this article we develop a hyperparameter selection methodology, which can be used to select tuning parameters for subsampling methods. Specifically, by a careful theoretical analysis, we find an analytically simple and elegant relationship between the asymptotic efficiency of various subsampling estimators and their hyperparameters. This leads to an optimal choice of the hyperparameters. More specifically, for an arbitrarily specified hyperparameter set, we can improve it to be a new set of hyperparameters with no extra CPU…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Advanced Statistical Process Monitoring
