Training and Testing with Multiple Splits: A Central Limit Theorem for Split-Sample Estimators
Bruno Fava

TL;DR
This paper introduces a new statistical inference method for split-sample estimators that improves accuracy and reproducibility by leveraging multiple data splits and addressing dependence issues, applicable to complex models.
Contribution
It develops a central limit theorem for split-sample estimators under general conditions, enabling valid inference without restrictions on model complexity or convergence rates.
Findings
Confidence intervals are valid for many applications.
New method accounts for dependence across splits.
Improved power in economic prediction problems.
Abstract
As predictive algorithms grow in popularity, using the same dataset to both train and test a new model has become routine across research, policy, and industry. Sample-splitting attains valid inference on model properties by using separate subsamples to estimate the model and to evaluate it. However, this approach has two drawbacks, since each task uses only part of the data, and different splits can lead to widely different estimates. Averaging across multiple splits, I develop an inference approach that uses more data for training, uses the entire sample for testing, and improves reproducibility. I address the statistical dependence from reusing observations across splits by proving a new central limit theorem for a large class of split-sample estimators under arguably mild and general conditions. Importantly, I make no restrictions on model complexity or convergence rates. I show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods and Bayesian Inference · Statistical Methods and Inference
