What is the best predictor that you can compute in five minutes using a given Bayesian hierarchical model?
Jonathan R. Bradley

TL;DR
This paper introduces a method to determine the best possible Bayesian hierarchical model prediction achievable within five minutes by calibrating the model to computational constraints, facilitating analysis of large datasets.
Contribution
It proposes a data subset model framework that calibrates Bayesian models to computational time limits without restrictive assumptions, enabling efficient predictions on big data.
Findings
The data subset model is proper and semi-parametric.
Subsets of normally distributed data are asymptotically partially sufficient.
Simulation results show the impact of computer speed on analysis.
Abstract
The goal of this paper is to provide a way for statisticians to answer the question posed in the title of this article using any Bayesian hierarchical model of their choosing and without imposing additional restrictive model assumptions. We are motivated by the fact that the rise of ``big data'' has created difficulties for statisticians to directly apply their methods to big datasets. We introduce a ``data subset model'' to the popular ``data model, process model, and parameter model'' framework used to summarize Bayesian hierarchical models. The hyperparameters of the data subset model are specified constructively in that they are chosen such that the implied size of the subset satisfies pre-defined computational constraints. Thus, these hyperparameters effectively calibrates the statistical model to the computer itself to obtain predictions/estimations in a pre-specified amount of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoil Geostatistics and Mapping · Statistical Methods and Inference · Statistical Methods and Bayesian Inference
