Robust Bayesian Inference for Big Data: Combining Sensor-based Records with Traditional Survey Data
Ali Rafei, Carol A. C. Flannagan, Brady T. West, Michael R. Elliott

TL;DR
This paper introduces a robust Bayesian method that combines sensor data with survey data to address bias in large non-probability samples, utilizing Bayesian additive regression trees for flexible modeling and uncertainty quantification.
Contribution
It develops a double robust Bayesian approach using non-parametric models to improve bias correction in big data with unknown selection mechanisms.
Findings
Effective bias adjustment demonstrated on driving data
Bayesian additive regression trees capture non-linear effects
Uncertainty quantification through posterior predictive draws
Abstract
Big Data often presents as massive non-probability samples. Not only is the selection mechanism often unknown, but larger data volume amplifies the relative contribution of selection bias to total error. Existing bias adjustment approaches assume that the conditional mean structures have been correctly specified for the selection indicator or key substantive measures. In the presence of a reference probability sample, these methods rely on a pseudo-likelihood method to account for the sampling weights of the reference sample, which is parametric in nature. Under a Bayesian framework, handling the sampling weights is an even bigger hurdle. To further protect against model misspecification, we expand the idea of double robustness such that more flexible non-parametric methods, as well as Bayesian models, can be used for prediction. In particular, we employ Bayesian additive regression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Forecasting Techniques and Applications · Statistical Methods and Bayesian Inference
