Empirical Bayes Predictive Density Estimation under Covariate Shift in Large Imbalanced Linear Mixed Models
Abir Sarkar, Gourab Mukherjee, Keisuke Yano

TL;DR
This paper develops a calibration framework for empirical Bayes predictive density estimation in large, imbalanced linear mixed models, addressing covariate shift and data scarcity.
Contribution
It introduces a novel risk estimation method using data fission, establishing convergence rates and demonstrating theoretical optimality.
Findings
The proposed estimator achieves near-optimal predictive KL risk.
Simulation results show robustness across various data scarcity levels.
Method effectively handles covariate shift in large imbalanced datasets.
Abstract
We study empirical Bayes (EB) predictive density estimation in linear mixed models (LMMs) with large number of units, which induce a high dimensional random effects space. Focusing on Kullback Leibler (KL) risk minimization, we develop a calibration framework to optimally tune predictive densities derived from on a broad class of flexible priors. Our proposed method addresses two key challenges in predictive inference: (a) severe data scarcity leading to highly imbalanced designs, in which replicates are available for only a small subset of units; and (b) distributional shifts in future covariates. To estimate predictive KL risk in LMMs, we use a data-fission approach that leverages exchangeability in the covariate distribution. We establish convergence rates for our proposed risk estimators and show how their efficiency deteriorates as data scarcity increases. Our results imply the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
