Empirical Bayes for Data Integration

Paul Rognon-Vael; David Rossell

arXiv:2508.08336·stat.ME·February 6, 2026

Empirical Bayes for Data Integration

Paul Rognon-Vael, David Rossell

PDF

Open Access

TL;DR

This paper explores empirical Bayes methods for data integration in transfer learning, demonstrating their advantages in variable selection consistency, convergence speed, and practical performance improvements over full Bayesian approaches.

Contribution

It develops a computational framework for empirical Bayes in data integration, showing its theoretical benefits and practical effectiveness in high-dimensional settings.

Findings

01

Empirical Bayes achieves consistent variable selection under weaker conditions.

02

It attains faster convergence rates than full Bayes.

03

Data integration with empirical Bayes provides meaningful practical improvements.

Abstract

We discuss the use of empirical Bayes for data integration, in the sense of transfer learning. Our main interest is in settings where one wishes to learn structure (e.g. feature selection) and one only has access to incomplete data from previous studies, such as summaries, estimates or lists of relevant features. We discuss differences between full Bayes and empirical Bayes, and develop a computational framework for the latter. We discuss how empirical Bayes attains consistent variable selection under weaker conditions (sparsity and betamin assumptions) than full Bayes and other standard criteria do, and how it attains faster convergence rates. Our high-dimensional regression examples show that fully Bayesian inference enjoys excellent properties, and that data integration with empirical Bayes can offer moderate yet meaningful improvements in practice.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference