Regularized Bayesian transfer learning for population level etiological distributions
Abhirup Datta, Jacob Fiksel, Agbessi Amouzou, Scott Zeger

TL;DR
This paper introduces a Bayesian transfer learning method for estimating population-level cause-specific mortality fractions from verbal autopsy data, addressing small sample sizes and transfer errors with a novel shrinkage prior and ensemble approach.
Contribution
It proposes a hierarchical Bayesian framework with a new shrinkage prior for transfer error rates, enabling accurate population-level cause distribution estimation from limited data.
Findings
The method outperforms existing approaches in simulations and real data.
The ensemble model effectively selects the most accurate baseline classifier.
The R-package implementation facilitates practical application.
Abstract
Computer-coded verbal autopsy (CCVA) algorithms predict cause of death from high-dimensional family questionnaire data (verbal autopsies) of a deceased individual. CCVA algorithms are typically trained on non-local data, then used to generate national and regional estimates of cause-specific mortality fractions. These estimates may be inaccurate if the non-local training data is different from the local population of interest. This problem is a special case of transfer learning. However, most transfer learning classification approaches are concerned with individual (e.g. a person's) classification within a target domain (e.g. a particular population) with training performed in data from a source domain. Epidemiologists are often more interested in estimating population-level etiological distributions, using datasets much smaller than those used in common transfer learning applications.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
