Improving prediction models by incorporating external data with weights based on similarity
Max Behrens, Maryam Farhadizadeh, Angelika Rohde, Alexander R\"uhle,, Nils H. Nicolay, Harald Binder, Daniela Z\"oller

TL;DR
This paper introduces a novel weighting method that combines data set and observation similarities to improve prediction models using external data, especially in small clinical datasets with heterogeneity.
Contribution
The paper proposes a new approach that integrates data set and observation weights based on similarity, enhancing prediction accuracy in multi-center clinical studies.
Findings
Improved prediction performance when external data sets are similar.
Method effectively quantifies similarity between data sets.
Application to radiotherapy dose prediction demonstrates practical utility.
Abstract
In clinical settings, we often face the challenge of building prediction models based on small observational data sets. For example, such a data set might be from a medical center in a multi-center study. Differences between centers might be large, thus requiring specific models based on the data set from the target center. Still, we want to borrow information from the external centers, to deal with small sample sizes. There are approaches that either assign weights to each external data set or each external observation. To incorporate information on differences between data sets and observations, we propose an approach that combines both into weights that can be incorporated into a likelihood for fitting regression models. Specifically, we suggest weights at the data set level that incorporate information on how well the models that provide the observation weights distinguish between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
