Enhancing Inference for Small Cohorts via Transfer Learning and Weighted Integration of Multiple Datasets

Subharup Guha; Mengqi Xu; Yi Li

arXiv:2505.07153·stat.ME·April 17, 2026

Enhancing Inference for Small Cohorts via Transfer Learning and Weighted Integration of Multiple Datasets

Subharup Guha, Mengqi Xu, Yi Li

PDF

TL;DR

This paper introduces TRANSLATE, a novel transfer learning weighting method that improves inference accuracy for small regional cohorts by integrating multiple datasets while accounting for heterogeneity.

Contribution

The paper proposes TRANSLATE, a new weighting approach that aligns external datasets with the target cohort, providing theoretical guarantees and practical improvements in sepsis outcome analysis.

Findings

01

TRANSLATE improves inference precision in simulations.

02

Application to Northeast sepsis data demonstrates enhanced accuracy.

03

Method effectively accounts for regional heterogeneity.

Abstract

Lung sepsis remains a significant concern in the Northeastern U.S., yet the national eICU Collaborative Database includes only a small number of patients from this region, highlighting underrepresentation. Understanding clinical variables such as FiO2, creatinine, platelets, and lactate, which reflect oxygenation, kidney function, coagulation, and metabolism, is crucial because these markers influence sepsis outcomes and may vary by sex. Transfer learning helps address small sample sizes by borrowing information from larger datasets, although differences in covariates and outcome-generating mechanisms between the target and external cohorts can complicate the process. We propose a novel weighting method, TRANSfer LeArning wiTh wEights (TRANSLATE), to integrate data from various sources by incorporating domain-specific characteristics through learned weights that align external data with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.