Joint integrative analysis of multiple data sources with correlated vector outcomes
Emily C. Hector, Peter X.-K. Song

TL;DR
This paper introduces a distributed quadratic inference framework for joint analysis of multiple data sources with correlated outcomes, improving efficiency and computational speed in estimating covariate effects.
Contribution
It develops a novel distributed estimation method combining quadratic inference functions and meta-estimation for correlated multi-source data analysis.
Findings
Method improves estimation efficiency.
Framework is computationally fast.
Successfully applied to multi-cohort study data.
Abstract
We propose a distributed quadratic inference function framework to jointly estimate regression parameters from multiple potentially heterogeneous data sources with correlated vector outcomes. The primary goal of this joint integrative analysis is to estimate covariate effects on all outcomes through a marginal regression model in a statistically and computationally efficient way. We develop a data integration procedure for statistical estimation and inference of regression parameters that is implemented in a fully distributed and parallelized computational scheme. To overcome computational and modeling challenges arising from the high-dimensional likelihood of the correlated vector outcomes, we propose to analyze each data source using Qu, Lindsay and Li (2000)'s quadratic inference functions, and then to jointly reestimate parameters from each data source by accounting for correlation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Statistical Methods and Inference · Advanced Causal Inference Techniques
