Fused mean structure learning in data integration with dependence
Emily C. Hector

TL;DR
This paper introduces a novel data fusion method for integrating multiple studies with dependent outcomes, improving flexibility, statistical accuracy, and computational efficiency, especially in neuroimaging data analysis.
Contribution
It proposes a new quadratic inference-based fusion approach with a pairwise penalty, along with theoretical properties and an efficient meta-estimator for data integration.
Findings
Enhanced flexibility and performance over existing methods
Theoretical guarantees for the estimator's properties
Successful application to neuroimaging data with an available R package
Abstract
Motivated by image-on-scalar regression with data aggregated across multiple sites, we consider a setting in which multiple independent studies each collect multiple dependent vector outcomes, with potential mean model parameter homogeneity between studies and outcome vectors. To determine the validity of jointly analyzing these data sources, we must learn which of these data sources share mean model parameters. We propose a new model fusion approach that delivers improved flexibility, statistical performance and computational speed over existing methods. Our proposed approach specifies a quadratic inference function within each data source and fuses mean model parameter vectors in their entirety based on a new formulation of a pairwise fusion penalty. We establish theoretical properties of our estimator and propose an asymptotically equivalent weighted oracle meta-estimator that is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Functional Brain Connectivity Studies · Domain Adaptation and Few-Shot Learning
