Multi-source Learning for Target Population by High-dimensional Calibration
Haoxiang Zhan, Jae Kwang Kim, Yumou Qiu

TL;DR
This paper introduces a high-dimensional calibration method for multi-source learning that effectively combines heterogeneous datasets to estimate target population parameters with improved efficiency and robustness.
Contribution
It proposes a novel high-dimensional debiased calibration (HDC) method and a multi-source HDC (MHDC) estimator that support flexible models and enhance efficiency in multi-source data integration.
Findings
MHDC estimator achieves asymptotic normality.
It outperforms existing doubly robust estimators in simulations.
The method demonstrates practical utility in meteorological data analysis.
Abstract
Multi-source learning is an emerging area of research in statistics, where information from multiple datasets with heterogeneous distributions is combined to estimate the parameter of interest for a target population without observed responses. We propose a high-dimensional debiased calibration (HDC) method and a multi-source HDC (MHDC) estimator for general estimating equations. The HDC method uses a novel approach to achieve Neyman orthogonality for the target parameter via high-dimensional covariate balancing on an augmented set of covariates. It avoids the augmented inverse probability weighting formulation and leads to an easier optimization algorithm for the target parameter in estimating equations and M-estimation. The proposed MHDC estimator integrates multi-source data while supporting flexible specifications for both density ratio and outcome regression models, achieving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Survey Sampling and Estimation Techniques · Statistical Methods and Inference
