Transfer Learning of CATE with Kernel Ridge Regression
Seok-Jin Kim, Hongjie Liu, Molei Liu, Kaizheng Wang

TL;DR
This paper introduces a novel transfer learning method for estimating conditional average treatment effects using kernel ridge regression, effectively handling covariate shifts and limited overlaps between source and target populations.
Contribution
The paper proposes an overlap-adaptive transfer learning approach for CATE using KRR, with a novel data partitioning and model selection strategy, supported by theoretical guarantees.
Findings
Achieves superior finite-sample efficiency in simulations
Demonstrates robustness to weak overlaps and complex CATE functions
Effectively applied to a 401(k) eligibility dataset
Abstract
The proliferation of data has sparked significant interest in leveraging findings from one study to estimate treatment effects in a different target population without direct outcome observations. However, the transfer learning process is frequently hindered by substantial covariate shift and limited overlap between (i) the source and target populations, as well as (ii) the treatment and control groups within the source. We propose a novel method for overlap-adaptive transfer learning of conditional average treatment effect (CATE) using kernel ridge regression (KRR). Our approach involves partitioning the labeled source data into two subsets. The first one is used to train candidate CATE models based on regression adjustment and pseudo-outcomes. An optimal model is then selected using the second subset and unlabeled target data, employing another pseudo-outcome-based strategy. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing
