Distributionally Robust Policy Learning with Wasserstein Distance
Daido Kido

TL;DR
This paper introduces a distributionally robust approach to policy learning that accounts for population differences using Wasserstein distance, providing a new estimator that improves treatment rule performance in target populations.
Contribution
It develops a novel Wasserstein-distance-based distributionally robust estimator for individualized treatment rules that performs well even with limited or source population data.
Findings
The proposed estimator outperforms naive methods in target population scenarios.
The approach offers simple intuition and estimation procedures.
Theoretical performance guarantees are provided.
Abstract
The effects of treatments are often heterogeneous, depending on the observable characteristics, and it is necessary to exploit such heterogeneity to devise individualized treatment rules (ITRs). Existing estimation methods of such ITRs assume that the available experimental or observational data are derived from the target population in which the estimated policy is implemented. However, this assumption often fails in practice because of limited useful data. In this case, policymakers must rely on the data generated in the source population, which differs from the target population. Unfortunately, existing estimation methods do not necessarily work as expected in the new setting, and strategies that can achieve a reasonable goal in such a situation are required. This study examines the application of distributionally robust optimization (DRO), which formalizes an ambiguity about the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealth Systems, Economic Evaluations, Quality of Life · Advanced Causal Inference Techniques · Statistical Methods and Inference
