Policy Learning with Distributional Welfare
Yifan Cui, Sukjin Han

TL;DR
This paper proposes a novel approach to treatment allocation that targets distributional welfare using conditional quantiles of treatment effects, offering robustness to model uncertainty and accommodating different policymaker preferences.
Contribution
It introduces a new policy framework based on quantile of treatment effects, with minimax robustness, extending welfare analysis beyond average effects.
Findings
Develops policies targeting distributional welfare using QoTE.
Establishes asymptotic regret bounds for proposed policies.
Provides a flexible framework applicable to various welfare functionals.
Abstract
In this paper, we explore optimal treatment allocation policies that target distributional welfare. Most literature on treatment choice has considered utilitarian welfare based on the conditional average treatment effect (ATE). While average welfare is intuitive, it may yield undesirable allocations especially when individuals are heterogeneous (e.g., with outliers) - the very reason individualized treatments were introduced in the first place. This observation motivates us to propose an optimal policy that allocates the treatment based on the conditional quantile of individual treatment effects (QoTE). Depending on the choice of the quantile probability, this criterion can accommodate a policymaker who is either prudent or negligent. The challenge of identifying the QoTE lies in its requirement for knowledge of the joint distribution of the counterfactual outcomes, which is not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHealthcare Policy and Management
