Constructing Effective Personalized Policies Using Counterfactual Inference from Biased Data Sets with Many Features
Onur Atan, William R. Zame, Qiaojun Feng, Mihaela van der Schaar

TL;DR
This paper introduces a new method for creating personalized decision policies from biased, high-dimensional data lacking counterfactuals, by estimating propensity scores, inferring counterfactuals, and selecting relevant features.
Contribution
The paper presents a novel approach that effectively handles bias and high-dimensional features in constructing personalized policies using counterfactual inference.
Findings
Significant performance improvement over state-of-the-art algorithms
Effective feature selection for each action and instance
Robustness across diverse application settings
Abstract
This paper proposes a novel approach for constructing effective personalized policies when the observed data lacks counter-factual information, is biased and possesses many features. The approach is applicable in a wide variety of settings from healthcare to advertising to education to finance. These settings have in common that the decision maker can observe, for each previous instance, an array of features of the instance, the action taken in that instance, and the reward realized -- but not the rewards of actions that were not taken: the counterfactual information. Learning in such settings is made even more difficult because the observed data is typically biased by the existing policy (that generated the data) and because the array of features that might affect the reward in a particular instance -- and hence should be taken into account in deciding on an action in each particular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
