Policy Transforms and Learning Optimal Policies
Thomas M. Russell

TL;DR
This paper develops a framework for learning optimal policy rules in uncertain, possibly incomplete models, providing theoretical guarantees for policy learnability both before and after data observation, with applications to discrete choice and program evaluation.
Contribution
It introduces a novel approach to assess and learn optimal policies under model uncertainty, including conditions for learnability and guarantees for ex-ante and ex-post policy performance.
Findings
Sufficient conditions for policy learnability are established.
The approach applies to non-parametric and semiparametric models.
Illustrations include discrete choice and program evaluation examples.
Abstract
We study the problem of choosing optimal policy rules in uncertain environments using models that may be incomplete and/or partially identified. We consider a policymaker who wishes to choose a policy to maximize a particular counterfactual quantity called a policy transform. We characterize learnability of a set of policy options by the existence of a decision rule that closely approximates the maximin optimal value of the policy transform with high probability. Sufficient conditions are provided for the existence of such a rule. However, learnability of an optimal policy is an ex-ante notion (i.e. before observing a sample), and so ex-post (i.e. after observing a sample) theoretical guarantees for certain policy rules are also provided. Our entire approach is applicable when the distribution of unobservables is not parametrically specified, although we discuss how semiparametric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Water resources management and optimization · Advanced Multi-Objective Optimization Algorithms
