Policy learning for many outcomes of interest: Combining optimal policy   trees with multi-objective Bayesian optimisation

Patrick Rehill; Nicholas Biddle

arXiv:2212.06312·cs.LG·October 18, 2023

Policy learning for many outcomes of interest: Combining optimal policy trees with multi-objective Bayesian optimisation

Patrick Rehill, Nicholas Biddle

PDF

Open Access 1 Repo

TL;DR

This paper introduces Multi-Objective Policy Learning (MOPoL), a method combining optimal decision trees with multi-objective Bayesian optimisation to effectively explore trade-offs between multiple policy outcomes, demonstrated on a healthcare case study.

Contribution

It presents a novel approach that efficiently models trade-offs in policy outcomes using Pareto frontiers and surrogate models, enhancing multi-objective policy decision-making.

Findings

01

Successfully applied to anti-malarial medication rationing in Kenya

02

Efficiently approximates optimal trees with low-cost greedy trees

03

Provides a Pareto frontier of trade-offs between outcomes

Abstract

Methods for learning optimal policies use causal machine learning models to create human-interpretable rules for making choices around the allocation of different policy interventions. However, in realistic policy-making contexts, decision-makers often care about trade-offs between outcomes, not just single-mindedly maximising utility for one outcome. This paper proposes an approach termed Multi-Objective Policy Learning (MOPoL) which combines optimal decision trees for policy learning with a multi-objective Bayesian optimisation approach to explore the trade-off between multiple outcomes. It does this by building a Pareto frontier of non-dominated models for different hyperparameter settings which govern outcome weighting. The key here is that a low-cost greedy tree can be an accurate proxy for the very computationally costly optimal tree for the purposes of making decisions which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pbrehill/optimalpolicymobo
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Water resources management and optimization · Advanced Multi-Objective Optimization Algorithms