Risk Aware and Multi-Objective Decision Making with Distributional Monte   Carlo Tree Search

Conor F. Hayes; Mathieu Reymond; Diederik M. Roijers; Enda Howley,; Patrick Mannion

arXiv:2102.00966·cs.LG·February 3, 2021·6 cites

Risk Aware and Multi-Objective Decision Making with Distributional Monte Carlo Tree Search

Conor F. Hayes, Mathieu Reymond, Diederik M. Roijers, Enda Howley,, Patrick Mannion

PDF

Open Access

TL;DR

This paper introduces Distributional Monte Carlo Tree Search, an algorithm that models the distribution of future returns to improve decision-making in risk-sensitive and multi-objective reinforcement learning scenarios, especially when only a single trial is possible.

Contribution

The paper presents a novel distributional approach to Monte Carlo Tree Search that captures the entire distribution of returns, enhancing risk-aware and multi-objective decision-making capabilities.

Findings

01

Outperforms state-of-the-art in multi-objective RL for expected utility.

02

Effectively models the distribution over future returns.

03

Improves decision quality in risk-sensitive applications.

Abstract

In many risk-aware and multi-objective reinforcement learning settings, the utility of the user is derived from the single execution of a policy. In these settings, making decisions based on the average future returns is not suitable. For example, in a medical setting a patient may only have one opportunity to treat their illness. When making a decision, just the expected return -- known in reinforcement learning as the value -- cannot account for the potential range of adverse or positive outcomes a decision may have. Our key insight is that we should use the distribution over expected future returns differently to represent the critical information that the agent requires at decision time. In this paper, we propose Distributional Monte Carlo Tree Search, an algorithm that learns a posterior distribution over the utility of the different possible returns attainable from individual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Simulation Techniques and Applications