Two steps to risk sensitivity

Chris Gagne; Peter Dayan

arXiv:2111.06803·cs.AI·November 15, 2021

Two steps to risk sensitivity

Chris Gagne, Peter Dayan

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores risk sensitivity in decision making using distributional reinforcement learning, focusing on CVaR, and introduces methods to ensure time consistency for better modeling of human and animal behavior.

Contribution

It reanalyzes human decision-making with CVaR, identifies issues with risk aversion, and proposes alternative CVaR forms that are time consistent.

Findings

01

Revealed hidden risk aversion in human choices using CVaR analysis.

02

Showed that certain CVaR forms lack time consistency, affecting modeling accuracy.

03

Simulations demonstrated differences in planning implications between CVaR variants.

Abstract

Distributional reinforcement learning (RL) -- in which agents learn about all the possible long-term consequences of their actions, and not just the expected value -- is of great recent interest. One of the most important affordances of a distributional view is facilitating a modern, measured, approach to risk when outcomes are not completely certain. By contrast, psychological and neuroscientific investigations into decision making under risk have utilized a variety of more venerable theoretical models such as prospect theory that lack axiomatically desirable properties such as coherence. Here, we consider a particularly relevant risk measure for modeling human and animal planning, called conditional value-at-risk (CVaR), which quantifies worst-case outcomes (e.g., vehicle accidents or predation). We first adopt a conventional distributional approach to CVaR in a sequential setting and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

crgagne/twosteps_neurips2021
noneOfficial

Videos

Two steps to risk sensitivity· slideslive

Taxonomy

TopicsDecision-Making and Behavioral Economics · Health Systems, Economic Evaluations, Quality of Life · Explainable Artificial Intelligence (XAI)