Conditional Value-at-Risk for Reachability and Mean Payoff in Markov   Decision Processes

Jan K\v{r}et\'insk\'y; Tobias Meggendorfer

arXiv:1805.02946·cs.LO·May 9, 2018

Conditional Value-at-Risk for Reachability and Mean Payoff in Markov Decision Processes

Jan K\v{r}et\'insk\'y, Tobias Meggendorfer

PDF

TL;DR

This paper explores the application of Conditional Value-at-Risk (CVaR) in Markov decision processes to quantify and manage risk in reachability and mean-payoff objectives, providing complexity bounds and strategy characterizations.

Contribution

It introduces CVaR constraints into MDPs, analyzes their computational complexity, and characterizes the structure of optimal strategies with respect to memory and randomization.

Findings

01

Derived bounds on decision problem complexity.

02

Characterized strategy structures for CVaR constraints.

03

Extended analysis to conjunctions with expectation and VaR constraints.

Abstract

We present the conditional value-at-risk (CVaR) in the context of Markov chains and Markov decision processes with reachability and mean-payoff objectives. CVaR quantifies risk by means of the expectation of the worst p-quantile. As such it can be used to design risk-averse systems. We consider not only CVaR constraints, but also introduce their conjunction with expectation constraints and quantile constraints (value-at-risk, VaR). We derive lower and upper bounds on the computational complexity of the respective decision problems and characterize the structure of the strategies in terms of memory and randomization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.