Risk-Averse Stochastic Shortest Path Planning
Mohamadreza Ahmadi, Anushri Dixit, Joel W. Burdick, and Aaron D. Ames

TL;DR
This paper introduces a risk-averse approach to stochastic shortest path planning in MDPs, using a nested risk functional and a DCP-based computational method to find optimal policies that account for rare but critical events.
Contribution
It develops a novel framework for risk-averse shortest path planning with theoretical guarantees and a new computational technique using difference convex programs.
Findings
Existence of optimal stationary Markovian policies under risk aversion.
A new DCP-based method for computing risk-averse policies.
Successful illustration with rover navigation using CVaR and EVaR measures.
Abstract
We consider the stochastic shortest path planning problem in MDPs, i.e., the problem of designing policies that ensure reaching a goal state from a given initial state with minimum accrued cost. In order to account for rare but important realizations of the system, we consider a nested dynamic coherent risk total cost functional rather than the conventional risk-neutral total expected cost. Under some assumptions, we show that optimal, stationary, Markovian policies exist and can be found via a special Bellman's equation. We propose a computational technique based on difference convex programs (DCPs) to find the associated value functions and therefore the risk-averse policies. A rover navigation MDP is used to illustrate the proposed methodology with conditional-value-at-risk (CVaR) and entropic-value-at-risk (EVaR) coherent risk measures.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
