Risk-Averse Decision Making Under Uncertainty
Mohamadreza Ahmadi, Ugo Rosolia, Michel D. Ingham, Richard M. Murray,, and Aaron D. Ames

TL;DR
This paper develops a novel framework for designing risk-averse policies in MDPs and POMDPs using difference convex programming, addressing practical mission-critical scenarios with large deviation considerations.
Contribution
It introduces a unified optimization approach for risk-averse decision making in MDPs and POMDPs using DCP and DCCP frameworks, extending traditional expected cost methods.
Findings
Formulates risk-averse policies as DCPs for MDPs.
Extends the framework to POMDPs with belief-based policies.
Provides a policy iteration algorithm for risk-averse FSCs.
Abstract
A large class of decision making under uncertainty problems can be described via Markov decision processes (MDPs) or partially observable MDPs (POMDPs), with application to artificial intelligence and operations research, among others. Traditionally, policy synthesis techniques are proposed such that a total expected cost or reward is minimized or maximized. However, optimality in the total expected cost sense is only reasonable if system behavior in the large number of runs is of interest, which has limited the use of such policies in practical mission-critical scenarios, wherein large deviations from the expected behavior may lead to mission failure. In this paper, we consider the problem of designing policies for MDPs and POMDPs with objectives and constraints in terms of dynamic coherent risk measures, which we refer to as the constrained risk-averse problem. For MDPs, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReliability and Maintenance Optimization
