Risk-Averse Decision Making Under Uncertainty

Mohamadreza Ahmadi; Ugo Rosolia; Michel D. Ingham; Richard M. Murray,; and Aaron D. Ames

arXiv:2109.04082·cs.AI·September 10, 2021·1 cites

Risk-Averse Decision Making Under Uncertainty

Mohamadreza Ahmadi, Ugo Rosolia, Michel D. Ingham, Richard M. Murray,, and Aaron D. Ames

PDF

Open Access

TL;DR

This paper develops a novel framework for designing risk-averse policies in MDPs and POMDPs using difference convex programming, addressing practical mission-critical scenarios with large deviation considerations.

Contribution

It introduces a unified optimization approach for risk-averse decision making in MDPs and POMDPs using DCP and DCCP frameworks, extending traditional expected cost methods.

Findings

01

Formulates risk-averse policies as DCPs for MDPs.

02

Extends the framework to POMDPs with belief-based policies.

03

Provides a policy iteration algorithm for risk-averse FSCs.

Abstract

A large class of decision making under uncertainty problems can be described via Markov decision processes (MDPs) or partially observable MDPs (POMDPs), with application to artificial intelligence and operations research, among others. Traditionally, policy synthesis techniques are proposed such that a total expected cost or reward is minimized or maximized. However, optimality in the total expected cost sense is only reasonable if system behavior in the large number of runs is of interest, which has limited the use of such policies in practical mission-critical scenarios, wherein large deviations from the expected behavior may lead to mission failure. In this paper, we consider the problem of designing policies for MDPs and POMDPs with objectives and constraints in terms of dynamic coherent risk measures, which we refer to as the constrained risk-averse problem. For MDPs, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReliability and Maintenance Optimization