Multi-Objective Approaches to Markov Decision Processes with Uncertain   Transition Parameters

Dimitri Scheftelowitsch; Peter Buchholz; Vahid Hashemi; Holger; Hermanns

arXiv:1710.08986·cs.AI·October 26, 2017

Multi-Objective Approaches to Markov Decision Processes with Uncertain Transition Parameters

Dimitri Scheftelowitsch, Peter Buchholz, Vahid Hashemi, Holger, Hermanns

PDF

TL;DR

This paper explores multi-objective optimization for Markov decision processes with uncertain parameters, aiming to find policies that perform well across various scenarios rather than optimizing for worst-case or average-case alone.

Contribution

It introduces methods for computing Pareto optimal policies in bounded-parameter MDPs considering multiple performance scenarios simultaneously.

Findings

01

Developed algorithms for Pareto optimal policy computation.

02

Analyzed worst, best, and average case performances together.

03

Demonstrated effectiveness on bounded-parameter MDPs.

Abstract

Markov decision processes (MDPs) are a popular model for performance analysis and optimization of stochastic systems. The parameters of stochastic behavior of MDPs are estimates from empirical observations of a system; their values are not known precisely. Different types of MDPs with uncertain, imprecise or bounded transition rates or probabilities and rewards exist in the literature. Commonly, analysis of models with uncertainties amounts to searching for the most robust policy which means that the goal is to generate a policy with the greatest lower bound on performance (or, symmetrically, the lowest upper bound on costs). However, hedging against an unlikely worst case may lead to losses in other situations. In general, one is interested in policies that behave well in all situations which results in a multi-objective view on decision making. In this paper, we consider policies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.