Lightning Does Not Strike Twice: Robust MDPs with Coupled Uncertainty

Shie Mannor (Technion); Ofir Mebel (Technion); Huan Xu (National; University of Singapore)

arXiv:1206.4643·cs.LG·June 22, 2012·25 cites

Lightning Does Not Strike Twice: Robust MDPs with Coupled Uncertainty

Shie Mannor (Technion), Ofir Mebel (Technion), Huan Xu (National, University of Singapore)

PDF

Open Access

TL;DR

This paper introduces a novel approach to robust Markov decision processes by modeling coupled uncertainties with a bounded deviation concept, leading to less conservative solutions and practical algorithms for optimal control.

Contribution

It presents a new coupled uncertainty model called 'Lightning Does not Strike Twice' and develops tractable algorithms for optimal policies under this model.

Findings

01

Probabilistic guarantees for real-life applicability

02

Less conservative solutions compared to uncoupled models

03

Efficient algorithms for coupled uncertainty control

Abstract

We consider Markov decision processes under parameter uncertainty. Previous studies all restrict to the case that uncertainties among different states are uncoupled, which leads to conservative solutions. In contrast, we introduce an intuitive concept, termed "Lightning Does not Strike Twice," to model coupled uncertain parameters. Specifically, we require that the system can deviate from its nominal parameters only a bounded number of times. We give probabilistic guarantees indicating that this model represents real life situations and devise tractable algorithms for computing optimal control policies using this concept.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Risk and Portfolio Optimization · Advanced Bandit Algorithms Research