Lightning Does Not Strike Twice: Robust MDPs with Coupled Uncertainty
Shie Mannor (Technion), Ofir Mebel (Technion), Huan Xu (National, University of Singapore)

TL;DR
This paper introduces a novel approach to robust Markov decision processes by modeling coupled uncertainties with a bounded deviation concept, leading to less conservative solutions and practical algorithms for optimal control.
Contribution
It presents a new coupled uncertainty model called 'Lightning Does not Strike Twice' and develops tractable algorithms for optimal policies under this model.
Findings
Probabilistic guarantees for real-life applicability
Less conservative solutions compared to uncoupled models
Efficient algorithms for coupled uncertainty control
Abstract
We consider Markov decision processes under parameter uncertainty. Previous studies all restrict to the case that uncertainties among different states are uncoupled, which leads to conservative solutions. In contrast, we introduce an intuitive concept, termed "Lightning Does not Strike Twice," to model coupled uncertain parameters. Specifically, we require that the system can deviate from its nominal parameters only a bounded number of times. We give probabilistic guarantees indicating that this model represents real life situations and devise tractable algorithms for computing optimal control policies using this concept.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Risk and Portfolio Optimization · Advanced Bandit Algorithms Research
