Planning under periodic observations: bounds and bounding-based solutions
Federico Rossi, Dylan Shell

TL;DR
This paper addresses planning for robots with intermittent information updates, proposing bounds and bounding-based methods to efficiently solve complex decision problems with practical applications like planetary exploration.
Contribution
It introduces a new subclass of planning problems with periodic information updates and develops systematic performance bounds for these challenging Markov Decision Processes.
Findings
Performance bounds improve solution quality
Bounding-based methods are effective empirically
Time until information is revealed is a key factor
Abstract
We study planning problems faced by robots operating in uncertain environments with incomplete knowledge of state, and actions that are noisy and/or imprecise. This paper identifies a new problem sub-class that models settings in which information is revealed only intermittently through some exogenous process that provides state information periodically. Several practical domains fit this model, including the specific scenario that motivates our research: autonomous navigation of a planetary exploration rover augmented by remote imaging. With an eye to efficient specialized solution methods, we examine the structure of instances of this sub-class. They lead to Markov Decision Processes with exponentially large action-spaces but for which, as those actions comprise sequences of more atomic elements, one may establish performance bounds by comparing policies under different information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Machine Learning and Algorithms · Reinforcement Learning in Robotics
