Planning under periodic observations: bounds and bounding-based   solutions

Federico Rossi; Dylan Shell

arXiv:2208.03351·cs.RO·August 9, 2022

Planning under periodic observations: bounds and bounding-based solutions

Federico Rossi, Dylan Shell

PDF

Open Access

TL;DR

This paper addresses planning for robots with intermittent information updates, proposing bounds and bounding-based methods to efficiently solve complex decision problems with practical applications like planetary exploration.

Contribution

It introduces a new subclass of planning problems with periodic information updates and develops systematic performance bounds for these challenging Markov Decision Processes.

Findings

01

Performance bounds improve solution quality

02

Bounding-based methods are effective empirically

03

Time until information is revealed is a key factor

Abstract

We study planning problems faced by robots operating in uncertain environments with incomplete knowledge of state, and actions that are noisy and/or imprecise. This paper identifies a new problem sub-class that models settings in which information is revealed only intermittently through some exogenous process that provides state information periodically. Several practical domains fit this model, including the specific scenario that motivates our research: autonomous navigation of a planetary exploration rover augmented by remote imaging. With an eye to efficient specialized solution methods, we examine the structure of instances of this sub-class. They lead to Markov Decision Processes with exponentially large action-spaces but for which, as those actions comprise sequences of more atomic elements, one may establish performance bounds by comparing policies under different information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Machine Learning and Algorithms · Reinforcement Learning in Robotics