Sensor Synthesis for POMDPs with Reachability Objectives

Krishnendu Chatterjee; Martin Chmelik; Ufuk Topcu

arXiv:1710.00675·cs.AI·October 3, 2017

Sensor Synthesis for POMDPs with Reachability Objectives

Krishnendu Chatterjee, Martin Chmelik, Ufuk Topcu

PDF

TL;DR

This paper introduces a symbolic SAT-based method for synthesizing minimal additional sensors in POMDPs to ensure almost-sure reachability with small-memory policies, reducing sensor complexity without increasing policy complexity.

Contribution

It presents the first approach to synthesize the weakest sensors in POMDPs for reachability objectives, with a formal NP-complete analysis and practical implementation.

Findings

01

Significant reduction in the number of sensors needed in classical POMDP examples.

02

The approach maintains policy complexity while decreasing sensor requirements.

03

The problem is shown to be NP-complete, guiding the design of the algorithm.

Abstract

Partially observable Markov decision processes (POMDPs) are widely used in probabilistic planning problems in which an agent interacts with an environment using noisy and imprecise sensors. We study a setting in which the sensors are only partially defined and the goal is to synthesize "weakest" additional sensors, such that in the resulting POMDP, there is a small-memory policy for the agent that almost-surely (with probability~1) satisfies a reachability objective. We show that the problem is NP-complete, and present a symbolic algorithm by encoding the problem into SAT instances. We illustrate trade-offs between the amount of memory of the policy and the number of additional sensors on a simple example. We have implemented our approach and consider three classical POMDP examples from the literature, and show that in all the examples the number of sensors can be significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.