Optimal Sensor and Actuator Selection for Factored Markov Decision   Processes: Complexity, Approximability and Algorithms

Jayanth Bhargav; Mahsa Ghasemi; Shreyas Sundaram

arXiv:2407.07310·eess.SY·February 26, 2025

Optimal Sensor and Actuator Selection for Factored Markov Decision Processes: Complexity, Approximability and Algorithms

Jayanth Bhargav, Mahsa Ghasemi, Shreyas Sundaram

PDF

Open Access

TL;DR

This paper investigates the complexity of selecting sensors and actuators in factored MDPs and POMDPs, proving NP-hardness, and evaluates greedy algorithms showing their practical effectiveness despite theoretical limitations.

Contribution

It establishes NP-hardness and inapproximability results for sensor and actuator selection in factored MDPs and POMDPs, and demonstrates the practical performance of greedy algorithms.

Findings

01

Sensor and actuator selection problems are NP-hard to approximate.

02

Greedy algorithms often perform well in practice despite theoretical limitations.

03

Explicit examples show failure cases of greedy algorithms.

Abstract

Factored Markov Decision Processes (fMDPs) are a class of Markov Decision Processes (MDPs) in which the states (and actions) can be factored into a set of state (and action) variables and can be encoded compactly using a factored representation. In this paper, we consider a setting where the state of the fMDP is not directly observable, and the agent relies on a set of potential sensors to gather information. Each sensor has a selection cost and the designer must select a subset of sensors under a limited budget. We formulate the problem of selecting a set of sensors for fMDPs (under a budget) to maximize the infinite-horizon discounted return provided by the optimal policy. We show the fundamental result that it is NP-hard to approximate this problem to within any non-trivial factor. Our inapproximability results for optimal sensor selection also extend to a general class of Partially…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Fault Detection and Control Systems

MethodsSparse Evolutionary Training