Optimal Sensor and Actuator Selection for Factored Markov Decision Processes: Complexity, Approximability and Algorithms
Jayanth Bhargav, Mahsa Ghasemi, Shreyas Sundaram

TL;DR
This paper investigates the complexity of selecting sensors and actuators in factored MDPs and POMDPs, proving NP-hardness, and evaluates greedy algorithms showing their practical effectiveness despite theoretical limitations.
Contribution
It establishes NP-hardness and inapproximability results for sensor and actuator selection in factored MDPs and POMDPs, and demonstrates the practical performance of greedy algorithms.
Findings
Sensor and actuator selection problems are NP-hard to approximate.
Greedy algorithms often perform well in practice despite theoretical limitations.
Explicit examples show failure cases of greedy algorithms.
Abstract
Factored Markov Decision Processes (fMDPs) are a class of Markov Decision Processes (MDPs) in which the states (and actions) can be factored into a set of state (and action) variables and can be encoded compactly using a factored representation. In this paper, we consider a setting where the state of the fMDP is not directly observable, and the agent relies on a set of potential sensors to gather information. Each sensor has a selection cost and the designer must select a subset of sensors under a limited budget. We formulate the problem of selecting a set of sensors for fMDPs (under a budget) to maximize the infinite-horizon discounted return provided by the optimal policy. We show the fundamental result that it is NP-hard to approximate this problem to within any non-trivial factor. Our inapproximability results for optimal sensor selection also extend to a general class of Partially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Fault Detection and Control Systems
MethodsSparse Evolutionary Training
