Exploiting Submodular Value Functions For Scaling Up Active Perception
Yash Satsangi, Shimon Whiteson, Frans A. Oliehoek, Matthijs T. J., Spaan

TL;DR
This paper introduces a scalable planning approach for active perception tasks by exploiting submodular value functions, enabling efficient decision-making in complex sensor environments with theoretical guarantees and empirical validation.
Contribution
It presents a novel greedy PBVI method that leverages submodularity to improve scalability and provides theoretical bounds on the value function's approximation error.
Findings
Achieves similar performance to existing methods
Reduces computational cost significantly
Provides theoretical guarantees under submodularity
Abstract
In active perception tasks, an agent aims to select sensory actions that reduce its uncertainty about one or more hidden variables. While partially observable Markov decision processes (POMDPs) provide a natural model for such problems, reward functions that directly penalize uncertainty in the agent's belief can remove the piecewise-linear and convex property of the value function required by most POMDP planners. Furthermore, as the number of sensors available to the agent grows, the computational cost of POMDP planning grows exponentially with it, making POMDP planning infeasible with traditional methods. In this article, we address a twofold challenge of modeling and planning for active perception tasks. We show the mathematical equivalence of POMDP and POMDP-IR, two frameworks for modeling active perception tasks, that restore the PWLC property of the value function. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
