Exploiting Submodular Value Functions For Scaling Up Active Perception

Yash Satsangi; Shimon Whiteson; Frans A. Oliehoek; Matthijs T. J.; Spaan

arXiv:2009.09696·cs.AI·September 22, 2020

Exploiting Submodular Value Functions For Scaling Up Active Perception

Yash Satsangi, Shimon Whiteson, Frans A. Oliehoek, Matthijs T. J., Spaan

PDF

TL;DR

This paper introduces a scalable planning approach for active perception tasks by exploiting submodular value functions, enabling efficient decision-making in complex sensor environments with theoretical guarantees and empirical validation.

Contribution

It presents a novel greedy PBVI method that leverages submodularity to improve scalability and provides theoretical bounds on the value function's approximation error.

Findings

01

Achieves similar performance to existing methods

02

Reduces computational cost significantly

03

Provides theoretical guarantees under submodularity

Abstract

In active perception tasks, an agent aims to select sensory actions that reduce its uncertainty about one or more hidden variables. While partially observable Markov decision processes (POMDPs) provide a natural model for such problems, reward functions that directly penalize uncertainty in the agent's belief can remove the piecewise-linear and convex property of the value function required by most POMDP planners. Furthermore, as the number of sensors available to the agent grows, the computational cost of POMDP planning grows exponentially with it, making POMDP planning infeasible with traditional methods. In this article, we address a twofold challenge of modeling and planning for active perception tasks. We show the mathematical equivalence of $ρ$ POMDP and POMDP-IR, two frameworks for modeling active perception tasks, that restore the PWLC property of the value function. To…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.