Multi-agent active perception with prediction rewards

Mikko Lauri; Frans A. Oliehoek

arXiv:2010.11835 · cs.AI · October 24, 2020 · 1 citation

TL;DR

This paper models multi-agent active perception as a Dec-POMDP with prediction rewards. It bounds the loss caused by decentralizing the prediction reward and shows that existing Dec-POMDP algorithms can then be applied, which improves the scalability of planning in decentralized observation tasks.

Contribution

The paper introduces a Dec-POMDP formulation in which each agent takes an individual prediction action. It bounds the loss due to decentralization and makes existing Dec-POMDP algorithms applicable to active perception.

Findings

1. Bounded loss due to decentralization
2. Application of Dec-POMDP algorithms improves scalability
3. Empirical results show enhanced planning efficiency

Abstract

Multi-agent active perception is a task where a team of agents cooperatively gathers observations to compute a joint estimate of a hidden variable. The task is decentralized and the joint estimate can only be computed after the task ends by fusing observations of all agents. The objective is to maximize the accuracy of the estimate. The accuracy is quantified by a centralized prediction reward determined by a centralized decision-maker who perceives the observations gathered by all agents after the task ends. In this paper, we model multi-agent active perception as a decentralized partially observable Markov decision process (Dec-POMDP) with a convex centralized prediction reward. We prove that by introducing individual prediction actions for each agent, the problem is converted into a standard Dec-POMDP with a decentralized prediction reward. The loss due to decentralization is…
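The fusion step and the convex prediction reward described in the abstract can be made concrete with a small sketch. Everything here is an illustrative assumption rather than the authors' implementation: the helper names (`fuse`, `neg_entropy`, `expected_log_score`) are hypothetical, the agents' observations are assumed conditionally independent given the hidden variable, the likelihood numbers are made up, and negative entropy is used only as one familiar example of a convex function of the belief. The last helper scores a fixed prediction with the expected log score, which is linear in the belief and never exceeds negative entropy (Gibbs' inequality). How this relates to the paper's prediction actions is not stated in the abstract, so treat that link as a guess.

```python
import math

def fuse(prior, likelihoods):
    """Bayes fusion: posterior is proportional to the prior times each
    agent's observation likelihood (assumes conditionally independent
    observations given the hidden variable)."""
    unnorm = list(prior)
    for lik in likelihoods:
        unnorm = [u * l for u, l in zip(unnorm, lik)]
    total = sum(unnorm)
    return [u / total for u in unnorm]

def neg_entropy(belief):
    """Negative Shannon entropy, a convex function of the belief.
    Higher values mean a more certain (peaked) estimate."""
    return sum(p * math.log(p) for p in belief if p > 0.0)

def expected_log_score(belief, prediction):
    """Expected log score of a fixed prediction under the belief.
    Linear in the belief; assumes the prediction is strictly positive
    wherever the belief is."""
    return sum(b * math.log(q) for b, q in zip(belief, prediction) if b > 0.0)

# Hypothetical two-state hidden variable and two agents.
prior = [0.5, 0.5]
lik_agent_1 = [0.8, 0.3]  # P(agent 1's observation | each hidden state)
lik_agent_2 = [0.7, 0.4]  # P(agent 2's observation | each hidden state)

# Centralized fusion after the task ends.
posterior = fuse(prior, [lik_agent_1, lik_agent_2])

print("posterior:", posterior)
print("reward before observing:", neg_entropy(prior))
print("reward after fusion:", neg_entropy(posterior))
print("score of predicting the prior:", expected_log_score(posterior, prior))
```

Under these assumed numbers the fused belief moves from [0.5, 0.5] to roughly [0.82, 0.18], and the negative-entropy reward rises accordingly. Predicting the stale prior scores lower than predicting the fused posterior itself, which is the sense in which such a reward favors accurate estimates.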

Code & Models

Repositories

laurimi/multiagent-prediction-reward (official)

Videos

Multi-agent active perception with prediction rewards · SlidesLive

Taxonomy

Topics: Reinforcement Learning in Robotics · Auction Theory and Applications · Game Theory and Applications