IMAS$^2$: Joint Agent Selection and Information-Theoretic Coordinated Perception In Dec-POMDPs
Chongyang Shi, Wesley A. Suttle, Michael Dorothy, Jie Fu

TL;DR
This paper introduces IMAS$^2$, a novel method for joint agent selection and perception policy synthesis in Dec-POMDPs using information-theoretic metrics, with theoretical guarantees and practical validation.
Contribution
It proposes a two-layer optimization framework leveraging mutual information and submodularity to efficiently select sensors and synthesize policies in multi-agent systems.
Findings
The approach maximizes mutual information for active perception tasks.
The IMAS$^2$ algorithm guarantees a (1 - 1/e) approximation ratio.
Demonstrated effectiveness in a grid-world perception task.
Abstract
We study the problem of jointly selecting sensing agents and synthesizing decentralized active perception policies for the chosen subset of agents within a Decentralized Partially Observable Markov Decision Process (Dec-POMDP) framework. Our approach employs a two-layer optimization structure. In the inner layer, we introduce information-theoretic metrics, defined by the mutual information between the unknown trajectories or some hidden property in the environment and the collective partial observations in the multi-agent system, as a unified objective for active perception problems. We employ various optimization methods to obtain optimal sensor policies that maximize mutual information for distinct active perception tasks. In the outer layer, we prove that under certain conditions, the information-theoretic objectives are monotone and submodular with respect to the subset of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Distributed Control Multi-Agent Systems · Distributed Sensor Networks and Detection Algorithms
