Deep Multi-Agent Reinforcement Learning for Decentralized Active Hypothesis Testing
Hadar Szostak, Kobi Cohen

TL;DR
This paper introduces MARLA, a deep multi-agent reinforcement learning algorithm for decentralized active hypothesis testing, enabling collaborative decision-making among agents to minimize Bayes risk in noisy environments.
Contribution
The paper presents a novel deep multi-agent RL framework for decentralized hypothesis testing, demonstrating improved performance over single-agent methods and providing an open-source implementation.
Findings
MARLA effectively learns collaborative strategies among agents.
MARLA outperforms single-agent learning approaches.
Experimental results validate the approach's efficiency in reducing Bayes risk.
Abstract
We consider a decentralized formulation of the active hypothesis testing (AHT) problem, where multiple agents gather noisy observations from the environment with the purpose of identifying the correct hypothesis. At each time step, agents have the option to select a sampling action. These different actions result in observations drawn from various distributions, each associated with a specific hypothesis. The agents collaborate to accomplish the task, where message exchanges between agents are allowed over a rate-limited communications channel. The objective is to devise a multi-agent policy that minimizes the Bayes risk. This risk comprises both the cost of sampling and the joint terminal cost incurred by the agents upon making a hypothesis declaration. Deriving optimal structured policies for AHT problems is generally mathematically intractable, even in the context of a single agent.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Distributed Sensor Networks and Detection Algorithms · Auction Theory and Applications
