Hybrid Information-driven Multi-agent Reinforcement Learning
William A. Dawson, Ruben Glatt, Edward Rusu, Braden C. Soper, Ryan A., Goldhahn

TL;DR
This paper proposes a hybrid approach combining information theory and reinforcement learning to improve the efficiency of multi-agent systems in large, sparse state spaces, showing significant exploration improvements.
Contribution
It introduces a novel hybrid MARL framework that uses information theoretic heuristics to guide exploration and learning in resource-constrained multi-agent environments.
Findings
Agents are approximately 1000 times more efficient at exploring sparse state spaces.
The approach effectively integrates information heuristics with RL for better exploration.
Preliminary results indicate promising directions for scalable multi-agent control.
Abstract
Information theoretic sensor management approaches are an ideal solution to state estimation problems when considering the optimal control of multi-agent systems, however they are too computationally intensive for large state spaces, especially when considering the limited computational resources typical of large-scale distributed multi-agent systems. Reinforcement learning (RL) is a promising alternative which can find approximate solutions to distributed optimal control problems that take into account the resource constraints inherent in many systems of distributed agents. However, the RL training can be prohibitively inefficient, especially in low-information environments where agents receive little to no feedback in large portions of the state space. We propose a hybrid information-driven multi-agent reinforcement learning (MARL) approach that utilizes information theoretic models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Distributed Control Multi-Agent Systems
