Context-Based Concurrent Experience Sharing in Multiagent Systems
Dan Garant, Bruno da Silva, Victor Lesser, Chongjie Zhang

TL;DR
This paper presents a scalable, context-based experience sharing method for multi-agent systems that enhances learning efficiency and performance in large, dynamic environments through adaptive, low-overhead communication.
Contribution
It introduces an online, distributed, supervisor-directed technique for identifying contexts and facilitating experience sharing among agents, improving scalability and robustness.
Findings
Significant performance improvements in large-scale distributed tasks.
Robustness to noise and suboptimal context features.
Communication costs scale linearly with supervisor-to-agent ratio.
Abstract
One of the key challenges for multi-agent learning is scalability. In this paper, we introduce a technique for speeding up multi-agent learning by exploiting concurrent and incremental experience sharing. This solution adaptively identifies opportunities to transfer experiences between agents and allows for the rapid acquisition of appropriate policies in large-scale, stochastic, homogeneous multi-agent systems. We introduce an online, distributed, supervisor-directed transfer technique for constructing high-level characterizations of an agent's dynamic learning environment---called contexts---which are used to identify groups of agents operating under approximately similar dynamics within a short temporal window. A set of supervisory agents computes contextual information for groups of subordinate agents, thereby identifying candidates for experience sharing. Our method uses a tiered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Smart Grid Energy Management
