The Signaler-Responder Game: Learning to Communicate using Thompson Sampling
Radhika Bhuckory, Bhaskar Krishnamachari

TL;DR
This paper introduces a distributed Bayesian learning approach using Thompson Sampling for agents in a signaler-responder game to learn effective communication and cooperation strategies without pre-programmed rules.
Contribution
It presents two novel Thompson Sampling-based algorithms enabling heterogeneous agents to learn optimal communication strategies in a multi-agent game setting.
Findings
Agents adapt strategies over time to maximize rewards.
Communication occurs only when beneficial, reducing unnecessary costs.
The algorithms converge to efficient equilibria under various conditions.
Abstract
We are interested in studying how heterogeneous agents can learn to communicate and cooperate with each other without being explicitly pre-programmed to do so. Motivated by this goal, we present and analyze a distributed solution to a two-player signaler-responder game which is defined as follows. The signaler agent has a random, exogenous need and can choose from four different strategies: never signal, always signal, signal when need, and signal when no need. The responder agent can choose to either ignore or respond to the signal. We define a reward to both agents when they cooperate to satisfy the signaler's need, and costs associated with communication, response and unmet needs. We identify pure Nash equilibria of the game and the conditions under which they occur. As a solution for this game, we propose two new distributed Bayesian learning algorithms, one for each agent, based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms
