Reinforcement learning in signaling game

Yilei Hu; Brian Skyrms; Pierre Tarr\`es

arXiv:1103.5818·math.PR·March 31, 2011·28 cites

Reinforcement learning in signaling game

Yilei Hu, Brian Skyrms, Pierre Tarr\`es

PDF

Open Access

TL;DR

This paper extends the analysis of a signaling game to more general settings, proving convergence of expected payoff and characterizing the structure of limit configurations in complex signaling scenarios.

Contribution

It generalizes previous models by analyzing signaling games with arbitrary numbers of states, signals, and acts, establishing convergence and structural properties of the resulting communication patterns.

Findings

01

Expected payoff converges almost surely.

02

A limit bipartite graph with no synonym or bottleneck emerges.

03

Any such graph can be a limit configuration with positive probability.

Abstract

We consider a signaling game originally introduced by Skyrms, which models how two interacting players learn to signal each other and thus create a common language. The first rigorous analysis was done by Argiento, Pemantle, Skyrms and Volkov (2009) with 2 states, 2 signals and 2 acts. We study the case of M_1 states, M_2 signals and M_1 acts for general M_1, M_2. We prove that the expected payoff increases in average and thus converges a.s., and that a limit bipartite graph emerges, such that no signal-state correspondence is associated to both a synonym and an informational bottleneck. Finally, we show that any graph correspondence with the above property is a limit configuration with positive probability.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Computability, Logic, AI Algorithms · Opinion Dynamics and Social Influence