Lewis's Signaling Game as beta-VAE For Natural Word Lengths and Segments
Ryo Ueda, Tadahiro Taniguchi

TL;DR
This paper reinterprets Lewis's signaling game as a beta-VAE, showing how prior distributions influence emergent language properties like word length and segmentation, aligning them more closely with natural language laws.
Contribution
It reformulates the signaling game as a beta-VAE, clarifies the role of priors, and demonstrates how choosing appropriate priors leads to emergent languages with natural statistical properties.
Findings
Appropriate priors lead to emergent languages following Zipf's law of abbreviation.
Proper priors result in segments that align with Harris's articulation scheme.
Conventional objectives hinder emergent languages from exhibiting natural language properties.
Abstract
As a sub-discipline of evolutionary and computational linguistics, emergent communication (EC) studies communication protocols, called emergent languages, arising in simulations where agents communicate. A key goal of EC is to give rise to languages that share statistical properties with natural languages. In this paper, we reinterpret Lewis's signaling game, a frequently used setting in EC, as beta-VAE and reformulate its objective function as ELBO. Consequently, we clarify the existence of prior distributions of emergent languages and show that the choice of the priors can influence their statistical properties. Specifically, we address the properties of word lengths and segmentation, known as Zipf's law of abbreviation (ZLA) and Harris's articulation scheme (HAS), respectively. It has been reported that the emergent languages do not follow them when using the conventional objective.…
Peer Reviews
Decision·ICLR 2024 poster
1. The author well introduces the problem, which is well motivated. 2. The authors provide a deteailed proof in supplementary material.
(1) The paper is hard to read. The theoretical section includes symbols and equations without full explanation. (2) The experiments are weak. The compared methods lack descriptions, and the performance improvement is not well explained. (3) The studied problem lacks audience in the community.
1. The originality of this paper is good. The authors propose a generative point of view of the signaling game and analyze the possible causing factors of the current problems of the emerging less meaningful linguistic properties using the conventional objectives. This can provide a fresh study framework for emergent communication. The rigorous formalization and mathematical equations can integrate previous designs of regularizers and help with future objective design, offering a valuable contri
No obvious weaknesses.
- (major) The paper aims at re-analyzing a common EC setting in a more formalized way, yielding the potential for theoretical insights that would not otherwise be possible. - (major) Furthermore, I think this analysis is largely in the correct direction with analyzing the signalling game as a VAE, looking at inductive biases, and tying in linguistic concepts like Zipf's Law of Abbreviation and Harris's Articulation Scheme (although this wide scope is also a bit of concern; cf. "Weakn
- (major) A critical part of the paper is the "prior" within a VAE or signalling game, but I did not get a concrete sense of what this prior actually is in the context of a signalling game with neural network-based agents (I expand on this in "Questions"). As a result, it makes me unsure how well the theoretical claims actually apply to a real setup. - (major) The paper, I think, tried to do too much, and ends up not spending enough time on the core claims, namely, the signalling game
Code & Models
Videos
Taxonomy
TopicsLanguage and cultural evolution · Evolutionary Algorithms and Applications · DNA and Biological Computing
MethodsBeta-VAE
