Searching for Structure: Investigating Emergent Communication with Large Language Models
Tom Kouwenhoven, Max Peeperkorn, Tessa Verhoef

TL;DR
This study explores whether large language models can develop structured artificial languages through simulated communication, revealing that structural properties emerge and evolve over generations, akin to human language development.
Contribution
It demonstrates that LLMs can induce structure in artificial languages through simulated communication, providing a new tool for studying language evolution.
Findings
Languages become more structured over generations
Transmission can lead to degenerate vocabularies
LLMs can simulate aspects of language evolution
Abstract
Human languages have evolved to be structured through repeated language learning and use. These processes introduce biases that operate during language acquisition and shape linguistic systems toward communicative efficiency. In this paper, we investigate whether the same happens if artificial languages are optimised for implicit biases of Large Language Models (LLMs). To this end, we simulate a classical referential game in which LLMs learn and use artificial languages. Our results show that initially unstructured holistic languages are indeed shaped to have some structural properties that allow two LLM agents to communicate successfully. Similar to observations in human experiments, generational transmission increases the learnability of languages, but can at the same time result in non-humanlike degenerate vocabularies. Taken together, this work extends experimental findings, shows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution
