Generative Emergent Communication: Large Language Model is a Collective World Model

Tadahiro Taniguchi; Ryo Ueda; Tomoaki Nakamura; Masahiro Suzuki; Akira Taniguchi

arXiv:2501.00226·cs.AI·July 17, 2025·2 cites

Generative Emergent Communication: Large Language Model is a Collective World Model

Tadahiro Taniguchi, Ryo Ueda, Tomoaki Nakamura, Masahiro Suzuki, Akira Taniguchi

PDF

Open Access

TL;DR

This paper introduces the Collective World Model hypothesis and a generative emergent communication framework, explaining how large language models implicitly learn societal-level representations through language as a collective inference process.

Contribution

It formalizes the Generative EmCom framework and connects it to world models and multi-agent learning, offering a new interpretation of LLM capabilities.

Findings

01

Provides a mathematical explanation for LLM capabilities

02

Models language emergence as decentralized Bayesian inference

03

Explains distributional semantics through societal representation reconstruction

Abstract

Large Language Models (LLMs) have demonstrated a remarkable ability to capture extensive world knowledge, yet how this is achieved without direct sensorimotor experience remains a fundamental puzzle. This study proposes a novel theoretical solution by introducing the Collective World Model hypothesis. We argue that an LLM does not learn a world model from scratch; instead, it learns a statistical approximation of a collective world model that is already implicitly encoded in human language through a society-wide process of embodied, interactive sense-making. To formalize this process, we introduce generative emergent communication (Generative EmCom), a framework built on the Collective Predictive Coding (CPC). This framework models the emergence of language as a process of decentralized Bayesian inference over the internal states of multiple agents. We argue that this process…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and cultural evolution