Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

Liwei Jiang; Yuanjun Chai; Margaret Li; Mickel Liu; Raymond Fok; Nouha Dziri; Yulia Tsvetkov; Maarten Sap; Alon Albalak; Yejin Choi

arXiv:2510.22954·cs.CL·October 28, 2025

Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)

Liwei Jiang, Yuanjun Chai, Margaret Li, Mickel Liu, Raymond Fok, Nouha Dziri, Yulia Tsvetkov, Maarten Sap, Alon Albalak, Yejin Choi

PDF

TL;DR

This paper introduces Infinity-Chat, a large-scale dataset of diverse open-ended user queries, revealing a tendency of language models to produce similar outputs, and provides insights into human preferences and model calibration issues.

Contribution

The paper presents the first comprehensive taxonomy for open-ended prompts, a large-scale dataset with human annotations, and a study revealing the Artificial Hivemind effect in language models.

Findings

01

Language models show intra-model repetition and inter-model homogeneity.

02

Models and reward systems are less calibrated to human preferences.

03

Infinity-Chat enables systematic study of real-world open-ended queries.

Abstract

Language models (LMs) often struggle to generate diverse, human-like creative content, raising concerns about the long-term homogenization of human thought through repeated exposure to similar outputs. Yet scalable methods for evaluating LM output diversity remain limited, especially beyond narrow tasks such as random number or name generation, or beyond repeated sampling from a single model. We introduce Infinity-Chat, a large-scale dataset of 26K diverse, real-world, open-ended user queries that admit a wide range of plausible answers with no single ground truth. We introduce the first comprehensive taxonomy for characterizing the full spectrum of open-ended prompts posed to LMs, comprising 6 top-level categories (e.g., brainstorm & ideation) that further breaks down to 17 subcategories. Using Infinity-Chat, we present a large-scale study of mode collapse in LMs, revealing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.