The Shrinking Landscape of Linguistic Diversity in the Age of Large Language Models
Zhivar Sourati, Farzan Karimi-Malekabadi, Meltem Ozcan, Colin, McDaniel, Alireza Ziabari, Jackson Trager, Ala Tak, Meng Chen, Fred, Morstatter, Morteza Dehghani

TL;DR
This paper investigates how the widespread use of large language models (LLMs) reduces linguistic diversity, homogenizes writing styles, and risks societal and cultural insights, with four studies demonstrating these effects across various contexts.
Contribution
It provides empirical evidence that LLMs contribute to linguistic homogenization and biases, highlighting societal and cultural risks not previously quantified.
Findings
LLMs homogenize writing styles and amplify biases
Linguistic diversity declines with increased LLM usage
Homogenization impacts societal, diagnostic, and cultural processes
Abstract
Language is far more than a communication tool. A wealth of information - including but not limited to the identities, psychological states, and social contexts of its users - can be gleaned through linguistic markers, and such insights are routinely leveraged across diverse fields ranging from product development and marketing to healthcare. In four studies utilizing experimental and observational methods, we demonstrate that the widespread adoption of large language models (LLMs) as writing assistants is linked to notable declines in linguistic diversity and may interfere with the societal and psychological insights language provides. We show that while the core content of texts is retained when LLMs polish and rewrite texts, not only do they homogenize writing styles, but they also alter stylistic elements in a way that selectively amplifies certain dominant characteristics or biases…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultilingual Education and Policy · Second Language Learning and Teaching · Gender Studies in Language
