A blind spot for large language models: Supradiegetic linguistic   information

Julia Witte Zimmerman; Denis Hudon; Kathryn Cramer; Jonathan St. Onge,; Mikaela Fudolig; Milo Z. Trujillo; Christopher M. Danforth; Peter Sheridan; Dodds

arXiv:2306.06794·cs.CL·May 17, 2024·2 cites

A blind spot for large language models: Supradiegetic linguistic information

Julia Witte Zimmerman, Denis Hudon, Kathryn Cramer, Jonathan St. Onge,, Mikaela Fudolig, Milo Z. Trujillo, Christopher M. Danforth, Peter Sheridan, Dodds

PDF

Open Access

TL;DR

This paper explores the limitations of large language models by examining their exposure to linguistic information, distinguishing between diegetic and supradiegetic aspects, and analyzing how this affects their handling of certain linguistic and symbolic tasks.

Contribution

It introduces the concept of supradiegetic linguistic information and analyzes its impact on LLM capabilities, providing a new perspective on their limitations beyond traditional linguistic features.

Findings

01

LLMs struggle with palindromes and visual symbol characteristics.

02

They have difficulty translating Sumerian cuneiform.

03

Challenges in continuing integer sequences due to supradiegetic information.

Abstract

Large Language Models (LLMs) like ChatGPT reflect profound changes in the field of Artificial Intelligence, achieving a linguistic fluency that is impressively, even shockingly, human-like. The extent of their current and potential capabilities is an active area of investigation by no means limited to scientific researchers. It is common for people to frame the training data for LLMs as "text" or even "language". We examine the details of this framing using ideas from several areas, including linguistics, embodied cognition, cognitive science, mathematics, and history. We propose that considering what it is like to be an LLM like ChatGPT, as Nagel might have put it, can help us gain insight into its capabilities in general, and in particular, that its exposure to linguistic training data can be productively reframed as exposure to the diegetic information encoded in language, and its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Language and cultural evolution