Provable Limitations of Acquiring Meaning from Ungrounded Form: What   Will Future Language Models Understand?

William Merrill; Yoav Goldberg; Roy Schwartz; Noah A. Smith

arXiv:2104.10809·cs.CL·June 23, 2021

Provable Limitations of Acquiring Meaning from Ungrounded Form: What Will Future Language Models Understand?

William Merrill, Yoav Goldberg, Roy Schwartz, Noah A. Smith

PDF

TL;DR

This paper investigates the theoretical limits of ungrounded language models in acquiring true semantic understanding, showing that assertions alone are insufficient for full semantic emulation, especially in complex language classes.

Contribution

The paper provides a formal analysis demonstrating fundamental limitations of ungrounded models in capturing semantics, highlighting the necessity of grounding for genuine understanding.

Findings

01

Assertions enable semantic emulation in transparent languages.

02

Emulation becomes uncomputable for languages with context-dependent expressions.

03

Assertions are insufficient for full semantic understanding in ungrounded models.

Abstract

Language models trained on billions of tokens have recently led to unprecedented results on many NLP tasks. This success raises the question of whether, in principle, a system can ever ``understand'' raw text without access to some form of grounding. We formally investigate the abilities of ungrounded systems to acquire meaning. Our analysis focuses on the role of ``assertions'': textual contexts that provide indirect clues about the underlying semantics. We study whether assertions enable a system to emulate representations preserving semantic relations like equivalence. We find that assertions enable semantic emulation of languages that satisfy a strong notion of semantic transparency. However, for classes of languages where the same expression can take different values in different contexts, we show that emulation can become uncomputable. Finally, we discuss differences between our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.