Entailment Semantics Can Be Extracted from an Ideal Language Model

William Merrill; Alex Warstadt; Tal Linzen

arXiv:2209.12407·cs.CL·January 10, 2024

Entailment Semantics Can Be Extracted from an Ideal Language Model

William Merrill, Alex Warstadt, Tal Linzen

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that entailment semantics can be extracted from an ideal language model trained on pragmatically grounded data, revealing how semantic information can be decoded from language models.

Contribution

It proves that entailment judgments can be derived from an ideal language model trained on Gricean data, bridging the gap between language modeling and semantic inference.

Findings

01

Entailment judgments can be extracted from an ideal language model.

02

Decoding entailment is possible from models trained on pragmatically grounded data.

03

The results suggest a framework for understanding semantics in language models.

Abstract

Language models are often trained on text alone, without additional grounding. There is debate as to how much of natural language semantics can be inferred from such a procedure. We prove that entailment judgments between sentences can be extracted from an ideal language model that has perfectly learned its target distribution, assuming the training sentences are generated by Gricean agents, i.e., agents who follow fundamental principles of communication from the linguistic theory of pragmatics. We also show entailment judgments can be decoded from the predictions of a language model trained on such Gricean data. Our results reveal a pathway for understanding the semantic information encoded in unlabeled linguistic data and a potential framework for extracting semantics from language models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

viking-sudo-rm/formal-language-understanding
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems