Event knowledge in large language models: the gap between the impossible   and the unlikely

Carina Kauf; Anna A. Ivanova; Giulia Rambelli; Emmanuele Chersoni,; Jingyuan Selena She; Zawad Chowdhury; Evelina Fedorenko; Alessandro Lenci

arXiv:2212.01488·cs.CL·October 27, 2023·6 cites

Event knowledge in large language models: the gap between the impossible and the unlikely

Carina Kauf, Anna A. Ivanova, Giulia Rambelli, Emmanuele Chersoni,, Jingyuan Selena She, Zawad Chowdhury, Evelina Fedorenko, Alessandro Lenci

PDF

Open Access 1 Repo

TL;DR

This study investigates whether large language models understand event plausibility and likelihood, revealing they recognize possible events but are less consistent with likelihood, highlighting gaps in their semantic event knowledge.

Contribution

The paper provides empirical evidence that pre-trained LLMs possess substantial event knowledge, especially regarding possible vs. impossible events, but show limitations with likely vs. unlikely distinctions.

Findings

01

LLMs assign higher likelihood to plausible vs. implausible events

02

Scores are influenced by both plausibility and surface features

03

Knowledge generalizes across syntactic variants but less across semantic variants

Abstract

Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs' semantic abilities is whether they acquire generalized knowledge of common events. Here, we test whether five pre-trained LLMs (from 2018's BERT to 2023's MPT) assign higher likelihood to plausible descriptions of agent-patient interactions than to minimally different implausible versions of the same event. Using three curated sets of minimal sentence pairs (total n=1,215), we found that pre-trained LLMs possess substantial event knowledge, outperforming other distributional language models. In particular, they almost always assign higher likelihood to possible…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

carina-kauf/lm-event-knowledge
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Weight Decay · Linear Layer · Attention Dropout · WordPiece · Softmax · Dense Connections · Layer Normalization