Event knowledge in large language models: the gap between the impossible and the unlikely
Carina Kauf, Anna A. Ivanova, Giulia Rambelli, Emmanuele Chersoni,, Jingyuan Selena She, Zawad Chowdhury, Evelina Fedorenko, Alessandro Lenci

TL;DR
This study investigates whether large language models understand event plausibility and likelihood, revealing they recognize possible events but are less consistent with likelihood, highlighting gaps in their semantic event knowledge.
Contribution
The paper provides empirical evidence that pre-trained LLMs possess substantial event knowledge, especially regarding possible vs. impossible events, but show limitations with likely vs. unlikely distinctions.
Findings
LLMs assign higher likelihood to plausible vs. implausible events
Scores are influenced by both plausibility and surface features
Knowledge generalizes across syntactic variants but less across semantic variants
Abstract
Word co-occurrence patterns in language corpora contain a surprising amount of conceptual knowledge. Large language models (LLMs), trained to predict words in context, leverage these patterns to achieve impressive performance on diverse semantic tasks requiring world knowledge. An important but understudied question about LLMs' semantic abilities is whether they acquire generalized knowledge of common events. Here, we test whether five pre-trained LLMs (from 2018's BERT to 2023's MPT) assign higher likelihood to plausible descriptions of agent-patient interactions than to minimally different implausible versions of the same event. Using three curated sets of minimal sentence pairs (total n=1,215), we found that pre-trained LLMs possess substantial event knowledge, outperforming other distributional language models. In particular, they almost always assign higher likelihood to possible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Weight Decay · Linear Layer · Attention Dropout · WordPiece · Softmax · Dense Connections · Layer Normalization
