Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
Yuhan Zhang, Edward Gibson, Forrest Davis

TL;DR
This study investigates whether language models can be fooled by language illusions similar to humans, revealing that they are more susceptible to structural illusions like NPI but less to semantic ones, highlighting their limitations.
Contribution
The paper provides a comparative analysis of language models' responses to various language illusions, demonstrating their limited capacity to mimic human nuanced language judgments.
Findings
LMs are more aligned with humans on NPI illusions involving structure.
LMs struggle with semantic illusions like the comparative and depth-charge illusions.
No LM fully replicates human-like judgments across all illusions.
Abstract
Language models (LMs) have been argued to overlap substantially with human beings in grammaticality judgment tasks. But when humans systematically make errors in language processing, should we expect LMs to behave like cognitive models of language and mimic human behavior? We answer this question by investigating LMs' more subtle judgments associated with "language illusions" -- sentences that are vague in meaning, implausible, or ungrammatical but receive unexpectedly high acceptability judgments by humans. We looked at three illusions: the comparative illusion (e.g. "More people have been to Russia than I have"), the depth-charge illusion (e.g. "No head injury is too trivial to be ignored"), and the negative polarity item (NPI) illusion (e.g. "The hunter who no villager believed to be trustworthy will ever shoot a bear"). We found that probabilities represented by LMs were more likely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Neurobiology of Language and Bilingualism
MethodsALIGN
