LLMs' Understanding of Natural Language Revealed
Walid S. Saba

TL;DR
This paper critically examines the language understanding capabilities of large language models (LLMs), revealing that their supposed understanding is superficial and primarily based on memorization rather than genuine comprehension.
Contribution
The study introduces a novel testing approach that assesses LLMs' understanding by querying them with snippets of text, demonstrating their limited true language comprehension.
Findings
LLMs' understanding is superficial and based on memorization.
Traditional evaluations overestimate LLMs' language understanding.
LLMs perform poorly on understanding tasks that require genuine comprehension.
Abstract
Large language models (LLMs) are the result of a massive experiment in bottom-up, data-driven reverse engineering of language at scale. Despite their utility in a number of downstream NLP tasks, ample research has shown that LLMs are incapable of performing reasoning in tasks that require quantification over and the manipulation of symbolic variables (e.g., planning and problem solving); see for example [25][26]. In this document, however, we will focus on testing LLMs for their language understanding capabilities, their supposed forte. As we will show here, the language understanding capabilities of LLMs have been widely exaggerated. While LLMs have proven to generate human-like coherent language (since that's how they were designed), their language understanding capabilities have not been properly tested. In particular, we believe that the language understanding capabilities of LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Library Science and Information Systems
MethodsFocus
