How Well Do Large Language Models Understand Syntax? An Evaluation by   Asking Natural Language Questions

Houquan Zhou; Yang Hou; Zhenghua Li; Xuebin Wang; Zhefeng Wang; Xinyu; Duan; Min Zhang

arXiv:2311.08287·cs.CL·November 15, 2023·1 cites

How Well Do Large Language Models Understand Syntax? An Evaluation by Asking Natural Language Questions

Houquan Zhou, Yang Hou, Zhenghua Li, Xuebin Wang, Zhefeng Wang, Xinyu, Duan, Min Zhang

PDF

Open Access 1 Repo

TL;DR

This paper evaluates how well large language models understand syntax by using a natural language question-answering approach, revealing limited syntactic comprehension and insights into training dynamics.

Contribution

It introduces a novel question-based evaluation method targeting key syntactic points and provides empirical insights into the syntactic understanding of 24 LLMs.

Findings

01

Most LLMs show limited syntactic understanding.

02

Prepositional phrase attachment is particularly challenging.

03

Syntactic knowledge is mostly acquired early in training.

Abstract

While recent advancements in large language models (LLMs) bring us closer to achieving artificial general intelligence, the question persists: Do LLMs truly understand language, or do they merely mimic comprehension through pattern recognition? This study seeks to explore this question through the lens of syntax, a crucial component of sentence comprehension. Adopting a natural language question-answering (Q&A) scheme, we craft questions targeting nine syntactic knowledge points that are most closely related to sentence comprehension. Experiments conducted on 24 LLMs suggest that most have a limited grasp of syntactic knowledge, exhibiting notable discrepancies across different syntactic knowledge points. In particular, questions involving prepositional phrase attachment pose the greatest challenge, whereas those concerning adjectival modifier and indirect object are relatively easier…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Jacob-Zhou/SynEval
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications