Controlled Evaluation of Grammatical Knowledge in Mandarin Chinese   Language Models

Yiwen Wang; Jennifer Hu; Roger Levy; Peng Qian

arXiv:2109.11058·cs.CL·September 24, 2021

Controlled Evaluation of Grammatical Knowledge in Mandarin Chinese Language Models

Yiwen Wang, Jennifer Hu, Roger Levy, Peng Qian

PDF

Open Access 1 Repo

TL;DR

This study evaluates whether structural supervision enhances Mandarin Chinese language models' ability to learn grammatical dependencies, showing benefits in syntactic representation and low-data scenarios, indicating cross-linguistic applicability.

Contribution

It provides the first controlled evaluation of grammatical knowledge in Mandarin Chinese language models, demonstrating the advantages of hierarchical inductive biases beyond English.

Findings

01

Structural supervision improves syntactic state representation.

02

Hierarchical biases benefit low-data learning.

03

Models show improved grammatical understanding with supervision.

Abstract

Prior work has shown that structural supervision helps English language models learn generalizations about syntactic phenomena such as subject-verb agreement. However, it remains unclear if such an inductive bias would also improve language models' ability to learn grammatical dependencies in typologically different languages. Here we investigate this question in Mandarin Chinese, which has a logographic, largely syllable-based writing system; different word order; and sparser morphology than English. We train LSTMs, Recurrent Neural Network Grammars, Transformer language models, and Transformer-parameterized generative parsing models on two Mandarin Chinese datasets of different sizes. We evaluate the models' ability to learn different aspects of Mandarin grammar that assess syntactic and semantic relationships. We find suggestive evidence that structural supervision helps with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yiwenwang03/syntactic-generalization-mandarin
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dense Connections · Softmax · Label Smoothing · Residual Connection · Layer Normalization · Position-Wise Feed-Forward Layer · Adam