Controlled Evaluation of Syntactic Knowledge in Multilingual Language   Models

Daria Kryvosheieva; Roger Levy

arXiv:2411.07474·cs.CL·December 13, 2024

Controlled Evaluation of Syntactic Knowledge in Multilingual Language Models

Daria Kryvosheieva, Roger Levy

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the syntactic understanding of multilingual language models in low-resource languages, revealing strengths and weaknesses across different syntactic tasks and models, and highlighting biases and performance issues.

Contribution

It introduces targeted syntactic evaluation tests for Basque, Hindi, and Swahili, expanding understanding of LMs' capabilities in low-resource language contexts.

Findings

01

Some syntactic tasks are easy for LMs, others are challenging.

02

Multilingual BERT shows bias toward habitual aspect in Hindi.

03

XGLM-4.5B underperforms compared to similar-sized models.

Abstract

Language models (LMs) are capable of acquiring elements of human-like syntactic knowledge. Targeted syntactic evaluation tests have been employed to measure how well they form generalizations about syntactic phenomena in high-resource languages such as English. However, we still lack a thorough understanding of LMs' capacity for syntactic generalizations in low-resource languages, which are responsible for much of the diversity of syntactic patterns worldwide. In this study, we develop targeted syntactic evaluation tests for three low-resource languages (Basque, Hindi, and Swahili) and use them to evaluate five families of open-access multilingual Transformer LMs. We find that some syntactic tasks prove relatively easy for LMs while others (agreement in sentences containing indirect objects in Basque, agreement across a prepositional phrase in Swahili) are challenging. We additionally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dariakryvosheieva/syntactic_generalization_multilingual
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Linear Warmup With Linear Decay · WordPiece · Dense Connections · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer