Evaluating German Transformer Language Models with Syntactic Agreement   Tests

Karolina Zaczynska; Nils Feldhus; Robert Schwarzenberg; Aleksandra; Gabryszak; Sebastian M\"oller

arXiv:2007.03765·cs.CL·July 9, 2020

Evaluating German Transformer Language Models with Syntactic Agreement Tests

Karolina Zaczynska, Nils Feldhus, Robert Schwarzenberg, Aleksandra, Gabryszak, Sebastian M\"oller

PDF

Open Access 3 Repos

TL;DR

This paper evaluates German transformer language models using syntactic agreement tests, revealing their strengths and limitations in handling complex German syntactic structures.

Contribution

It introduces German-specific agreement tasks and analyzes the performance of state-of-the-art German TLMs on these tasks.

Findings

01

German TLMs perform well on agreement tasks

02

Certain complex syntactic structures challenge TLMs

03

Analysis highlights areas for future improvement

Abstract

Pre-trained transformer language models (TLMs) have recently refashioned natural language processing (NLP): Most state-of-the-art NLP models now operate on top of TLMs to benefit from contextualization and knowledge induction. To explain their success, the scientific community conducted numerous analyses. Besides other methods, syntactic agreement tests were utilized to analyse TLMs. Most of the studies were conducted for the English language, however. In this work, we analyse German TLMs. To this end, we design numerous agreement tasks, some of which consider peculiarities of the German language. Our experimental results show that state-of-the-art German TLMs generally perform well on agreement tasks, but we also identify and discuss syntactic structures that push them to their limits.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification