Evaluating German Transformer Language Models with Syntactic Agreement Tests
Karolina Zaczynska, Nils Feldhus, Robert Schwarzenberg, Aleksandra, Gabryszak, Sebastian M\"oller

TL;DR
This paper evaluates German transformer language models using syntactic agreement tests, revealing their strengths and limitations in handling complex German syntactic structures.
Contribution
It introduces German-specific agreement tasks and analyzes the performance of state-of-the-art German TLMs on these tasks.
Findings
German TLMs perform well on agreement tasks
Certain complex syntactic structures challenge TLMs
Analysis highlights areas for future improvement
Abstract
Pre-trained transformer language models (TLMs) have recently refashioned natural language processing (NLP): Most state-of-the-art NLP models now operate on top of TLMs to benefit from contextualization and knowledge induction. To explain their success, the scientific community conducted numerous analyses. Besides other methods, syntactic agreement tests were utilized to analyse TLMs. Most of the studies were conducted for the English language, however. In this work, we analyse German TLMs. To this end, we design numerous agreement tasks, some of which consider peculiarities of the German language. Our experimental results show that state-of-the-art German TLMs generally perform well on agreement tasks, but we also identify and discuss syntactic structures that push them to their limits.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
