Multilingual Syntax-aware Language Modeling through Dependency Tree   Conversion

Shunsuke Kando; Hiroshi Noji; Yusuke Miyao

arXiv:2204.08644·cs.CL·April 20, 2022

Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Shunsuke Kando, Hiroshi Noji, Yusuke Miyao

PDF

Open Access

TL;DR

This paper explores how different dependency-to-constituency conversion methods affect multilingual syntax-aware language models, demonstrating that optimal tree formats significantly improve performance across multiple languages.

Contribution

It systematically evaluates various conversion methods for dependency trees in multilingual RNNGs, providing insights into their impact on language modeling performance.

Findings

01

Best model achieves 19% higher accuracy than worst across languages

02

Syntax injection outperforms sequential/overparameterized models

03

Choosing the right tree formalism is crucial for multilingual syntax-aware LMs

Abstract

Incorporating stronger syntactic biases into neural language models (LMs) is a long-standing goal, but research in this area often focuses on modeling English text, where constituent treebanks are readily available. Extending constituent tree-based LMs to the multilingual setting, where dependency treebanks are more common, is possible via dependency-to-constituency conversion methods. However, this raises the question of which tree formats are best for learning the model, and for which languages. We investigate this question by training recurrent neural network grammars (RNNGs) using various conversion methods, and evaluating them empirically in a multilingual setting. We examine the effect on LM performance across nine conversion methods and five languages through seven types of syntactic tests. On average, the performance of our best model represents a 19 \% increase in accuracy over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification