Syntax-augmented Multilingual BERT for Cross-lingual Transfer

Wasi Uddin Ahmad; Haoran Li; Kai-Wei Chang; Yashar Mehdad

arXiv:2106.02134·cs.CL·June 7, 2021

Syntax-augmented Multilingual BERT for Cross-lingual Transfer

Wasi Uddin Ahmad, Haoran Li, Kai-Wei Chang, Yashar Mehdad

PDF

1 Repo

TL;DR

This paper enhances multilingual BERT by explicitly incorporating syntactic dependency structures, significantly improving cross-lingual transfer performance across multiple NLP tasks and benchmarks.

Contribution

It introduces a syntax-augmented training method for mBERT that explicitly encodes syntactic dependencies, advancing cross-lingual transfer capabilities.

Findings

01

Improves cross-lingual transfer on benchmarks by 1.4-1.6 points.

02

Achieves larger gains of 3.1-3.9 points in generalized transfer settings.

03

Demonstrates effectiveness across tasks like classification, QA, NER, and parsing.

Abstract

In recent years, we have seen a colossal effort in pre-training multilingual text encoders using large-scale corpora in many languages to facilitate cross-lingual transfer learning. However, due to typological differences across languages, the cross-lingual transfer is challenging. Nevertheless, language syntax, e.g., syntactic dependencies, can bridge the typological gap. Previous works have shown that pre-trained multilingual encoders, such as mBERT \cite{devlin-etal-2019-bert}, capture language syntax, helping cross-lingual transfer. This work shows that explicitly providing language syntax and training mBERT using an auxiliary objective to encode the universal dependency tree structure helps cross-lingual transfer. We perform rigorous experiments on four NLP tasks, including text classification, question answering, named entity recognition, and task-oriented semantic parsing. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wasiahmad/Syntax-MBERT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsmBERT