Does Typological Blinding Impede Cross-Lingual Sharing?

Johannes Bjerva; Isabelle Augenstein

arXiv:2101.11888·cs.CL·January 29, 2021

Does Typological Blinding Impede Cross-Lingual Sharing?

Johannes Bjerva, Isabelle Augenstein

PDF

TL;DR

This paper investigates whether explicitly using typological features enhances cross-lingual models, finding that blinding models to such features significantly reduces performance, while encouraging typological sharing can improve results.

Contribution

It demonstrates that explicit typological information is crucial for cross-lingual sharing, challenging prior assumptions of its limited utility.

Findings

01

Blinding models to typology reduces performance.

02

Encouraging typological sharing improves cross-lingual transfer.

03

Typological cues are implicitly learned by models in cross-lingual settings.

Abstract

Bridging the performance gap between high- and low-resource languages has been the focus of much previous work. Typological features from databases such as the World Atlas of Language Structures (WALS) are a prime candidate for this, as such data exists even for very low-resource languages. However, previous work has only found minor benefits from using typological information. Our hypothesis is that a model trained in a cross-lingual setting will pick up on typological cues from the input data, thus overshadowing the utility of explicitly using such features. We verify this hypothesis by blinding a model to typological information, and investigate how cross-lingual sharing and performance is impacted. Our model is based on a cross-lingual architecture in which the latent weights governing the sharing between languages is learnt during training. We show that (i) preventing this model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.