Cross-lingual, Character-Level Neural Morphological Tagging

Ryan Cotterell; Georg Heigold

arXiv:1708.09157·cs.CL·April 25, 2025

TL;DR

This paper introduces a transfer learning approach in which character-level recurrent neural taggers are trained jointly on related high-resource and low-resource languages, improving morphological tagging accuracy in the low-resource languages by up to 30% over a monolingual model.

Contribution

The paper presents a joint training scheme for character-level neural taggers across multiple related languages. Learning shared character representations lets knowledge transfer from the high-resource languages to the low-resource ones.

Findings

01. Up to 30% accuracy improvement over monolingual models.

02. Joint character representations facilitate cross-lingual transfer.

03. Effective for low-resource languages with related high-resource counterparts.

Abstract

Even for common NLP tasks, sufficient supervision is not available in many languages -- morphological tagging is no exception. In the work presented here, we explore a transfer learning scheme, whereby we train character-level recurrent neural taggers to predict morphological taggings for high-resource languages and low-resource languages together. Learning joint character representations among multiple related languages successfully enables knowledge transfer from the high-resource languages to the low-resource ones, improving accuracy by up to 30% over a monolingual model.
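
The idea of joint character representations can be illustrated with a minimal sketch. It is not the authors' implementation: the toy words, the tag strings, the language ids `"hi"` and `"lo"`, and the helper names `build_char_vocab`, `encode`, and `joint_examples` are all hypothetical. It only shows the data side of the scheme, assuming a single character vocabulary shared by both languages, so that any character embedding learned from the high-resource data is reused when the tagger reads low-resource words.

```python
# Toy, made-up tagged words for two related languages. NOT data from the paper.
HIGH_RESOURCE = [("gatos", "N;PL"), ("gato", "N;SG"), ("canta", "V;SG")]
LOW_RESOURCE = [("gatos", "N;PL"), ("cão", "N;SG")]


def chars_of(corpus):
    """Return the set of characters used by the words of one corpus."""
    return {ch for word, _tag in corpus for ch in word}


def build_char_vocab(*corpora):
    """Map every character seen in any corpus to one shared integer id."""
    vocab = {"<pad>": 0, "<unk>": 1}
    all_chars = set()
    for corpus in corpora:
        all_chars |= chars_of(corpus)
    for ch in sorted(all_chars):
        vocab[ch] = len(vocab)
    return vocab


def encode(word, vocab):
    """Turn a word into character ids; unseen characters map to <unk>."""
    return [vocab.get(ch, vocab["<unk>"]) for ch in word]


def joint_examples(high, low, high_id="hi", low_id="lo"):
    """Pool both corpora into one training set, keeping a language id per word."""
    pooled = [(high_id, word, tag) for word, tag in high]
    pooled += [(low_id, word, tag) for word, tag in low]
    return pooled


vocab = build_char_vocab(HIGH_RESOURCE, LOW_RESOURCE)
examples = joint_examples(HIGH_RESOURCE, LOW_RESOURCE)
shared_chars = chars_of(HIGH_RESOURCE) & chars_of(LOW_RESOURCE)

# The same word gets the same id sequence whichever language it comes from.
print(encode("gatos", vocab))
print(sorted(shared_chars))
```

In a full system the id sequences would feed a character-level recurrent tagger trained on the pooled examples. The overlap in `shared_chars` is what gives the low-resource language access to representations learned mostly from high-resource data.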
