Multilingual and cross-lingual document classification: A meta-learning   approach

Niels van der Heijden; Helen Yannakoudakis; Pushkar Mishra; Ekaterina; Shutova

arXiv:2101.11302·cs.CL·April 27, 2021

Multilingual and cross-lingual document classification: A meta-learning approach

Niels van der Heijden, Helen Yannakoudakis, Pushkar Mishra, Ekaterina, Shutova

PDF

1 Repo

TL;DR

This paper introduces a meta-learning approach for multilingual and cross-lingual document classification, effectively handling low-resource languages and limited data scenarios, and achieving state-of-the-art results.

Contribution

It presents a novel meta-learning method tailored for under-resourced languages, with a simple adjustment that improves stability and performance in multilingual classification tasks.

Findings

01

Meta-learning excels with heterogeneous task distributions.

02

Proposed adjustment enhances learning stability.

03

Achieves new state-of-the-art on several languages.

Abstract

The great majority of languages in the world are considered under-resourced for the successful application of deep learning methods. In this work, we propose a meta-learning approach to document classification in limited-resource setting and demonstrate its effectiveness in two different settings: few-shot, cross-lingual adaptation to previously unseen languages; and multilingual joint training when limited target-language data is available during training. We conduct a systematic comparison of several meta-learning methods, investigate multiple settings in terms of data availability and show that meta-learning thrives in settings with a heterogeneous task distribution. We propose a simple, yet effective adjustment to existing meta-learning methods which allows for better and more stable learning, and set a new state of the art on several languages while performing on-par on others,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mrvoh/meta_learning_multilingual_doc_classification
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.