MANER: Mask Augmented Named Entity Recognition for Extreme Low-Resource   Languages

Shashank Sonkar; Zichao Wang; Richard G. Baraniuk

arXiv:2212.09723·cs.CL·December 20, 2022

MANER: Mask Augmented Named Entity Recognition for Extreme Low-Resource Languages

Shashank Sonkar, Zichao Wang, Richard G. Baraniuk

PDF

Open Access

TL;DR

This paper introduces MANER, a novel approach leveraging masked language models to improve NER performance in extremely low-resource languages with minimal training data.

Contribution

MANER re-purposes the <mask> token in pre-trained MLMs for NER prediction, enabling effective NER in languages with as few as 100 training samples.

Findings

01

MANER improves state-of-the-art results by up to 48% in low-resource settings.

02

It achieves an average F1 score increase of 12% across 100 languages.

03

Detailed analyses show scenarios where MANER performs best.

Abstract

This paper investigates the problem of Named Entity Recognition (NER) for extreme low-resource languages with only a few hundred tagged data samples. NER is a fundamental task in Natural Language Processing (NLP). A critical driver accelerating NER systems' progress is the existence of large-scale language corpora that enable NER systems to achieve outstanding performance in languages such as English and French with abundant training data. However, NER for low-resource languages remains relatively unexplored. In this paper, we introduce Mask Augmented Named Entity Recognition (MANER), a new methodology that leverages the distributional hypothesis of pre-trained masked language models (MLMs) for NER. The <mask> token in pre-trained MLMs encodes valuable semantic contextual information. MANER re-purposes the <mask> token for NER prediction. Specifically, we prepend the <mask> token to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications