Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically   Ambiguous Settings for Low Resource Languages

Amit Pandey; Swayatta Daw; Narendra Babu Unnam; Vikram Pudi

arXiv:2207.06882·cs.CL·July 15, 2022

Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically Ambiguous Settings for Low Resource Languages

Amit Pandey, Swayatta Daw, Narendra Babu Unnam, Vikram Pudi

PDF

Open Access 1 Repo

TL;DR

This paper explores the use of pre-trained language models with Whole Word Masking for complex Named Entity Recognition in low-resource Chinese and Spanish, achieving significant improvements over baselines.

Contribution

It introduces the application of Whole Word Masking with various neural architectures to enhance NER performance in low-resource languages.

Findings

01

All models outperform baseline significantly

02

Best model achieves competitive leaderboard position

03

Whole Word Masking boosts masked language modeling in low-resource settings

Abstract

We leverage pre-trained language models to solve the task of complex NER for two low-resource languages: Chinese and Spanish. We use the technique of Whole Word Masking(WWM) to boost the performance of masked language modeling objective on large and unsupervised corpora. We experiment with multiple neural network architectures, incorporating CRF, BiLSTMs, and Linear Classifiers on top of a fine-tuned BERT layer. All our models outperform the baseline by a significant margin and our best performing model obtains a competitive position on the evaluation leaderboard for the blind test set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

amitpandey-research/complex_ner
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Test · Linear Layer · Attention Dropout · Adam · Residual Connection · Layer Normalization · Linear Warmup With Linear Decay