UM6P-CS at SemEval-2022 Task 11: Enhancing Multilingual and Code-Mixed   Complex Named Entity Recognition via Pseudo Labels using Multilingual   Transformer

Abdellah El Mekki; Abdelkader El Mahdaouy; Mohammed Akallouch; and Ismail Berrada; Ahmed Khoumsi

arXiv:2204.13515·cs.CL·April 29, 2022

UM6P-CS at SemEval-2022 Task 11: Enhancing Multilingual and Code-Mixed Complex Named Entity Recognition via Pseudo Labels using Multilingual Transformer

Abdellah El Mekki, Abdelkader El Mahdaouy, Mohammed Akallouch, and Ismail Berrada, Ahmed Khoumsi

PDF

Open Access

TL;DR

This paper presents a multilingual and code-mixed complex NER system using XLM-RoBERTa with span classification and self-training, achieving competitive results in the MultiCoNER shared task.

Contribution

It introduces a novel approach combining span classification and self-training with multilingual transformers for complex NER in multilingual and code-mixed contexts.

Findings

01

Ranked 6th in multilingual track

02

Ranked 8th in code-mixed track

03

Effective use of pseudo-labels improves performance

Abstract

Building real-world complex Named Entity Recognition (NER) systems is a challenging task. This is due to the complexity and ambiguity of named entities that appear in various contexts such as short input sentences, emerging entities, and complex entities. Besides, real-world queries are mostly malformed, as they can be code-mixed or multilingual, among other scenarios. In this paper, we introduce our submitted system to the Multilingual Complex Named Entity Recognition (MultiCoNER) shared task. We approach the complex NER for multilingual and code-mixed queries, by relying on the contextualized representation provided by the multilingual Transformer XLM-RoBERTa. In addition to the CRF-based token classification layer, we incorporate a span classification loss to recognize named entities spans. Furthermore, we use a self-training mechanism to generate weakly-annotated data from a large…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Web Data Mining and Analysis

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Dense Connections · Residual Connection · Dropout · Adam · Position-Wise Feed-Forward Layer