UM6P-CS at SemEval-2022 Task 11: Enhancing Multilingual and Code-Mixed Complex Named Entity Recognition via Pseudo Labels using Multilingual Transformer
Abdellah El Mekki, Abdelkader El Mahdaouy, Mohammed Akallouch, and Ismail Berrada, Ahmed Khoumsi

TL;DR
This paper presents a multilingual and code-mixed complex NER system using XLM-RoBERTa with span classification and self-training, achieving competitive results in the MultiCoNER shared task.
Contribution
It introduces a novel approach combining span classification and self-training with multilingual transformers for complex NER in multilingual and code-mixed contexts.
Findings
Ranked 6th in multilingual track
Ranked 8th in code-mixed track
Effective use of pseudo-labels improves performance
Abstract
Building real-world complex Named Entity Recognition (NER) systems is a challenging task. This is due to the complexity and ambiguity of named entities that appear in various contexts such as short input sentences, emerging entities, and complex entities. Besides, real-world queries are mostly malformed, as they can be code-mixed or multilingual, among other scenarios. In this paper, we introduce our submitted system to the Multilingual Complex Named Entity Recognition (MultiCoNER) shared task. We approach the complex NER for multilingual and code-mixed queries, by relying on the contextualized representation provided by the multilingual Transformer XLM-RoBERTa. In addition to the CRF-based token classification layer, we incorporate a span classification loss to recognize named entities spans. Furthermore, we use a self-training mechanism to generate weakly-annotated data from a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Web Data Mining and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Byte Pair Encoding · Dense Connections · Residual Connection · Dropout · Adam · Position-Wise Feed-Forward Layer
