LLM-RM at SemEval-2023 Task 2: Multilingual Complex NER using XLM-RoBERTa

Rahul Mehta; Vasudeva Varma

arXiv:2305.03300·cs.CL·May 8, 2023·2 cites

Open Access

TL;DR

This paper presents a multilingual approach to complex named entity recognition by fine-tuning XLM-RoBERTa on datasets across 12 languages for the SemEval-2023 Task 2 challenge.

Contribution

It introduces a cross-lingual fine-tuning method using XLM-RoBERTa for multilingual complex NER in 12 languages, addressing the challenge of recognizing complex entities.

Findings

01 Achieved competitive results in SemEval-2023 Task 2

02 Demonstrated effectiveness of cross-lingual transfer learning

03 Improved NER performance across multiple languages

Abstract

Named Entity Recognition (NER) is the task of recognizing entities at the token level in a sentence. This paper focuses on solving NER tasks in a multilingual setting for complex named entities. Our team, LLM-RM, participated in the recently organized SemEval-2023 Task 2: MultiCoNER II, Multilingual Complex Named Entity Recognition. We approach the problem by leveraging the cross-lingual representations obtained by fine-tuning the XLM-RoBERTa base model on the datasets of all 12 languages provided: Bangla, Chinese, English, Farsi, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, and Ukrainian.
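
To make "recognizing entities at the token level" concrete, the following is a minimal sketch of how per-token tags can be grouped back into entity spans. It assumes the common BIO tagging scheme (B- starts an entity, I- continues it, O is outside any entity); the abstract does not state the tagging scheme or post-processing used, and the function name `bio_to_spans` and the label names `WrittenWork` and `Artist` are illustrative placeholders, not details taken from the paper.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group token-level BIO tags into (entity_text, entity_type) pairs.

    Hypothetical helper for illustration only; it is not the authors' code.
    """
    spans: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Close the entity being built, if any, and reset the buffer.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            spans.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always opens a new entity.
            flush()
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # An I- tag of the same type extends the open entity.
            current_tokens.append(token)
        else:
            flush()
            if tag.startswith("I-"):
                # Stray I- tag without a matching B-: treat it as a new entity.
                current_tokens, current_type = [token], tag[2:]
    flush()
    return spans


# Illustrative sentence with placeholder labels (assumed, not from the paper).
example_tokens = ["the", "lord", "of", "the", "rings", "was", "written", "by", "tolkien"]
example_tags = [
    "B-WrittenWork", "I-WrittenWork", "I-WrittenWork", "I-WrittenWork", "I-WrittenWork",
    "O", "O", "O", "B-Artist",
]
example_spans = bio_to_spans(example_tokens, example_tags)
```

Here `example_spans` holds one multi-token entity and one single-token entity. Complex entities such as titles of creative works often span several ordinary words, which is part of what makes this setting harder than recognizing conventional names.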

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methods: Balanced Selection