mucAI at WojoodNER 2024: Arabic Named Entity Recognition with Nearest Neighbor Search
Ahmed Abdou, Tasneem Mohsen

TL;DR
This paper presents Arabic KNN-NER, a novel approach for Arabic Named Entity Recognition that combines a fine-tuned model with KNN search over training data, achieving top results in the Wojood NER Shared Task 2024.
Contribution
Introduction of Arabic KNN-NER, a method that enhances NER performance by integrating KNN search with a fine-tuned model specifically for Arabic language challenges.
Findings
Achieved 91% accuracy on WojoodFine dataset
Outperformed other submissions in the shared task
Demonstrated effectiveness of combining KNN with neural models
Abstract
Named Entity Recognition (NER) is a task in Natural Language Processing (NLP) that aims to identify and classify entities in text into predefined categories. However, when applied to Arabic data, NER encounters unique challenges stemming from the language's rich morphological inflections, absence of capitalization cues, and spelling variants, where a single word can comprise multiple morphemes. In this paper, we introduce Arabic KNN-NER, our submission to the Wojood NER Shared Task 2024 (ArabicNLP 2024). We have participated in the shared sub-task 1 Flat NER. In this shared sub-task, we tackle fine-grained flat-entity recognition for Arabic text, where we identify a single main entity and possibly zero or multiple sub-entities for each word. Arabic KNN-NER augments the probability distribution of a fine-tuned model with another label probability distribution derived from performing a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
MethodsSparse Evolutionary Training
