DeformAr: Rethinking NER Evaluation through Component Analysis and Visual Analytics
Ahmed Mustafa Younes

TL;DR
DeformAr is a comprehensive framework combining data analysis and visual tools to diagnose and explain performance gaps in Arabic NER systems compared to English, addressing tokenisation, dataset quality, and annotation issues.
Contribution
It introduces DeformAr, the first Arabic-specific interpretability framework that analyzes data and model components jointly through interactive visualizations and diagnostic measures.
Findings
DeformAr effectively identifies factors affecting Arabic NER performance.
The framework reveals interactions between data quality and model behaviour.
It provides actionable insights for improving Arabic NER systems.
Abstract
Transformer models have significantly advanced Natural Language Processing (NLP), demonstrating strong performance in English. However, their effectiveness in Arabic, particularly for Named Entity Recognition (NER), remains limited, even with larger pre-trained models. This performance gap stems from multiple factors, including tokenisation, dataset quality, and annotation inconsistencies. Existing studies often analyze these issues in isolation, failing to capture their joint effect on system behaviour and performance. We introduce DeformAr (Debugging and Evaluation Framework for Transformer-based NER Systems), a novel framework designed to investigate and explain the performance discrepancy between Arabic and English NER systems. DeformAr integrates a data extraction library and an interactive dashboard, supporting two modes of evaluation: cross-component analysis and behavioural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
