CV-18 NER: Augmented Common Voice for Named Entity Recognition from Arabic Speech

Youssef Saidi; Haroun Elleuch; Fethi Bougares

arXiv:2604.02209·cs.CL·April 3, 2026

CV-18 NER: Augmented Common Voice for Named Entity Recognition from Arabic Speech

Youssef Saidi, Haroun Elleuch, Fethi Bougares

PDF

1 Repo 1 Datasets

TL;DR

This paper introduces CV-18 NER, the first dataset for Arabic speech NER, and demonstrates that end-to-end models outperform traditional pipelines on this task, with publicly available resources for future research.

Contribution

It creates and releases the first annotated Arabic speech NER dataset and benchmarks end-to-end models, highlighting the effectiveness of multilingual pretraining and transfer learning.

Findings

01

End-to-end models outperform pipeline systems on Arabic speech NER.

02

Arabic-specific self-supervised pretraining improves ASR performance.

03

Larger models may be less effective in low-resource Arabic speech NER.

Abstract

End-to-end speech Named Entity Recognition (NER) aims to directly extract entities from speech. Prior work has shown that end-to-end (E2E) approaches can outperform cascaded pipelines for English, French, and Chinese, but Arabic remains under-explored due to its morphological complexity, the absence of short vowels, and limited annotated resources. We introduce CV-18 NER, the first publicly available dataset for NER from Arabic speech, created by augmenting the Arabic Common Voice 18 corpus with manual NER annotations following the fine-grained Wojood schema (21 entity types). We benchmark both pipeline systems (ASR + text NER) and E2E models based on Whisper and AraBEST-RQ. E2E systems substantially outperform the best pipeline configuration on the test set, reaching 37.0% CoER (AraBEST-RQ 300M) and 38.0% CVER (Whisper-medium). Further analysis shows that Arabic-specific…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://huggingface.co/datasets/Elyadata/CV18-NER
github

Datasets

Elyadata/CV18-NER
dataset· 56 dl
56 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.