TL;DR
This study evaluates how well existing English NER models perform on diverse global English varieties, revealing significant performance drops on non-American/British English data and suggesting the need for more inclusive training datasets.
Contribution
The paper introduces the Worldwide English NER Dataset and analyzes NER model performance across diverse English varieties, highlighting limitations of current models on global English data.
Findings
Significant performance drops (>10 F1) on global English varieties.
Models trained on standard datasets perform poorly on Oceania and Africa.
Combined training on the new dataset improves robustness with minimal F1 loss.
Abstract
The vast majority of the popular English named entity recognition (NER) datasets contain American or British English data, despite the existence of many global varieties of English. As such, it is unclear whether they generalize for analyzing use of English globally. To test this, we build a newswire dataset, the Worldwide English NER Dataset, to analyze NER model performance on low-resource English variants from around the world. We test widely used NER toolkits and transformer models, including models using the pre-trained contextual models RoBERTa and ELECTRA, on three datasets: a commonly used British English newswire dataset, CoNLL 2003, a more American focused dataset OntoNotes, and our global dataset. All models trained on the CoNLL or OntoNotes datasets experienced significant performance drops-over 10 F1 in some cases-when tested on the Worldwide English dataset. Upon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · 7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Dense Connections · Residual Connection · Softmax · Adam · Linear Warmup With Linear Decay · Layer Normalization · Attention Dropout
