Entity-Switched Datasets: An Approach to Auditing the In-Domain Robustness of Named Entity Recognition Models
Oshin Agarwal, Yinfei Yang, Byron C. Wallace, Ani Nenkova

TL;DR
This paper introduces a method to evaluate the in-domain robustness of named entity recognition systems by creating entity-switched datasets to analyze performance variations based on the entities' national origins.
Contribution
It proposes a novel auditing approach using entity-switched datasets to assess and improve the fairness and robustness of NER models across different national entity origins.
Findings
Performance varies widely across entity origins.
Best performance on American and Indian entities.
Worst performance on Vietnamese and Indonesian entities.
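A per-origin comparison like the one in these findings can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: it assumes BIO-tagged output, uses exact-match entity recall as a stand-in metric, and the helper names `spans` and `recall_by_origin` are hypothetical.

```python
def spans(tags):
    """Extract (start, end, type) entity spans from a BIO tag sequence."""
    result = set()
    start, etype = None, None
    # A trailing "O" sentinel closes any entity that runs to the end.
    for i, tag in enumerate(list(tags) + ["O"]):
        inside = tag.startswith("I-") and start is not None and tag[2:] == etype
        if not inside:
            if start is not None:
                result.add((start, i, etype))
                start, etype = None, None
            if tag.startswith("B-"):
                start, etype = i, tag[2:]
    return result


def recall_by_origin(examples):
    """Map each origin to exact-match entity recall.

    `examples` maps an origin label to a list of (gold_tags, predicted_tags)
    pairs, one pair per sentence of the entity-switched data for that origin.
    """
    scores = {}
    for origin, pairs in examples.items():
        found = total = 0
        for gold, pred in pairs:
            gold_spans = spans(gold)
            found += len(gold_spans & spans(pred))
            total += len(gold_spans)
        scores[origin] = found / total if total else 0.0
    return scores
```

Because the surrounding context is identical across the switched versions of a sentence, a gap between two origins' scores under this kind of comparison would be attributable to the entities themselves.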
Abstract
Named entity recognition systems perform well on standard datasets comprising English news. But given the paucity of data, it is difficult to draw conclusions about the robustness of systems with respect to recognizing a diverse set of entities. We propose a method for auditing the in-domain robustness of systems, focusing specifically on differences in performance due to the national origin of entities. We create entity-switched datasets, in which named entities in the original texts are replaced by plausible named entities of the same type but of different national origin. We find that state-of-the-art systems' performance varies widely even in-domain: In the same context, entities from certain origins are more reliably recognized than entities from elsewhere. Systems perform best on American and Indian entities, and worst on Vietnamese and Indonesian entities. This auditing approach…
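The entity-switching step described in the abstract can be sketched roughly as below. This is an illustrative sketch under stated assumptions, not the authors' pipeline: it assumes BIO-tagged tokens, and the `REPLACEMENTS` table, its example names, and the `switch_entities` function are all hypothetical.

```python
import random

# Hypothetical replacement entities keyed by (entity type, origin).
# The names are illustrative placeholders, not data from the paper.
REPLACEMENTS = {
    ("PER", "Vietnamese"): [["Nguyen", "Van", "An"]],
    ("PER", "Indian"): [["Priya", "Sharma"]],
}


def switch_entities(tokens, tags, origin, rng):
    """Replace each BIO-tagged entity with one of the same type from `origin`.

    Entities with no replacement available for their type are left unchanged.
    Tags are regenerated so they stay aligned when the new entity has a
    different number of tokens than the original.
    """
    out_tokens, out_tags = [], []
    i = 0
    while i < len(tokens):
        tag = tags[i]
        if tag.startswith("B-"):
            etype = tag[2:]
            # Find the end of this entity span.
            j = i + 1
            while j < len(tokens) and tags[j] == "I-" + etype:
                j += 1
            candidates = REPLACEMENTS.get((etype, origin))
            new = rng.choice(candidates) if candidates else tokens[i:j]
            out_tokens.extend(new)
            out_tags.extend(["B-" + etype] + ["I-" + etype] * (len(new) - 1))
            i = j
        else:
            out_tokens.append(tokens[i])
            out_tags.append(tag)
            i += 1
    return out_tokens, out_tags


tokens = ["John", "Smith", "visited", "Paris", "."]
tags = ["B-PER", "I-PER", "O", "B-LOC", "O"]
switched_tokens, switched_tags = switch_entities(
    tokens, tags, "Vietnamese", random.Random(0)
)
```

Keeping the non-entity tokens untouched is what makes the audit in-domain: only the entity changes, so the context a model sees is the same for every origin.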
Taxonomy
Topics: Topic Modeling · Data Quality and Management · Access Control and Trust
