NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity   Recognition

Elena Merdjanovska; Ansar Aynetdinov; Alan Akbik

arXiv:2405.07609·cs.CL·October 15, 2024·1 cites

NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition

Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik

PDF

Open Access 1 Repo 1 Video

TL;DR

NoiseBench is a new benchmark for evaluating how real-world label noise affects NER models, revealing that real noise is more challenging than simulated noise and current models struggle with it.

Contribution

This paper introduces NoiseBench, a benchmark with real noise types for NER, and analyzes the impact of real versus simulated noise on model performance.

Findings

01

Real noise is significantly more challenging than simulated noise.

02

State-of-the-art noise-robust models underperform on real noise.

03

Current models fall short of their theoretical upper bounds on noisy data.

Abstract

Available training data for named entity recognition (NER) often contains a significant percentage of incorrect labels for entity types and entity boundaries. Such label noise poses challenges for supervised learning and may significantly deteriorate model quality. To address this, prior work proposed various noise-robust learning approaches capable of learning from data with partially incorrect labels. These approaches are typically evaluated using simulated noise where the labels in a clean dataset are automatically corrupted. However, as we show in this paper, this leads to unrealistic noise that is far easier to handle than real noise caused by human error or semi-automatic annotation. To enable the study of the impact of various types of real noise, we introduce NoiseBench, an NER benchmark consisting of clean training data corrupted with 6 types of real noise, including expert…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

elenamer/noisebench
noneOfficial

Videos

NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques