Budget Sensitive Reannotation of Noisy Relation Classification Data Using Label Hierarchy
Akshay Parekh, Ashish Anand, Amit Awekar

TL;DR
This paper proposes budget-aware reannotation strategies for noisy relation classification datasets using label hierarchy, improving error detection efficiency and revealing inflated model performance on TACRED.
Contribution
It introduces a novel reannotation approach that leverages label hierarchy and a reannotation budget to efficiently identify annotation errors in RC datasets.
Findings
Strategies are effective in detecting new annotation errors.
Reannotation improves the accuracy of relation classification models.
Current model performance on TACRED is overestimated due to noise.
Abstract
Large crowd-sourced datasets are often noisy and relation classification (RC) datasets are no exception. Reannotating the entire dataset is one probable solution however it is not always viable due to time and budget constraints. This paper addresses the problem of efficient reannotation of a large noisy dataset for the RC. Our goal is to catch more annotation errors in the dataset while reannotating fewer instances. Existing work on RC dataset reannotation lacks the flexibility about how much data to reannotate. We introduce the concept of a reannotation budget to overcome this limitation. The immediate follow-up problem is: Given a specific reannotation budget, which subset of the data should we reannotate? To address this problem, we present two strategies to selectively reannotate RC datasets. Our strategies utilize the taxonomic hierarchy of relation labels. The intuition of our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Imbalanced Data Classification Techniques · Machine Learning and Data Classification
