TL;DR
This paper treats crowdsourced annotations as domain-specific data, applying domain adaptation techniques to improve named entity recognition, achieving state-of-the-art results with minimal expert annotations.
Contribution
It introduces an annotator-aware domain adaptation approach for crowdsourced NER, bridging crowdsourcing and domain adaptation methods for improved learning.
Findings
Achieves state-of-the-art NER performance on a crowdsourced dataset.
Effective with minimal expert annotations under supervised setting.
Demonstrates the viability of domain adaptation techniques in crowdsourcing contexts.
Abstract
Crowdsourcing is regarded as one prospective solution for effective supervised learning, aiming to build large-scale annotated training data by crowd workers. Previous studies focus on reducing the influences from the noises of the crowdsourced annotations for supervised models. We take a different point in this work, regarding all crowdsourced annotations as gold-standard with respect to the individual annotators. In this way, we find that crowdsourcing could be highly similar to domain adaptation, and then the recent advances of cross-domain methods can be almost directly applied to crowdsourcing. Here we take named entity recognition (NER) as a study case, suggesting an annotator-aware representation learning model that inspired by the domain adaptation methods which attempt to capture effective domain-aware features. We investigate both unsupervised and supervised crowdsourcing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBERT
