Temporal-aware Language Representation Learning From Crowdsourced Labels
Yang Hao, Xiao Zhai, Wenbiao Ding, Zitao Liu

TL;DR
This paper introduces TACMA, a simple yet effective heuristic for learning language representations from crowdsourced labels by modeling observer variability and aggregating confidence scores, improving accuracy over existing methods.
Contribution
The paper proposes TACMA, a novel temporal-aware heuristic that explicitly models intra- and inter-observer variability in crowdsourced labels for language representation learning.
Findings
Outperforms state-of-the-art baselines in accuracy and AUC
Effective on both synthetic and real-world datasets
Easy to implement with minimal code
Abstract
Learning effective language representations from crowdsourced labels is crucial for many real-world machine learning tasks. A challenging aspect of this problem is that the quality of crowdsourced labels suffer high intra- and inter-observer variability. Since the high-capacity deep neural networks can easily memorize all disagreements among crowdsourced labels, directly applying existing supervised language representation learning algorithms may yield suboptimal solutions. In this paper, we propose \emph{TACMA}, a \underline{t}emporal-\underline{a}ware language representation learning heuristic for \underline{c}rowdsourced labels with \underline{m}ultiple \underline{a}nnotators. The proposed approach (1) explicitly models the intra-observer variability with attention mechanism; (2) computes and aggregates per-sample confidence scores from multiple workers to address the inter-observer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Machine Learning and Data Classification · Data Stream Mining Techniques
