TL;DR
ConStance is a model that improves stance classification by analyzing how different annotation contexts influence label quality and consistency, leading to better predictions and insights into annotation biases.
Contribution
It introduces ConStance, a novel model that jointly estimates true labels and context effects in annotation data for stance detection.
Findings
ConStance outperforms baseline classifiers in political stance prediction.
The model reveals how varying context influences annotation reliability.
Providing optimal context improves annotation quality and model accuracy.
Abstract
Manual annotations are a prerequisite for many applications of machine learning. However, weaknesses in the annotation process itself are easy to overlook. In particular, scholars often choose what information to give to annotators without examining these decisions empirically. For subjective tasks such as sentiment analysis, sarcasm, and stance detection, such choices can impact results. Here, for the task of political stance detection on Twitter, we show that providing too little context can result in noisy and uncertain annotations, whereas providing too strong a context may cause it to outweigh other signals. To characterize and reduce these biases, we develop ConStance, a general model for reasoning about annotations across information conditions. Given conflicting labels produced by multiple annotators seeing the same instances with different contexts, ConStance simultaneously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
