Subjective Crowd Disagreements for Subjective Data: Uncovering Meaningful CrowdOpinion with Population-level Learning
Tharindu Cyril Weerasooriya, Sarah Luger, Saloni Poddar, Ashiqur R., KhudaBukhsh, Christopher M. Homan

TL;DR
This paper introduces CrowdOpinion, an unsupervised learning approach that leverages language features and label distributions to better understand and utilize annotator disagreements in subjective data, improving the analysis of crowd opinions.
Contribution
The paper presents a novel unsupervised clustering method, CrowdOpinion, for analyzing subjective crowd disagreements using language features and label distributions, applicable to various datasets.
Findings
CrowdOpinion effectively pools similar items into larger label distribution samples.
The approach improves understanding of minority and underrepresented opinions.
It performs well across multiple datasets with varying disagreement levels.
Abstract
Human-annotated data plays a critical role in the fairness of AI systems, including those that deal with life-altering decisions or moderating human-created web/social media content. Conventionally, annotator disagreements are resolved before any learning takes place. However, researchers are increasingly identifying annotator disagreement as pervasive and meaningful. They also question the performance of a system when annotators disagree. Particularly when minority views are disregarded, especially among groups that may already be underrepresented in the annotator population. In this paper, we introduce \emph{CrowdOpinion}\footnote{Accepted for publication at ACL 2023}, an unsupervised learning based approach that uses language features and label distributions to pool similar items into larger samples of label distributions. We experiment with four generative and one density-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Complex Network Analysis Techniques · Sentiment Analysis and Opinion Mining
