Learning Who Disagrees: Demographic Importance Weighting for Modeling Annotator Distributions with DiADEM
Samay U. Shetty, Tharindu Cyril Weerasooriya, Deepak Pandita, Christopher M. Homan

TL;DR
This paper introduces DiADEM, a neural model that predicts annotator disagreement by modeling demographic influences, outperforming large language models and revealing key demographic factors like race and age.
Contribution
DiADEM is a novel neural architecture that explicitly encodes demographic importance, improving prediction of human disagreement in subjective NLP tasks.
Findings
DiADEM achieves a disagreement prediction correlation of 0.75 on DICES.
Race and age are identified as the most influential demographic factors.
Explicit demographic modeling outperforms LLM-based approaches in capturing disagreement.
Abstract
When humans label subjective content, they disagree, and that disagreement is not noise. It reflects genuine differences in perspective shaped by annotators' social identities and lived experiences. Yet standard practice still flattens these judgments into a single majority label, and recent LLM-based approaches fare no better: we show that prompted large language models, even with chain-of-thought reasoning, fail to recover the structure of human disagreement. We introduce DiADEM, a neural architecture that learns "how much each demographic axis matters" for predicting who will disagree and on what. DiADEM encodes annotators through per-demographic projections governed by a learned importance vector , fuses annotator and item representations via complementary concatenation and Hadamard interactions, and is trained with a novel item-level disagreement loss that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
