Modeling Annotator Disagreement with Demographic-Aware Experts and Synthetic Perspectives

Yinuo Xu; Veronica Derricks; Allison Earl; David Jurgens

arXiv:2508.02853·cs.CL·November 6, 2025

Modeling Annotator Disagreement with Demographic-Aware Experts and Synthetic Perspectives

Yinuo Xu, Veronica Derricks, Allison Earl, David Jurgens

PDF

TL;DR

This paper introduces DEM-MoE, a demographic-aware model for capturing annotator disagreement in subjective NLP tasks, and explores synthetic data generation via LLMs to enhance training data diversity.

Contribution

It presents DEM-MoE, a novel architecture that incorporates demographic information for better modeling of annotator disagreement, and investigates synthetic annotations for data augmentation.

Findings

01

DEM-MoE outperforms prior models across demographic groups.

02

Synthetic annotations from LLMs align moderately with human judgments.

03

Blending real and synthetic data improves model performance depending on dataset structure.

Abstract

We present an approach to modeling annotator disagreement in subjective NLP tasks through both architectural and data-centric innovations. Our model, DEM-MoE (Demographic-Aware Mixture of Experts), routes inputs to expert subnetworks based on annotator demographics, enabling it to better represent structured, group-level variation compared to prior models. DEM-MoE consistently performs competitively across demographic groups, and shows especially strong results on datasets with high annotator disagreement. To address sparse demographic coverage, we test whether LLM-generated synthetic annotations via zero-shot persona prompting can be used for data imputation. We show these synthetic judgments align moderately well with human annotations on our data and offer a scalable way to potentially enrich training data. We then propose and evaluate approaches for blending real and synthetic data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.