Human Preferences in Large Language Model Latent Space: A Technical Analysis on the Reliability of Synthetic Data in Voting Outcome Prediction

Sarah Ball; Simeon Allmendinger; Frauke Kreuter; Niklas Kühl

arXiv:2502.16280·cs.LG·February 25, 2025

Open Access

TL;DR

This paper critically examines the reliability of large language models in generating synthetic survey data, revealing significant limitations in replicating human opinion variance and demographic nuances, especially in political contexts.

Contribution

It introduces a probe-based methodology to analyze how LLMs encode political affiliations and demonstrates the systematic distortions affecting their use in social science research.

Findings

01 LLMs fail to replicate real-world response variance

02 Synthetic data shows limited demographic differentiation

03 Prompt sensitivity impacts output stability
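As a rough illustration of the first finding, the sketch below compares how spread out two sets of voting responses are, using normalized Shannon entropy. It is not the paper's evaluation method: the metric, the function name `normalized_entropy`, the party labels, and the toy response counts are all assumptions made up for this example.

```python
import math
from collections import Counter


def normalized_entropy(responses, options):
    """Shannon entropy of the answer distribution, scaled to [0, 1].

    0 means every respondent chose the same option; 1 means the answers
    are spread evenly over all options. (Illustrative metric only.)
    """
    counts = Counter(responses)
    total = len(responses)
    entropy = 0.0
    for option in options:
        p = counts[option] / total
        if p > 0:
            entropy -= p * math.log(p)
    return entropy / math.log(len(options))


# Hypothetical party labels and toy data, not taken from the paper.
PARTIES = ["A", "B", "C", "D"]
human = ["A"] * 30 + ["B"] * 30 + ["C"] * 25 + ["D"] * 15
synthetic = ["A"] * 90 + ["B"] * 10

human_spread = normalized_entropy(human, PARTIES)
synthetic_spread = normalized_entropy(synthetic, PARTIES)

print(f"human spread:     {human_spread:.3f}")
print(f"synthetic spread: {synthetic_spread:.3f}")
```

In this toy setup the synthetic sample concentrates on one option, so its spread is lower than the human sample's. Running the same comparison within each demographic subgroup would be one way to probe the second finding.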

Abstract

Generative AI (GenAI) is increasingly used in survey contexts to simulate human preferences. While many research endeavors evaluate the quality of synthetic GenAI data by comparing model-generated responses to gold-standard survey results, fundamental questions about the validity and reliability of using LLMs as substitutes for human respondents remain. Our study provides a technical analysis of how demographic attributes and prompt variations influence latent opinion mappings in large language models (LLMs) and evaluates their suitability for survey-based predictions. Using 14 different models, we find that LLM-generated data fails to replicate the variance observed in real-world human responses, particularly across demographic subgroups. In the political space, persona-to-party mappings exhibit limited differentiation, resulting in synthetic data that lacks the nuanced distribution of…

Taxonomy

Topics: Computational and Text Analysis Methods