Measuring Self-Rating Bias in LLM-Generated Survey Data: A Semantic Similarity Framework for Independent Scale Mapping
Eduardo Vera Pichardo

TL;DR
This paper introduces a semantic similarity framework to calibrate and validate LLM-generated survey data, addressing self-rating bias and circularity issues through embedding-based cosine similarity with behavioral anchors.
Contribution
It proposes a novel SSR method that decouples generation from scale mapping, demonstrating improved calibration and cross-model generalization in synthetic survey data.
Findings
SSR achieves 65-67% exact match and 91% within ±1 accuracy.
Naturalistic anchors outperform formal jargon by 29 percentage points.
SSR shows robust cross-model generalization with 77% exact match using OpenAI embeddings.
Abstract
Synthetic survey data generated by large language models (LLMs) suffers from a fundamental circularity: the same model family that generates text responses also maps them to numerical scales. We calibrate and validate Semantic Similarity Rating (SSR; Maier et al., 2024), which decouples generation from scale mapping via embedding-based cosine similarity against predefined anchor statements. Configuration experiments (N=17 pilot, N=69 cross-validation across 8 domains) show that naturalistic behavioral anchors outperform formal jargon by 29 percentage points (pp), and that SSR achieves 65-67% exact match and 91% within plus/minus 1; a cross-model test with OpenAI text-embedding-3-small reaches 77% exact, confirming cross-provider generalization. Direct LLM baselines (Claude 87%, GPT-4o 83%) establish that SSR's contribution is methodological independence, not accuracy superiority. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Bayesian Methods and Mixture Models
