Measuring Self-Rating Bias in LLM-Generated Survey Data: A Semantic Similarity Framework for Independent Scale Mapping

Eduardo Vera Pichardo

arXiv:2602.13862·physics.soc-ph·February 24, 2026

Measuring Self-Rating Bias in LLM-Generated Survey Data: A Semantic Similarity Framework for Independent Scale Mapping

Eduardo Vera Pichardo

PDF

Open Access

TL;DR

This paper introduces a semantic similarity framework to calibrate and validate LLM-generated survey data, addressing self-rating bias and circularity issues through embedding-based cosine similarity with behavioral anchors.

Contribution

It proposes a novel SSR method that decouples generation from scale mapping, demonstrating improved calibration and cross-model generalization in synthetic survey data.

Findings

01

SSR achieves 65-67% exact match and 91% within ±1 accuracy.

02

Naturalistic anchors outperform formal jargon by 29 percentage points.

03

SSR shows robust cross-model generalization with 77% exact match using OpenAI embeddings.

Abstract

Synthetic survey data generated by large language models (LLMs) suffers from a fundamental circularity: the same model family that generates text responses also maps them to numerical scales. We calibrate and validate Semantic Similarity Rating (SSR; Maier et al., 2024), which decouples generation from scale mapping via embedding-based cosine similarity against predefined anchor statements. Configuration experiments (N=17 pilot, N=69 cross-validation across 8 domains) show that naturalistic behavioral anchors outperform formal jargon by 29 percentage points (pp), and that SSR achieves 65-67% exact match and 91% within plus/minus 1; a cross-model test with OpenAI text-embedding-3-small reaches 77% exact, confirming cross-provider generalization. Direct LLM baselines (Claude 87%, GPT-4o 83%) establish that SSR's contribution is methodological independence, not accuracy superiority. A…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Bayesian Methods and Mixture Models