Mitigating Social Desirability Bias in Random Silicon Sampling
Sashank Chapala, Maksym Mironov, Songgaojun Deng

TL;DR
This paper investigates how minimal, psychologically grounded prompt wording can reduce social desirability bias in Large Language Model-based population sampling, improving alignment with real human data.
Contribution
It introduces and tests four prompt-based methods to mitigate social desirability bias in LLM responses, demonstrating reformulated prompts as most effective.
Findings
Reformulated prompts significantly improve alignment with human data.
Reverse-coded prompts show mixed effectiveness.
Priming and Preamble methods do not systematically reduce bias.
Abstract
Large Language Models (LLMs) are increasingly used to simulate population responses, a method known as ``Silicon Sampling''. However, responses to socially sensitive questions frequently exhibit Social Desirability Bias (SDB), diverging from real human data toward socially acceptable answers. Existing studies on social desirability bias in LLM-based sampling remain limited. In this work, we investigate whether minimal, psychologically grounded prompt wording can mitigate this bias and improve alignment between silicon and human samples. We conducted a study using data from the American National Election Study (ANES) on three LLMs from two model families: the open-source Llama-3.1 series and GPT-4.1-mini. We first replicate a baseline silicon sampling study, confirming the persistent Social Desirability Bias. We then test four prompt-based mitigation methods: \emph{reformulated}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurvey Methodology and Nonresponse · Mental Health via Writing · Computational and Text Analysis Methods
