Simulating Field Experiments with Large Language Models
Yaoyu Chen, Yuheng Hu, Yingda Lu

TL;DR
This paper explores using large language models to simulate field experiments through novel prompting strategies, showing promising alignment with real results and highlighting current limitations in social science research contexts.
Contribution
Introduces two new prompting strategies, observer and participant modes, for LLMs to simulate field experiments, expanding their application beyond lab environments.
Findings
66% accuracy in observer mode
Encouraging alignment with actual results in certain scenarios
Identifies topics where LLMs underperform, such as gender and social norms
Abstract
Prevailing large language models (LLMs) are capable of human responses simulation through its unprecedented content generation and reasoning abilities. However, it is not clear whether and how to leverage LLMs to simulate field experiments. In this paper, we propose and evaluate two prompting strategies: the observer mode that allows a direct prediction on main conclusions and the participant mode that simulates distributions of responses from participants. Using this approach, we examine fifteen well cited field experimental papers published in INFORMS and MISQ, finding encouraging alignments between simulated experimental results and the actual results in certain scenarios. We further identify topics of which LLMs underperform, including gender difference and social norms related research. Additionally, the automatic and standardized workflow proposed in this paper enables the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
