Simulating Field Experiments with Large Language Models

Yaoyu Chen; Yuheng Hu; Yingda Lu

arXiv:2408.09682·cs.AI·August 20, 2024

Simulating Field Experiments with Large Language Models

Yaoyu Chen, Yuheng Hu, Yingda Lu

PDF

Open Access

TL;DR

This paper explores using large language models to simulate field experiments through novel prompting strategies, showing promising alignment with real results and highlighting current limitations in social science research contexts.

Contribution

Introduces two new prompting strategies, observer and participant modes, for LLMs to simulate field experiments, expanding their application beyond lab environments.

Findings

01

66% accuracy in observer mode

02

Encouraging alignment with actual results in certain scenarios

03

Identifies topics where LLMs underperform, such as gender and social norms

Abstract

Prevailing large language models (LLMs) are capable of human responses simulation through its unprecedented content generation and reasoning abilities. However, it is not clear whether and how to leverage LLMs to simulate field experiments. In this paper, we propose and evaluate two prompting strategies: the observer mode that allows a direct prediction on main conclusions and the participant mode that simulates distributions of responses from participants. Using this approach, we examine fifteen well cited field experimental papers published in INFORMS and MISQ, finding encouraging alignments between simulated experimental results and the actual results in certain scenarios. We further identify topics of which LLMs underperform, including gender difference and social norms related research. Additionally, the automatic and standardized workflow proposed in this paper enables the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling