Exploring Large Language Model Agents for Piloting Social Experiments
Jinghua Piao, Yuwei Yan, Nian Li, Jun Zhang, and Yong Li

TL;DR
This paper introduces a novel framework using large language models as agents to simulate social experiments, enhancing the realism and effectiveness of computational social science research.
Contribution
It presents the first comprehensive framework for deploying LLM-driven agents in social experiments, integrating social science theories and experimental tools.
Findings
Replicated three social experiments with strong alignment to real-world data.
Demonstrated the effectiveness of LLM agents in piloting social interventions.
Showcased the potential of LLMs to transform computational social science.
Abstract
Computational social experiments, which typically employ agent-based modeling to create testbeds for piloting social experiments, not only provide a computational solution to the major challenges faced by traditional experimental methods, but have also gained widespread attention in various research fields. Despite their significance, their broader impact is largely limited by the underdeveloped intelligence of their core component, i.e., agents. To address this limitation, we develop a framework grounded in well-established social science theories and practices, consisting of three key elements: (i) large language model (LLM)-driven experimental agents, serving as "silicon participants", (ii) methods for implementing various interventions or treatments, and (iii) tools for collecting behavioral, survey, and interview data. We evaluate its effectiveness by replicating three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Computational and Text Analysis Methods · Topic Modeling
