'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants
Shivani Kapania, William Agnew, Motahhare Eslami, Hoda Heidari, Sarah, Fox

TL;DR
This paper explores the use of large language models as substitutes for human participants in qualitative research, highlighting ethical, epistemological, and technical limitations through interviews with researchers.
Contribution
It provides an empirical investigation into researchers' perspectives on LLMs as qualitative research proxies, revealing fundamental concerns and limitations.
Findings
Researchers initially skeptical but found LLMs produce similar narratives.
LLMs lack participant consent and contextual depth.
Use of LLMs raises ethical and epistemological issues.
Abstract
The recent excitement around generative models has sparked a wave of proposals suggesting the replacement of human participation and labor in research and development--e.g., through surveys, experiments, and interviews--with synthetic research data generated by large language models (LLMs). We conducted interviews with 19 qualitative researchers to understand their perspectives on this paradigm shift. Initially skeptical, researchers were surprised to see similar narratives emerge in the LLM-generated data when using the interview probe. However, over several conversational turns, they went on to identify fundamental limitations, such as how LLMs foreclose participants' consent and agency, produce responses lacking in palpability and contextual depth, and risk delegitimizing qualitative research methods. We argue that the use of LLMs as proxies for participants enacts the surrogate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Analysis and Archiving · Qualitative Research Methods and Applications · Computational and Text Analysis Methods
