Can Large Language Models Revolutionize Survey Research? Experiments with Disaster Preparedness Responses

Yan Wang; Ziyi Guo; Christopher McCarty

arXiv:2605.19229·cs.AI·May 20, 2026

Can Large Language Models Revolutionize Survey Research? Experiments with Disaster Preparedness Responses

Yan Wang, Ziyi Guo, Christopher McCarty

PDF

TL;DR

This study evaluates the potential of large language models to improve survey research, especially in disaster contexts, by integrating them into a five-stage workflow and demonstrating their advantages over traditional methods.

Contribution

The paper introduces a novel framework for LLM integration in survey workflows and develops a theory-informed LLM configuration that outperforms classical imputation methods in disaster-related surveys.

Findings

01

A-TLM achieves lower RMSE than classical baselines under MNAR conditions.

02

Retrieval-augmented models outperform unstructured retrieval in bias and accuracy.

03

Subgroup bias auditing is proposed as a new reporting standard.

Abstract

Survey research faces mounting structural challenges: declining response rates, sample bias, block-wise missingness among at-risk respondents, and AI-assisted fraudulent completions in online panels. Large language models (LLMs) have been proposed as a remedy, yet rigorous evaluations across the full survey workflow remain scarce, particularly in disaster contexts where data quality matters most. We present and evaluate a five-stage framework for LLM integration covering questionnaire design, sample selection, pilot testing, missing-data imputation, and post-collection analysis, using the 2024 Hurricane Milton preparedness survey of Florida residents (n=946) as a shared empirical testbed. We introduce a Protection Motivation Theory (PMT)-constrained co-occurrence knowledge graph and develop seven LLM configurations spanning zero-shot inference, retrieval-augmented baselines, and novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.