An Investigation on How AI-Generated Responses Affect Software Engineering Surveys
Ronnie de Souza Santos, Italo Santos, Maria Teresa Baldassarre, Cleyton Magalhaes, Mairieli Wessel

TL;DR
This paper investigates how AI-generated responses affect the validity of software engineering surveys, describes methods for detecting them, and argues that automated and interpretive verification must be combined to maintain research integrity.
Contribution
It identifies patterns of AI misuse in surveys and proposes a hybrid approach for detecting fabricated responses to ensure data authenticity.
Findings
49 responses showed signs of synthetic authorship
AI-generated responses mimic reasoning but conceal fabricated content
Combining automated and interpretive methods improves detection accuracy
Abstract
Survey research is a fundamental empirical method in software engineering, enabling the systematic collection of data on professional practices, perceptions, and experiences. However, recent advances in large language models (LLMs) have introduced new risks to survey integrity, as participants can use generative tools to fabricate or manipulate their responses. This study explores how LLMs are being misused in software engineering surveys and investigates the methodological implications of such behavior for data authenticity, validity, and research integrity. We collected data from two survey deployments conducted in 2025 through the Prolific platform and analyzed the content of participants' answers to identify irregular or falsified responses. A subset of responses suspected of being AI generated was examined through qualitative pattern inspection, narrative characterization, and…
Taxonomy
Topics: Ethics and Social Impacts of AI · Mobile Crowdsensing and Crowdsourcing · Artificial Intelligence in Healthcare and Education
