Identifying and Analyzing Bot-Generated Responses in Online Health Care Surveys: Methodological Study
Emily Hamovitch, Kaileah McKellar, Walter P Wodchis

TL;DR
This study develops methods to detect bot-generated responses in online health surveys and shows how bots can distort data and research conclusions.
Contribution
The paper introduces a 3-tier classification system for bot detection in health surveys and demonstrates its impact on data validity.
Findings
58% of survey responses were classified as suspected bot-generated.
Suspected bots showed response patterns centered on Likert scales, unlike probable humans.
Correlations between health indicators were reversed in bot-generated data, indicating compromised validity.
Abstract
The increasing reliance on online surveys for collecting patient-reported feedback for health care research has led to growing concerns over fraudulent responses generated by bots. These automated responses threaten data integrity by fabricating survey results, distorting statistical analyses, and potentially misguiding policy decisions. Addressing this issue is critical for maintaining the validity of research findings that inform health care practice and policy. This study aimed to develop a robust set of criteria for identifying bot-generated responses in online health care surveys and to examine how these responses impact data quality. We then explored differences in survey results between probable human and suspected bot respondents in a survey assessing patient-reported outcome measures and patient-reported experience measures within a geographic region in Ontario, Canada. A…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · AI in Service Interactions · Digital Mental Health Interventions
