Identifying and Analyzing Bot-Generated Responses in Online Health Care Surveys: Methodological Study

Emily Hamovitch; Kaileah McKellar; Walter P Wodchis

PMC · DOI:10.2196/73622·March 5, 2026

Identifying and Analyzing Bot-Generated Responses in Online Health Care Surveys: Methodological Study

Emily Hamovitch, Kaileah McKellar, Walter P Wodchis

PDF

Open Access

TL;DR

This study develops methods to detect bot-generated responses in online health surveys and shows how bots can distort data and research conclusions.

Contribution

The paper introduces a 3-tier classification system for bot detection in health surveys and demonstrates its impact on data validity.

Findings

01

58% of survey responses were classified as suspected bot-generated.

02

Suspected bots showed response patterns centered on Likert scales, unlike probable humans.

03

Correlations between health indicators were reversed in bot-generated data, indicating compromised validity.

Abstract

The increasing reliance on online surveys for collecting patient-reported feedback for health care research has led to growing concerns over fraudulent responses generated by bots. These automated responses threaten data integrity by fabricating survey results, distorting statistical analyses, and potentially misguiding policy decisions. Addressing this issue is critical for maintaining the validity of research findings that inform health care practice and policy. This study aimed to develop a robust set of criteria for identifying bot-generated responses in online health care surveys and to examine how these responses impact data quality. We then explored differences in survey results between probable human and suspected bot respondents in a survey assessing patient-reported outcome measures and patient-reported experience measures within a geographic region in Ontario, Canada. A…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases1

depression

Figures3

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpam and Phishing Detection · AI in Service Interactions · Digital Mental Health Interventions