Generating medical screening questionnaires through analysis of social media data
Ortal Ashkenazi, Elad Yom-Tov, Liron Vardi David

TL;DR
This study explores the feasibility of automatically generating medical screening questionnaires from social media data by analyzing user posts to identify symptom patterns and create decision rules validated by medical experts.
Contribution
The paper introduces a novel method to generate screening questionnaires from social media posts, validated through correlation with medical doctor scores for three conditions.
Findings
Generated questionnaires showed moderate correlation with doctor assessments.
The method successfully identified symptom clusters relevant to each condition.
Automated process can potentially reduce time and cost in questionnaire development.
Abstract
Screening questionnaires are used in medicine as a diagnostic aid. Creating them is a long and expensive process, which could potentially be improved through analysis of social media posts related to symptoms and behaviors prior to diagnosis. Here we show a preliminary investigation into the feasibility of generating screening questionnaires for a given medical condition from social media postings. The method first identifies a cohort of relevant users through their posts in dedicated patient groups and a control group of users who reported similar symptoms but did not report being diagnosed with the condition of interest. Posts made prior to diagnosis are used to generate decision rules to differentiate between the different groups, by clustering symptoms mentioned by these users and training a decision tree to differentiate between the two groups. We validate the generated rules by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Social Media in Health Education · Data-Driven Disease Surveillance
