Recovering Patient Journeys: A Corpus of Biomedical Entities and Relations on Twitter (BEAR)
Amelie W\"uhrl, Roman Klinger

TL;DR
This paper introduces a comprehensive Twitter corpus annotated with biomedical entities and relations to better understand patient experiences and journeys, filling a gap in social media medical data analysis.
Contribution
It presents a novel, richly annotated corpus of 2,100 tweets with 14 entity types and 20 relation types focused on patient-centered medical narratives.
Findings
Over 80% of tweets contain relevant biomedical entities.
More than 50% of tweets express key relations for patient journey analysis.
The dataset enables detailed modeling of patient experiences on social media.
Abstract
Text mining and information extraction for the medical domain has focused on scientific text generated by researchers. However, their direct access to individual patient experiences or patient-doctor interactions can be limited. Information provided on social media, e.g., by patients and their relatives, complements the knowledge in scientific text. It reflects the patient's journey and their subjective perspective on the process of developing symptoms, being diagnosed and offered a treatment, being cured or learning to live with a medical condition. The value of this type of data is therefore twofold: Firstly, it offers direct access to people's perspectives. Secondly, it might cover information that is not available elsewhere, including self-treatment or self-diagnoses. Named entity recognition and relation extraction are methods to structure information that is available in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Wikis in Education and Collaboration
