Health App Reviews for Privacy & Trust (HARPT): A Corpus for Analyzing Patient Privacy Concerns, Trust in Providers and Trust in Applications
Timoteo Kelly, Abdulkadir Korkmaz, Samuel Mallet, Connor Souders, Sadra Aliakbarpour, Praveen Rao

TL;DR
This paper introduces HARPT, a large-scale, annotated corpus of patient reviews from eHealth apps, enabling systematic NLP analysis of privacy and trust concerns, and provides benchmark models to facilitate future research.
Contribution
The study develops and releases HARPT, a comprehensive annotated dataset of 480,000 reviews, with benchmarks for machine learning models, advancing research in patient privacy and trust in health apps.
Findings
HARPT contains 480,000 reviews annotated across trust and privacy categories.
Benchmark results establish baseline performance for NLP models on privacy and trust detection.
The dataset supports reproducible research in health informatics privacy and trust.
Abstract
Background: User reviews of Telehealth and Patient Portal mobile applications (apps) hereon referred to as electronic health (eHealth) apps are a rich source of unsolicited patient feedback, revealing critical insights into patient perceptions. However, the lack of large-scale, annotated datasets specific to privacy and trust has limited the ability of researchers to systematically analyze these concerns using natural language processing (NLP) techniques. Objective: This study aims to develop and benchmark Health App Reviews for Privacy & Trust (HARPT), a large-scale annotated corpus of patient reviews from eHealth apps to advance research in patient privacy and trust. Methods: We employed a multistage data construction strategy. This integrated keyword-based filtering, iterative manual labeling with review, targeted data augmentation, and weak supervision using transformer-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy, Security, and Data Protection · Mobile Health and mHealth Applications · Digital Mental Health Interventions
