# Understanding the Engagement and Interaction of Superusers and Regular Users in UK Respiratory Online Health Communities: Deep Learning–Based Sentiment Analysis

**Authors:** Xiancheng Li, Emanuela Vaghi, Gabriella Pasi, Neil S Coulson, Anna De Simoni, Marco Viviani

PMC · DOI: 10.2196/56038 · Journal of Medical Internet Research · 2025-02-13

## TL;DR

This study uses deep learning to analyze how superusers and regular users interact in UK respiratory online communities, finding that superusers promote positive sentiment in discussions.

## Contribution

The study introduces a deep learning-based sentiment analysis approach to explore the role of superusers in fostering positive interactions in respiratory online health communities.

## Key findings

- Superusers tend to write shorter, more positive posts compared to regular users.
- Threads initiated by superusers encourage regular users to respond with positive sentiment.
- Superusers defined by activity or spreading ability largely overlap, showing they are both active and influential.

## Abstract

Online health communities (OHCs) enable people with long-term conditions (LTCs) to exchange peer self-management experiential information, advice, and support. Engagement of “superusers,” that is, highly active users, plays a key role in holding together the community and ensuring an effective exchange of support and information. Further studies are needed to explore regular users’ interactions with superusers, their sentiments during interactions, and their ultimate impact on the self-management of LTCs.

This study aims to gain a better understanding of sentiment distribution and the dynamic of sentiment of posts from 2 respiratory OHCs, focusing on regular users’ interaction with superusers.

We conducted sentiment analysis on anonymized data from 2 UK respiratory OHCs hosted by Asthma UK (AUK), and the British Lung Foundation (BLF) charities between 2006-2016 and 2012-2016, respectively, using the Bio-Bidirectional Encoder Representation from Transformers (BioBERT), a pretrained language representation model. Given the scarcity of health-related labeled datasets, BioBERT was fine-tuned on the COVID-19 Twitter Dataset. Positive, neutral, and negative sentiments were categorized as 1, 0, and –1, respectively. The average sentiment of aggregated posts by regular users and superusers was then calculated. Superusers were identified based on a definition already used in our previous work (ie, “the 1% users with the largest number of posts over the observation period”) and VoteRank, (ie, users with the best spreading ability). Sentiment analyses of posts by superusers defined with both approaches were conducted for correlation.

The fine-tuned BioBERT model achieved an accuracy of 0.96. The sentiment of posts was predominantly positive (60% and 65% of overall posts in AUK and BLF, respectively), remaining stable over the years. Furthermore, there was a tendency for sentiment to become more positive over time. Overall, superusers tended to write shorter posts characterized by positive sentiment (63% and 67% of all posts in AUK and BLF, respectively). Superusers defined by posting activity or VoteRank largely overlapped (61% in AUK and 79% in BLF), showing that users who posted the most were also spreaders. Threads initiated by superusers typically encouraged regular users to reply with positive sentiments. Superusers tended to write positive replies in threads started by regular users whatever the type of sentiment of the starting post (ie, positive, neutral, or negative), compared to the replies by other regular users (62%, 51%, 61% versus 55%, 45%, 50% in AUK; 71%, 62%, 64% versus 65%, 56%, 57% in BLF, respectively; P<.001, except for neutral sentiment in AUK, where P=.36).

Network and sentiment analyses provide insight into the key sustaining role of superusers in respiratory OHCs, showing they tend to write and trigger regular users’ posts characterized by positive sentiment.

## Linked entities

- **Diseases:** Asthma (MONDO:0004979), Lung disease (MONDO:0005275)

## Full-text entities

- **Diseases:** Asthma (MESH:D001249), COVID-19 (MESH:D000086382), LTCs (MESH:D000088562)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11888069/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11888069/full.md

## References

39 references — full list in the complete paper: https://tomesphere.com/paper/PMC11888069/full.md

---
Source: https://tomesphere.com/paper/PMC11888069