# Personalized insights into liver disease management: a text mining analysis of online consultation data

**Authors:** Kun Xiang, Danxi Shi

PMC · DOI: 10.3389/fpubh.2025.1467117 · Frontiers in Public Health · 2025-05-09

## TL;DR

This study uses text mining on online consultations to uncover patterns in liver disease management and highlights the need for personalized care strategies.

## Contribution

A novel integrated text mining framework combining KeyBERT, BERT-CRF, and association rule mining for liver disease insights from online consultations.

## Key findings

- The framework achieved high F1-scores in medical entity recognition (0.89–0.91) and keyword extraction (0.87).
- Significant clinical associations were identified with lift values between 2.2 and 4.5.
- Stratified analyses revealed demographic variations in liver disease patterns and progression.

## Abstract

Liver diseases pose a significant global health burden with complex management challenges. Online health consultation platforms provide a valuable resource of unstructured patient-physician interactions. This study applies an integrated text mining framework to extract insights from this data, aiming to inform liver disease research and care strategies.

We analyzed 8,149 liver disease-related online consultation records from a leading Chinese health platform. The analytical framework integrated KeyBERT-enhanced keyword extraction with traditional approaches (TF-IDF, TextRank), BERT-CRF medical entity recognition, topic modeling (LDA), and association rule mining. Expert validation by hepatology specialists provided clinical verification of extracted patterns. Stratified analyses across demographic factors and disease types identified subgroup-specific patterns.

Text mining analyses demonstrated robust performance in medical terminology extraction (KeyBERT F1-score: 0.87), identified key topic patterns in liver disease consultations through enhanced entity recognition (F1-scores: 0.89–0.91), and revealed significant clinical associations through comprehensive rule mining (lift: 2.2–4.5). Stratified analyses further highlighted notable demographic variations in disease patterns and progression pathways.

This study validates the effectiveness of integrated text mining approaches in uncovering clinically relevant patterns from online consultation data, with particular strength in medical entity recognition and association detection. The robust methodological framework provides empirical support for differentiated approaches in liver disease management, while demographic variations in disease patterns underscore the necessity for personalized clinical strategies. However, translation of these findings into clinical practice requires longitudinal validation studies integrating multiple data sources.

## Linked entities

- **Diseases:** liver disease (MONDO:0005154)

## Full-text entities

- **Diseases:** Liver diseases (MESH:D008107)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12098495/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12098495/full.md

## References

60 references — full list in the complete paper: https://tomesphere.com/paper/PMC12098495/full.md

---
Source: https://tomesphere.com/paper/PMC12098495