Topic-aware Large Language Models for Summarizing the Lived Healthcare Experiences Described in Health Stories

Maneesh Bilalpur; Megan Hamm; Young Ji Lee; Natasha Norman; Kathleen M. McTigue; Yanshan Wang

arXiv:2510.24765 · cs.CY · October 30, 2025

TL;DR

This study demonstrates that combining topic modeling with hierarchical LLM-based summarization effectively captures and summarizes African American healthcare narratives, providing insights for research and clinical improvements.

Contribution

The paper introduces a novel approach integrating LDA and LLMs for topic-aware summarization of healthcare stories, validated by expert and GPT4 assessments.

Findings

01 26 topics identified in 50 stories

02 Summaries rated highly accurate and useful

03 Moderate to high agreement between GPT4 and experts

Abstract

Storytelling is a powerful form of communication and may provide insights into factors contributing to gaps in healthcare outcomes. To determine whether Large Language Models (LLMs) can identify potential underlying factors and avenues for intervention, we performed topic-aware hierarchical summarization of narratives from African American (AA) storytellers. Fifty transcribed stories of AA experiences were used to identify topics in their experience using the Latent Dirichlet Allocation (LDA) technique. Stories about a given topic were summarized using an open-source LLM-based hierarchical summarization approach. Topic summaries were generated by summarizing across story summaries for each story that addressed a given topic. Generated topic summaries were rated for fabrication, accuracy, comprehensiveness, and usefulness by the GPT4 model, and the model's reliability was validated…
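The topic-aware hierarchical summarization described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the LDA step has already produced per-story topic weights, and it takes the summarizer as a plain callable standing in for the open-source LLM. All names (`chunk_text`, `hierarchical_summary`, `topic_summaries`, `first_words`), the `threshold` cutoff, and the word-based chunking are hypothetical choices made for this example.

```python
from collections import defaultdict
from typing import Callable, Dict, List

# Hypothetical stand-in type for an LLM call: text in, shorter text out.
Summarizer = Callable[[str], str]


def chunk_text(text: str, max_words: int) -> List[str]:
    """Split text into consecutive chunks of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words])
            for i in range(0, len(words), max_words)]


def hierarchical_summary(text: str, summarize: Summarizer,
                         max_words: int = 200) -> str:
    """Summarize each chunk, then summarize the merged chunk summaries."""
    chunks = chunk_text(text, max_words)
    if len(chunks) <= 1:
        return summarize(text)
    merged = " ".join(summarize(chunk) for chunk in chunks)
    # Guard (an assumption of this sketch): stop recursing if the
    # summarizer failed to shorten the text, to avoid looping forever.
    if len(merged.split()) >= len(text.split()):
        return summarize(merged)
    return hierarchical_summary(merged, summarize, max_words)


def topic_summaries(stories: Dict[str, str],
                    story_topics: Dict[str, Dict[int, float]],
                    summarize: Summarizer,
                    threshold: float = 0.2,
                    max_words: int = 200) -> Dict[int, str]:
    """Build one summary per topic from the summaries of its stories.

    story_topics maps story id -> {topic id: weight}, e.g. from LDA.
    A story counts toward a topic when its weight is >= threshold
    (the cutoff is an assumption, not a value from the paper).
    """
    story_summary = {sid: hierarchical_summary(text, summarize, max_words)
                     for sid, text in stories.items()}
    by_topic: Dict[int, List[str]] = defaultdict(list)
    for sid, weights in story_topics.items():
        for topic, weight in weights.items():
            if weight >= threshold:
                by_topic[topic].append(story_summary[sid])
    return {topic: hierarchical_summary(" ".join(parts), summarize, max_words)
            for topic, parts in sorted(by_topic.items())}


def first_words(text: str, n: int = 3) -> str:
    """Toy summarizer for demonstration only: keep the first n words."""
    return " ".join(text.split()[:n])
```

In practice `first_words` would be replaced by a function that prompts an LLM, and the topic weights would come from a fitted LDA model. Keeping the summarizer as a parameter lets the same two-level structure (story summaries first, then topic summaries across stories) be exercised without any model dependency.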
