Topic-aware Large Language Models for Summarizing the Lived Healthcare Experiences Described in Health Stories
Maneesh Bilalpur, Megan Hamm, Young Ji Lee, Natasha Norman, Kathleen M. McTigue, Yanshan Wang

TL;DR
This study demonstrates that combining topic modeling with hierarchical LLM-based summarization effectively captures and summarizes African American healthcare narratives, providing insights for research and clinical improvements.
Contribution
The paper introduces a novel approach integrating LDA and LLMs for topic-aware summarization of healthcare stories, validated by expert and GPT4 assessments.
Findings
26 topics identified in 50 stories
Summaries rated highly accurate and useful
Moderate to high agreement between GPT4 and experts
Abstract
Storytelling is a powerful form of communication and may provide insights into factors contributing to gaps in healthcare outcomes. To determine whether Large Language Models (LLMs) can identify potential underlying factors and avenues for intervention, we performed topic-aware hierarchical summarization of narratives from African American (AA) storytellers. Fifty transcribed stories of AA experiences were used to identify topics in their experience using the Latent Dirichlet Allocation (LDA) technique. Stories about a given topic were summarized using an open-source LLM-based hierarchical summarization approach. Topic summaries were generated by summarizing across story summaries for each story that addressed a given topic. Generated topic summaries were rated for fabrication, accuracy, comprehensiveness, and usefulness by the GPT4 model, and the model's reliability was validated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
