Theme-driven Keyphrase Extraction to Analyze Social Media Discourse
William Romano, Omar Sharif, Madhusudan Basak, Joseph Gatto, and Sarah, Preum

TL;DR
This paper presents a novel theme-driven keyphrase extraction framework tailored for social media health content, demonstrating its effectiveness in extracting clinically relevant keyphrases, especially for opioid use disorder discussions, with large language models outperforming traditional methods.
Contribution
Introduces a new theme-driven keyphrase extraction approach, a curated dataset from Reddit, and evaluates the performance of large language models like ChatGPT for social media health text analysis.
Findings
Large language model (ChatGPT) outperforms unsupervised models in keyphrase extraction.
Developed the first dataset of human-annotated health keyphrases from social media.
Proved the feasibility of minimally supervised NLP models for extracting actionable health insights.
Abstract
Social media platforms are vital resources for sharing self-reported health experiences, offering rich data on various health topics. Despite advancements in Natural Language Processing (NLP) enabling large-scale social media data analysis, a gap remains in applying keyphrase extraction to health-related content. Keyphrase extraction is used to identify salient concepts in social media discourse without being constrained by predefined entity classes. This paper introduces a theme-driven keyphrase extraction framework tailored for social media, a pioneering approach designed to capture clinically relevant keyphrases from user-generated health texts. Themes are defined as broad categories determined by the objectives of the extraction task. We formulate this novel task of theme-driven keyphrase extraction and demonstrate its potential for efficiently mining social media text for the use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
