Large Language Models' Varying Accuracy in Recognizing Risk-Promoting and Health-Supporting Sentiments in Public Health Discourse: The Cases of HPV Vaccination and Heated Tobacco Products
Soojong Kim, Kwanho Kim, Hye Min Kim

TL;DR
This study evaluates the accuracy of three large language models in detecting risk-promoting and health-supporting sentiments in public health discussions about HPV vaccination and heated tobacco products, revealing platform and topic-dependent discrepancies.
Contribution
It provides a comparative analysis of GPT, Gemini, and LLAMA in sentiment detection across two health issues, highlighting their strengths and limitations in real-world social media data.
Findings
Models show higher accuracy for risk-promoting messages on Facebook.
Health-supporting messages are more accurately detected on Twitter.
LLMs face challenges in reliably detecting neutral messages.
Abstract
Machine learning methods are increasingly applied to analyze health-related public discourse based on large-scale data, but questions remain regarding their ability to accurately detect different types of health sentiments. Especially, Large Language Models (LLMs) have gained attention as a powerful technology, yet their accuracy and feasibility in capturing different opinions and perspectives on health issues are largely unexplored. Thus, this research examines how accurate the three prominent LLMs (GPT, Gemini, and LLAMA) are in detecting risk-promoting versus health-supporting sentiments across two critical public health topics: Human Papillomavirus (HPV) vaccination and heated tobacco products (HTPs). Drawing on data from Facebook and Twitter, we curated multiple sets of messages supporting or opposing recommended health behaviors, supplemented with human annotations as the gold…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
