Large Language Models' Varying Accuracy in Recognizing Risk-Promoting and Health-Supporting Sentiments in Public Health Discourse: The Cases of HPV Vaccination and Heated Tobacco Products

Soojong Kim; Kwanho Kim; Hye Min Kim

arXiv:2507.04364·cs.CL·July 8, 2025

Large Language Models' Varying Accuracy in Recognizing Risk-Promoting and Health-Supporting Sentiments in Public Health Discourse: The Cases of HPV Vaccination and Heated Tobacco Products

Soojong Kim, Kwanho Kim, Hye Min Kim

PDF

TL;DR

This study evaluates the accuracy of three large language models in detecting risk-promoting and health-supporting sentiments in public health discussions about HPV vaccination and heated tobacco products, revealing platform and topic-dependent discrepancies.

Contribution

It provides a comparative analysis of GPT, Gemini, and LLAMA in sentiment detection across two health issues, highlighting their strengths and limitations in real-world social media data.

Findings

01

Models show higher accuracy for risk-promoting messages on Facebook.

02

Health-supporting messages are more accurately detected on Twitter.

03

LLMs face challenges in reliably detecting neutral messages.

Abstract

Machine learning methods are increasingly applied to analyze health-related public discourse based on large-scale data, but questions remain regarding their ability to accurately detect different types of health sentiments. Especially, Large Language Models (LLMs) have gained attention as a powerful technology, yet their accuracy and feasibility in capturing different opinions and perspectives on health issues are largely unexplored. Thus, this research examines how accurate the three prominent LLMs (GPT, Gemini, and LLAMA) are in detecting risk-promoting versus health-supporting sentiments across two critical public health topics: Human Papillomavirus (HPV) vaccination and heated tobacco products (HTPs). Drawing on data from Facebook and Twitter, we curated multiple sets of messages supporting or opposing recommended health behaviors, supplemented with human annotations as the gold…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.