Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data
Yuting Guo, Anthony Ovadje, Mohammed Ali Al-Garadi, Abeed Sarker

TL;DR
This study evaluates large language models for health-related social media text classification, demonstrating that data augmentation with LLMs enhances lightweight supervised models, while LLMs excel as zero-shot classifiers in reducing human annotation effort.
Contribution
It introduces a novel approach of using LLMs for data augmentation and zero-shot classification in health-related social media NLP tasks, outperforming traditional methods.
Findings
Data augmentation with GPT-4 improves model performance.
Supervised models outperform LLMs in zero-shot settings.
LLMs as zero-shot classifiers help reduce false negatives.
Abstract
Large language models (LLMs) have demonstrated remarkable success in NLP tasks. However, there is a paucity of studies that attempt to evaluate their performances on social media-based health-related natural language processing tasks, which have traditionally been difficult to achieve high scores in. We benchmarked one supervised classic machine learning model based on Support Vector Machines (SVMs), three supervised pretrained language models (PLMs) based on RoBERTa, BERTweet, and SocBERT, and two LLM based classifiers (GPT3.5 and GPT4), across 6 text classification tasks. We developed three approaches for leveraging LLMs for text classification: employing LLMs as zero-shot classifiers, us-ing LLMs as annotators to annotate training data for supervised classifiers, and utilizing LLMs with few-shot examples for augmentation of manually annotated data. Our comprehensive experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text and Document Classification Technologies
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · WordPiece · Linear Layer · Absolute Position Encodings · Attention Dropout · Linear Warmup With Linear Decay · Position-Wise Feed-Forward Layer
