Evaluating Large Language Models for Health-Related Text Classification   Tasks with Public Social Media Data

Yuting Guo; Anthony Ovadje; Mohammed Ali Al-Garadi; Abeed Sarker

arXiv:2403.19031·cs.CL·March 29, 2024·1 cites

Evaluating Large Language Models for Health-Related Text Classification Tasks with Public Social Media Data

Yuting Guo, Anthony Ovadje, Mohammed Ali Al-Garadi, Abeed Sarker

PDF

Open Access

TL;DR

This study evaluates large language models for health-related social media text classification, demonstrating that data augmentation with LLMs enhances lightweight supervised models, while LLMs excel as zero-shot classifiers in reducing human annotation effort.

Contribution

It introduces a novel approach of using LLMs for data augmentation and zero-shot classification in health-related social media NLP tasks, outperforming traditional methods.

Findings

01

Data augmentation with GPT-4 improves model performance.

02

Supervised models outperform LLMs in zero-shot settings.

03

LLMs as zero-shot classifiers help reduce false negatives.

Abstract

Large language models (LLMs) have demonstrated remarkable success in NLP tasks. However, there is a paucity of studies that attempt to evaluate their performances on social media-based health-related natural language processing tasks, which have traditionally been difficult to achieve high scores in. We benchmarked one supervised classic machine learning model based on Support Vector Machines (SVMs), three supervised pretrained language models (PLMs) based on RoBERTa, BERTweet, and SocBERT, and two LLM based classifiers (GPT3.5 and GPT4), across 6 text classification tasks. We developed three approaches for leveraging LLMs for text classification: employing LLMs as zero-shot classifiers, us-ing LLMs as annotators to annotate training data for supervised classifiers, and utilizing LLMs with few-shot examples for augmentation of manually annotated data. Our comprehensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · WordPiece · Linear Layer · Absolute Position Encodings · Attention Dropout · Linear Warmup With Linear Decay · Position-Wise Feed-Forward Layer