Zero-Shot Classification of Crisis Tweets Using Instruction-Finetuned Large Language Models

Emma McDaniel; Samuel Scheele; Jeff Liu

arXiv:2410.00182·cs.CL·August 29, 2025

Zero-Shot Classification of Crisis Tweets Using Instruction-Finetuned Large Language Models

Emma McDaniel, Samuel Scheele, Jeff Liu

PDF

Open Access

TL;DR

This paper evaluates the zero-shot classification capabilities of three large language models on crisis tweets, focusing on identifying informativeness and humanitarian categories, revealing performance variability and dataset quality issues.

Contribution

It introduces an assessment of instruction-finetuned large language models for zero-shot crisis tweet classification, highlighting their varying performance and dataset challenges.

Findings

01

Models perform better on informativeness without extra info.

02

Providing event context improves humanitarian classification.

03

Performance varies significantly across datasets.

Abstract

Social media posts are frequently identified as a valuable source of open-source intelligence for disaster response, and pre-LLM NLP techniques have been evaluated on datasets of crisis tweets. We assess three commercial large language models (OpenAI GPT-4o, Gemini 1.5-flash-001 and Anthropic Claude-3-5 Sonnet) capabilities in zero-shot classification of short social media posts. In one prompt, the models are asked to perform two classification tasks: 1) identify if the post is informative in a humanitarian context; and 2) rank and provide probabilities for the post in relation to 16 possible humanitarian classes. The posts being classified are from the consolidated crisis tweet dataset, CrisisBench. Results are evaluated using macro, weighted, and binary F1-scores. The informative classification task, generally performed better without extra information, while for the humanitarian…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPublic Relations and Crisis Communication · Sentiment Analysis and Opinion Mining · Misinformation and Its Impacts