Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data

Poli Apollinaire Nemkova; Solomon Ubani; Mark V. Albert

arXiv:2505.10260·cs.CL·May 16, 2025

Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data

Poli Apollinaire Nemkova, Solomon Ubani, Mark V. Albert

PDF

Open Access 1 Repo

TL;DR

This study evaluates the annotation capabilities of various large language models in identifying human rights violations in social media posts in Russian and Ukrainian, comparing their performance to human annotations.

Contribution

It provides a comprehensive comparison of LLMs' effectiveness in complex, multilingual annotation tasks, highlighting their strengths and limitations in sensitive domains.

Findings

01

LLMs show varying accuracy in identifying human rights violations.

02

Prompt language influences model performance.

03

Models exhibit distinct error patterns and cross-linguistic differences.

Abstract

In the era of increasingly sophisticated natural language processing (NLP) systems, large language models (LLMs) have demonstrated remarkable potential for diverse applications, including tasks requiring nuanced textual understanding and contextual reasoning. This study investigates the capabilities of multiple state-of-the-art LLMs - GPT-3.5, GPT-4, LLAMA3, Mistral 7B, and Claude-2 - for zero-shot and few-shot annotation of a complex textual dataset comprising social media posts in Russian and Ukrainian. Specifically, the focus is on the binary classification task of identifying references to human rights violations within the dataset. To evaluate the effectiveness of these models, their annotations are compared against a gold standard set of human double-annotated labels across 1000 samples. The analysis includes assessing annotation performance under different prompting conditions,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PoliNemkova/LLM_labeling_skills
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Hate Speech and Cyberbullying Detection · Law, AI, and Intellectual Property

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Label Smoothing · Byte Pair Encoding · Attention Dropout · Softmax · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Linear Layer · Transformer