Rethinking Emotion Annotations in the Era of Large Language Models
Minxue Niu, Yara El-Tawil, Amrit Romana, Emily Mower Provost

TL;DR
This paper evaluates GPT-4's effectiveness in emotion annotation, highlighting its potential to supplement human labels, reduce annotation workload, and improve emotion dataset quality in affective computing.
Contribution
It introduces methods for integrating GPT-4 into emotion annotation workflows, demonstrating its ability to identify low-quality labels and enhance annotation efficiency.
Findings
GPT-4 achieves high ratings in emotion perception tasks.
GPT-4 can flag low-quality labels to improve dataset quality.
Using GPT-4 reduces human annotation workload and enhances downstream model performance.
Abstract
Modern affective computing systems rely heavily on datasets with human-annotated emotion labels, for training and evaluation. However, human annotations are expensive to obtain, sensitive to study design, and difficult to quality control, because of the subjective nature of emotions. Meanwhile, Large Language Models (LLMs) have shown remarkable performance on many Natural Language Understanding tasks, emerging as a promising tool for text annotation. In this work, we analyze the complexities of emotion annotation in the context of LLMs, focusing on GPT-4 as a leading model. In our experiments, GPT-4 achieves high ratings in a human evaluation study, painting a more positive picture than previous work, in which human labels served as the only ground truth. On the other hand, we observe differences between human and GPT-4 emotion perception, underscoring the importance of human input in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
MethodsAttention Is All You Need · Adam · Dropout · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Label Smoothing
