Are Expert-Level Language Models Expert-Level Annotators?

Yu-Min Tseng; Wei-Lin Chen; Chung-Chi Chen; Hsin-Hsi Chen

arXiv:2410.03254·cs.CL·October 7, 2024

Open Access

TL;DR

This paper systematically evaluates large language models (LLMs) as expert-level data annotators across three highly specialized domains and discusses their practical use from a cost-effectiveness perspective.

Contribution

To the authors' knowledge, this is the first systematic evaluation of LLMs as expert-level data annotators in specialized domains, with practical suggestions framed around cost-effectiveness.

Findings

01 LLMs can perform at expert levels in certain specialized annotation tasks.

02 Performance varies significantly across different domains and tasks.

03 Practical guidelines for deploying LLMs as expert annotators are proposed.

Abstract

Data annotation refers to the labeling or tagging of textual data with relevant information. A large body of work has reported positive results on leveraging LLMs as an alternative to human annotators. However, existing studies focus on classic NLP tasks, and how well LLMs perform as data annotators in domains requiring expert knowledge remains underexplored. In this work, we investigate comprehensive approaches across three highly specialized domains and discuss practical suggestions from a cost-effectiveness perspective. To the best of our knowledge, we present the first systematic evaluation of LLMs as expert-level data annotators.

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)

Methods: Focus