Evaluating Named Entity Recognition Using Few-Shot Prompting with Large   Language Models

H\'edi Zeghidi; Ludovic Moncla

arXiv:2408.15796·cs.IR·September 5, 2024

Evaluating Named Entity Recognition Using Few-Shot Prompting with Large Language Models

H\'edi Zeghidi, Ludovic Moncla

PDF

Open Access 1 Repo

TL;DR

This paper assesses the effectiveness of few-shot prompting with large language models like GPT-4 for Named Entity Recognition, highlighting their ability to adapt to new entities with minimal data and exploring factors influencing performance.

Contribution

It provides a comprehensive evaluation of large language models' few-shot NER capabilities, comparing them to traditional methods and analyzing prompt engineering effects.

Findings

01

Large models perform well with limited data

02

Prompt engineering significantly impacts results

03

Models adapt to new domains and entity types

Abstract

This paper evaluates Few-Shot Prompting with Large Language Models for Named Entity Recognition (NER). Traditional NER systems rely on extensive labeled datasets, which are costly and time-consuming to obtain. Few-Shot Prompting or in-context learning enables models to recognize entities with minimal examples. We assess state-of-the-art models like GPT-4 in NER tasks, comparing their few-shot performance to fully supervised benchmarks. Results show that while there is a performance gap, large models excel in adapting to new entity types and domains with very limited data. We also explore the effects of prompt engineering, guided output format and context length on performance. This study underscores Few-Shot Learning's potential to reduce the need for large labeled datasets, enhancing NER scalability and accessibility.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

geode-project/ner-llm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies

MethodsLinear Layer · Adam · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Dense Connections · Residual Connection · Multi-Head Attention · Byte Pair Encoding · Absolute Position Encodings