A Cooperative Multi-Agent Framework for Zero-Shot Named Entity Recognition
Zihan Wang, Ziqi Zhao, Yougang Lyu, Zhumin Chen, Maarten de Rijke,, Zhaochun Ren

TL;DR
This paper introduces CMAS, a multi-agent framework that enhances zero-shot NER by explicitly modeling context correlations and controlling demonstration use, leading to significant performance improvements across multiple benchmarks.
Contribution
The paper proposes a novel multi-agent system for zero-shot NER that explicitly captures context correlations and manages demonstration influence, addressing key challenges in the field.
Findings
CMAS outperforms existing methods on six benchmarks.
It improves zero-shot NER performance in domain-specific and general scenarios.
Effective in few-shot settings and with various LLM backbones.
Abstract
Zero-shot named entity recognition (NER) aims to develop entity recognition systems from unannotated text corpora. This task presents substantial challenges due to minimal human intervention. Recent work has adapted large language models (LLMs) for zero-shot NER by crafting specialized prompt templates. It advances model self-learning abilities by incorporating self-annotated demonstrations. However, two important challenges persist: (i) Correlations between contexts surrounding entities are overlooked, leading to wrong type predictions or entity omissions. (ii) The indiscriminate use of task demonstrations, retrieved through shallow similarity-based strategies, severely misleads LLMs during inference. In this paper, we introduce the cooperative multi-agent system (CMAS), a novel framework for zero-shot NER that uses the collective intelligence of multiple agents to address the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques
MethodsSelf-Learning
