BANER: Boundary-Aware LLMs for Few-Shot Named Entity Recognition
Quanjiang Guo, Yihong Dong, Ling Tian, Zhao Kang, Yu Zhang, Sijie, Wang

TL;DR
This paper introduces BANER, a boundary-aware approach using contrastive learning and domain adaptation techniques to improve few-shot named entity recognition with large language models, achieving superior performance across benchmarks.
Contribution
The paper presents a novel boundary-aware contrastive learning strategy and domain alignment method for LLMs, significantly enhancing few-shot NER performance.
Findings
Outperforms prior methods on various benchmarks
Effective across multiple LLM architectures
Improves entity boundary perception and cross-domain classification
Abstract
Despite the recent success of two-stage prototypical networks in few-shot named entity recognition (NER), challenges such as over/under-detected false spans in the span detection stage and unaligned entity prototypes in the type classification stage persist. Additionally, LLMs have not proven to be effective few-shot information extractors in general. In this paper, we propose an approach called Boundary-Aware LLMs for Few-Shot Named Entity Recognition to address these issues. We introduce a boundary-aware contrastive learning strategy to enhance the LLM's ability to perceive entity boundaries for generalized entity spans. Additionally, we utilize LoRAHub to align information from the target domain to the source domain, thereby enhancing adaptive cross-domain classification capabilities. Extensive experiments across various benchmarks demonstrate that our framework outperforms prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
MethodsContrastive Learning · ALIGN
