Improving Few-Shot Cross-Domain Named Entity Recognition by Instruction Tuning a Word-Embedding based Retrieval Augmented Large Language Model
Subhadip Nandi, Neeraj Agrawal

TL;DR
This paper introduces IF-WRANER, a retrieval-augmented, instruction-finetuned large language model that significantly improves few-shot cross-domain NER performance, reducing the need for domain-specific model finetuning and enabling practical deployment.
Contribution
The paper presents a novel retrieval-augmented LLM with instruction finetuning and word-level embedding retrieval, outperforming previous SOTA methods in few-shot cross-domain NER tasks.
Findings
Over 2% F1 score improvement on CrossNER dataset.
Reduced customer escalation rates by nearly 15%.
Enabled practical deployment across multiple enterprise domains.
Abstract
Few-Shot Cross-Domain NER is the process of leveraging knowledge from data-rich source domains to perform entity recognition on data scarce target domains. Most previous state-of-the-art (SOTA) approaches use pre-trained language models (PLMs) for cross-domain NER. However, these models are often domain specific. To successfully use these models for new target domains, we need to modify either the model architecture or perform model finetuning using data from the new domains. Both of these result in the creation of entirely new NER models for each target domain which is infeasible for practical scenarios. Recently,several works have attempted to use LLMs to solve Few-Shot Cross-Domain NER. However, most of these are either too expensive for practical purposes or struggle to follow LLM prompt instructions. In this paper, we propose IF-WRANER (Instruction Finetuned Word-embedding based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text and Document Classification Technologies
