Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models
Zhuojun Ding, Wei Wei, Chenghao Fan

TL;DR
The paper introduces the SaM framework that dynamically selects and merges domain-specific expert models at inference time to improve the adaptability and scalability of named entity recognition across multiple domains without additional training.
Contribution
The SaM framework enables dynamic expert selection and merging for NER, enhancing domain adaptation and scalability without retraining models.
Findings
Outperforms unified models by an average of 10% across benchmarks.
Effective in improving generalization across various domains.
Flexible in adding or removing experts for scalability.
Abstract
Supervised fine-tuning (SFT) is widely used to align large language models (LLMs) with information extraction (IE) tasks, such as named entity recognition (NER). However, annotating such fine-grained labels and training domain-specific models is costly. Existing works typically train a unified model across multiple domains, but such approaches lack adaptation and scalability since not all training data benefits target domains and scaling trained models remains challenging. We propose the SaM framework, which dynamically Selects and Merges expert models at inference time. Specifically, for a target domain, we select domain-specific experts pre-trained on existing domains based on (i) domain similarity to the target domain and (ii) performance on sampled instances, respectively. The experts are then merged to create task-specific models optimized for the target domain. By dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning
MethodsALIGN · Segment Anything Model
