TL;DR
This paper introduces a neural approach for few-shot named entity recognition that utilizes label semantics via dual BERT encoders, significantly improving performance in low-resource scenarios.
Contribution
It proposes a novel architecture that encodes label semantics to enhance few-shot NER, achieving state-of-the-art results and robustness in low-resource settings.
Findings
Improved few-shot NER performance across multiple benchmarks.
Effective use of label semantics boosts low-resource NER accuracy.
Achieves on-par results with standard benchmarks.
Abstract
We study the problem of few shot learning for named entity recognition. Specifically, we leverage the semantic information in the names of the labels as a way of giving the model additional signal and enriched priors. We propose a neural architecture that consists of two BERT encoders, one to encode the document and its tokens and another one to encode each of the labels in natural language format. Our model learns to match the representations of named entities computed by the first encoder with label representations computed by the second encoder. The label semantics signal is shown to support improved state-of-the-art results in multiple few shot NER benchmarks and on-par performance in standard benchmarks. Our model is especially effective in low resource settings.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Dropout
