Label Semantics for Few Shot Named Entity Recognition

Jie Ma; Miguel Ballesteros; Srikanth Doss; Rishita Anubhai; Sunil; Mallya; Yaser Al-Onaizan; Dan Roth

arXiv:2203.08985·cs.CL·March 18, 2022

Label Semantics for Few Shot Named Entity Recognition

Jie Ma, Miguel Ballesteros, Srikanth Doss, Rishita Anubhai, Sunil, Mallya, Yaser Al-Onaizan, Dan Roth

PDF

1 Repo

TL;DR

This paper introduces a neural approach for few-shot named entity recognition that utilizes label semantics via dual BERT encoders, significantly improving performance in low-resource scenarios.

Contribution

It proposes a novel architecture that encodes label semantics to enhance few-shot NER, achieving state-of-the-art results and robustness in low-resource settings.

Findings

01

Improved few-shot NER performance across multiple benchmarks.

02

Effective use of label semantics boosts low-resource NER accuracy.

03

Achieves on-par results with standard benchmarks.

Abstract

We study the problem of few shot learning for named entity recognition. Specifically, we leverage the semantic information in the names of the labels as a way of giving the model additional signal and enriched priors. We propose a neural architecture that consists of two BERT encoders, one to encode the document and its tokens and another one to encode each of the labels in natural language format. Our model learns to match the representations of named entities computed by the first encoder with label representations computed by the second encoder. The label semantics signal is shown to support improved state-of-the-art results in multiple few shot NER benchmarks and on-par performance in standard benchmarks. Our model is especially effective in low resource settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TianWuYuJiangHenShou/LabelSemantics
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · WordPiece · Dropout