ProtoNER: Few shot Incremental Learning for Named Entity Recognition   using Prototypical Networks

Ritesh Kumar; Saurabh Goyal; Ashish Verma; Vatche Isahagian

arXiv:2310.02372·cs.CL·October 5, 2023

ProtoNER: Few shot Incremental Learning for Named Entity Recognition using Prototypical Networks

Ritesh Kumar, Saurabh Goyal, Ashish Verma, Vatche Isahagian

PDF

Open Access

TL;DR

ProtoNER is a few-shot incremental learning model for Named Entity Recognition that efficiently adds new classes with minimal data, avoiding retraining on the entire dataset and maintaining performance on old classes.

Contribution

ProtoNER introduces a prototypical network-based approach enabling incremental addition of classes in NER without dataset dependency or synthetic data generation.

Findings

01

Achieves similar performance with 30 samples as traditional models with 2600 samples

02

Eliminates need for dataset re-annotation and synthetic data generation

03

Maintains knowledge of old classes while learning new ones

Abstract

Key value pair (KVP) extraction or Named Entity Recognition(NER) from visually rich documents has been an active area of research in document understanding and data extraction domain. Several transformer based models such as LayoutLMv2, LayoutLMv3, and LiLT have emerged achieving state of the art results. However, addition of even a single new class to the existing model requires (a) re-annotation of entire training dataset to include this new class and (b) retraining the model again. Both of these issues really slow down the deployment of updated model. \\ We present \textbf{ProtoNER}: Prototypical Network based end-to-end KVP extraction model that allows addition of new classes to an existing model while requiring minimal number of newly annotated training samples. The key contributions of our model are: (1) No dependency on dataset used for initial training of the model, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text and Document Classification Technologies · Natural Language Processing Techniques

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Multi-Head Attention · Attention Is All You Need · Average Pooling · Softmax · Grouped Convolution · Global Average Pooling · 1x1 Convolution · ResNeXt Block · Batch Normalization